Zuoyebang Tech Team
Jul 29, 2022 · Artificial Intelligence

Boosting Chinese‑English Code‑Switching Speech Recognition with Language ID and LM Enhancements

This report details a series of experiments on Chinese‑English code‑switching speech recognition. It introduces a language‑identification loss and language‑model integration to improve acoustic modeling and reduce the mixed error rate relative to a baseline end‑to‑end ASR system.

Code-Switching · Speech Recognition · deep learning
16 min read
Volcano Engine Developer Services
Oct 20, 2021 · Artificial Intelligence

How ByteDance’s AI Transforms Music Creation and Discovery on TikTok

ByteDance uses AI models such as SpecTNT, semi‑supervised music tagging transformers, language identification, chord recognition, contrastive representation learning, and source separation to power TikTok’s music library, enabling music‑video interaction, smarter recommendations, and new creative tools for creators worldwide.

audio processing · deep learning · language identification
10 min read