How AI Is Transforming Real‑Time Audio: Insights from LiveVideoStackCon 2020

The LiveVideoStackCon 2020 conference highlighted AI's growing role in audio processing, detailing Dr. Hao Yiya's modular AI approach, current challenges like computational load and robustness, and NetEase Cloud's data‑driven advancements that are reshaping real‑time communication audio.

NetEase Smart Enterprise Tech+
NetEase Smart Enterprise Tech+
NetEase Smart Enterprise Tech+
How AI Is Transforming Real‑Time Audio: Insights from LiveVideoStackCon 2020

From October 31 to November 1, 2020, LiveVideoStackCon 2020 was held in Beijing under the theme “Multimedia Opens a New Vision”, covering audio, video, and image technologies across many industries.

LiveVideoStackCon conference
LiveVideoStackCon conference

Artificial‑intelligence topics were especially popular. Dr. Hao Yiya, senior audio algorithm engineer at NetEase Cloud, delivered a keynote titled “Productizing AI Audio Algorithms in RTC” and shared practical AI applications in a round‑table.

She explained that AI, driven by CNN, RNN and other deep‑learning models, is increasingly used for audio denoising, scene classification, echo control and blind source separation.

However, AI adoption in real‑time communication (RTC) audio faces three main challenges: computational complexity, limited generalization ability, and robustness under diverse scenarios.

Because of these constraints, AI has not yet replaced traditional signal‑processing methods. Dr. Hao suggested a modular approach: split end‑to‑end processing into separate AI modules, allowing each to be optimized and handled by specialized experts.

Using AI‑based noise‑estimation as an example, a lightweight deep‑learning model can replace the traditional noise‑estimation module, offering a simple training target and better suitability for DNNs.

Looking ahead, she expects more advanced neural‑network architectures and efficient GPUs to drive AI integration in RTC, while massive data collection remains essential. NetEase Cloud already maintains a full data‑collection pipeline and has applied AI to audio denoising and video super‑resolution.

Since 2015, NetEase Cloud has focused on instant‑messaging and audio‑video technologies, serving over one million developers, handling 10 trillion messages, and reaching 300 million daily active users. The company continues to innovate by embedding AI into its upgraded audio‑video calling product.

Original Source

Signed-in readers can open the original source through BestHub's protected redirect.

Sign in to view source
Republication Notice

This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactadmin@besthub.devand we will review it promptly.

AIreal-time communicationAudio Processingnoise reductionmodular AI
NetEase Smart Enterprise Tech+
Written by

NetEase Smart Enterprise Tech+

Get cutting-edge insights from NetEase's CTO, access the most valuable tech knowledge, and learn NetEase's latest best practices. NetEase Smart Enterprise Tech+ helps you grow from a thinker into a tech expert.

0 followers
Reader feedback

How this landed with the community

Sign in to like

Rate this article

Was this worth your time?

Sign in to rate
Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.