Tagged articles
4 articles
Old Meng AI Explorer
Jan 8, 2026 · Artificial Intelligence

How Microsoft’s Open‑Source VibeVoice Gives AI Speech Real Emotion

Microsoft’s open‑source VibeVoice model adds fine‑grained emotional control, multi‑scene styles, and support for over 100 languages to text‑to‑speech. It offers free commercial use, low‑latency local deployment, and detailed parameter settings, letting developers and creators generate expressive, context‑aware audio for videos, audiobooks, chatbots, and more.

AI voice · Deployment · VibeVoice
0 likes · 10 min read
Huolala Tech
Sep 10, 2025 · Artificial Intelligence

How AI Voice Humanization Cuts Call‑Center Costs: ASR, Smart Interrupt & TTS Deep Dive

This article examines how AI‑driven voice humanization (advanced ASR, intelligent interruption, and expressive TTS) addresses high labor costs, efficiency bottlenecks, and inconsistent service quality in inbound and outbound call‑center operations. It presents technical evaluations, optimization strategies, and future research directions.

AI voice · ASR · Customer Service
0 likes · 13 min read
Cognitive Technology Team
Jul 1, 2025 · Artificial Intelligence

How We Built a Live‑Streaming TTS Engine: From Data Pipelines to AI Voice Generation

This article summarizes the practice of building an intelligent digital‑human system. It covers six core modules (LLM content generation, LLM interaction, TTS synthesis, visual driving, audio‑video engineering, and backend services) and details data collection, signal processing, ASR annotation, speaker clustering, model optimization (V1‑V4), evaluation metrics, and future research directions.

AI voice · Audio Processing · LLM
0 likes · 23 min read
Alibaba Cloud Developer
Jun 20, 2019 · Artificial Intelligence

Unlock Cutting-Edge Voice AI: Highlights from Alibaba’s Speech & Signal Processing eBook

This article introduces Alibaba's new e‑book, a collection of five ICASSP‑accepted papers showcasing advances in speech recognition, synthesis, and emotion detection. It details novel models such as DFSMN and A‑LSTM, along with speaker‑adaptation techniques, that dramatically improve speed, model size, and accuracy.

AI voice · Emotion Recognition · ICASSP
0 likes · 6 min read