Tag

voice synthesis

1 views collected around this technical thread.

Amap Tech
Amap Tech
May 27, 2025 · Artificial Intelligence

Gaode Map Custom Voice Pack: End‑to‑End TTS Model Architecture and Deployment

This article explains how Gaode Map leverages lightweight edge TTS models, dual‑autoregressive large‑model data augmentation, and a configurable audio‑processing DAG to enable users to create highly realistic personalized voice packs from just three recorded sentences.

Gaode MapsTTSdata augmentation
0 likes · 8 min read
Gaode Map Custom Voice Pack: End‑to‑End TTS Model Architecture and Deployment
DaTaobao Tech
DaTaobao Tech
Mar 31, 2025 · Artificial Intelligence

AI Audio Generation and Voice Synthesis Practices at Taobao

The article surveys Taobao’s AI‑generated audio pipeline, detailing eight technical papers on image‑to‑video, OpenAI o1, multimodal video, and large‑model voice synthesis, while highlighting advances like VALL‑E, CosyVoice, F5‑TTS, data‑cleaning methods, and e‑commerce applications such as voice‑cloned live streams, multilingual TTS, AI video‑audio integration, and audiobook production.

AI audioTTSdata cleaning
0 likes · 11 min read
AI Audio Generation and Voice Synthesis Practices at Taobao
DataFunTalk
DataFunTalk
May 13, 2022 · Artificial Intelligence

Design and Applications of AI‑to‑AI Conversational Systems in Immersive Virtual Social Environments

This article explores why AI‑to‑AI dialogue is needed, outlines the overall architecture of an AI conversation system, details text‑generation and voice‑synthesis techniques, and examines how such technology can power immersive metaverse social experiences, illustrated with the XiaoIce Island platform.

AI conversationdialogue systemsmetaverse
0 likes · 21 min read
Design and Applications of AI‑to‑AI Conversational Systems in Immersive Virtual Social Environments