Tagged articles

ControlAudio

2 articles · Page 1 of 1

Machine Learning Algorithms & Natural Language Processing

Apr 23, 2026 · Artificial Intelligence

ControlAudio: Script‑Driven, Time‑Precise Text‑to‑Audio Generation Presented at ACL 2026

ControlAudio, a progressive diffusion framework introduced by Tsinghua researchers, unifies text, timing, and phoneme modeling to enable precise control over when sounds occur and what is spoken, achieving superior alignment and intelligibility while preserving high‑fidelity audio generation.

ACL 2026ControlAudioText-to-Audio

0 likes · 11 min read

ControlAudio: Script‑Driven, Time‑Precise Text‑to‑Audio Generation Presented at ACL 2026

Machine Heart

Apr 21, 2026 · Artificial Intelligence

ControlAudio Enables Scripted Timing and Speech Control in Text-to-Audio Generation

ControlAudio, a progressive diffusion model presented at ACL 2026, jointly models text, timing, and phoneme information to achieve precise event timing and intelligible speech in text-to-audio generation, backed by a large mixed real‑synthetic dataset and competitive experimental results.

ControlAudioProgressive DiffusionText-to-Audio

0 likes · 10 min read

ControlAudio Enables Scripted Timing and Speech Control in Text-to-Audio Generation

ControlAudio

ControlAudio: Script‑Driven, Time‑Precise Text‑to‑Audio Generation Presented at ACL 2026

ControlAudio Enables Scripted Timing and Speech Control in Text-to-Audio Generation

ControlAudio: Script‑Driven, Time‑Precise Text‑to‑Audio Generation Presented at ACL 2026