AntTech
Aug 1, 2025 · Artificial Intelligence
How Ant Group Dominated the 2025 DCASE Audio Question Answering Challenge
The article details the 2025 DCASE Audio Question Answering (AQA) track, outlines its technical challenges, describes Ant Group's three‑stage data, model, and training pipeline, presents performance gains of their Qwen2‑Audio‑R1‑8B and Kimi‑Audio‑SFT‑12B models, and outlines future research directions.
Audio Question AnsweringDCASEmodel training
0 likes · 8 min read
