Weekly Large Model Application
May 29, 2026 · Artificial Intelligence
From Direct Transcription to Reasoning ASR and Parallel Decoding: CoT‑ASR vs Whisfusion
ASR is shifting from direct verbatim transcription toward two new paradigms: Chain‑of‑Thought reasoning (CoT‑ASR), which lowers word error rate (WER) and entity error rates, and diffusion‑based parallel decoding (Whisfusion), which cuts latency by more than a factor of eight. The two offer complementary routes to smarter, faster speech recognition.
ASR · Chain-of-Thought · CoT-ASR
12 min read
