Why Do Reasoning LLMs Lose Instruction-Following Ability? A Deep Dive into Recent Findings
This article compares two recent papers that investigate why large reasoning models such as Llama and Qwen show degraded instruction‑following performance when using chain‑of‑thought prompting, analyzing attention patterns, training effects, and proposed mitigation strategies.
