Tagged articles
1 articles
Page 1 of 1
Machine Learning Algorithms & Natural Language Processing
Machine Learning Algorithms & Natural Language Processing
Jun 22, 2026 · Artificial Intelligence

Why Large Language Models Need Not Run CoT on Every Question: Tencent Hunyuan’s On‑Demand CoT Trigger

The paper analyzes the efficiency and reward‑signal shortcomings of conventional generative reward models (GRM) and presents the E‑GRM framework, which uses model‑internal uncertainty to dynamically trigger chain‑of‑thought reasoning, employs a consensus‑based routing decision and a mixed‑loss discriminative scorer, achieving significant speed‑up and accuracy gains on benchmarks such as MATH, RM‑Bench and RewardBench.

Chain-of-ThoughtDynamic RoutingEfficiency
0 likes · 15 min read
Why Large Language Models Need Not Run CoT on Every Question: Tencent Hunyuan’s On‑Demand CoT Trigger