Dec 19, 2025 · Artificial Intelligence

SABER: Switchable and Balanced Training for Efficient LLM Reasoning

SABER introduces a reinforcement‑learning framework that lets large language models dynamically switch among four token‑budgeted reasoning modes, dramatically cutting inference length while preserving or improving accuracy across math, code, and logic tasks.

Budgeted ComputationChain-of-ThoughtEfficient Reasoning

0 likes · 13 min read

SABER: Switchable and Balanced Training for Efficient LLM Reasoning

AntTech

May 31, 2025 · Artificial Intelligence

Machine Reasoning and Deep Thinking: Insights from Ant Financial’s NLP Lead Wu Wei

The article explores how DeepSeek R1 and long‑thinking chains have revived interest in machine reasoning, tracing the evolution of natural‑language models, defining reasoning as logical knowledge composition, and outlining future research directions in efficient reasoning architectures and deep‑thinking applications.

AI researchEfficient ReasoningLarge Language Models

0 likes · 8 min read

Machine Reasoning and Deep Thinking: Insights from Ant Financial’s NLP Lead Wu Wei

Efficient Reasoning

SABER: Switchable and Balanced Training for Efficient LLM Reasoning

Machine Reasoning and Deep Thinking: Insights from Ant Financial’s NLP Lead Wu Wei

Machine Reasoning and Deep Thinking: Insights from Ant Financial’s NLP Lead Wu Wei