Bilibili Tech
Dec 19, 2025 · Artificial Intelligence
SABER: Switchable and Balanced Training for Efficient LLM Reasoning
SABER introduces a reinforcement‑learning framework that lets large language models dynamically switch among four token‑budgeted reasoning modes, dramatically cutting inference length while preserving or improving accuracy across math, code, and logic tasks.
Budgeted ComputationEfficient ReasoningLLM
0 likes · 13 min read
