Amap Tech
Sep 4, 2025 · Artificial Intelligence
How Hierarchical Sampling Boosts Self‑Taught Reasoning in LLMs
HS‑STAR introduces a three‑stage hierarchical sampling framework that identifies high‑utility boundary problems, reallocates computation budget to them, and fine‑tunes large language models, achieving significant accuracy gains on math reasoning benchmarks without extra sampling cost.
HS-STARHierarchical Samplingdifficulty estimation
0 likes · 10 min read
