Amap Tech
Amap Tech
Sep 4, 2025 · Artificial Intelligence

How Hierarchical Sampling Boosts Self‑Taught Reasoning in LLMs

HS‑STAR introduces a three‑stage hierarchical sampling framework that identifies high‑utility boundary problems, reallocates computation budget to them, and fine‑tunes large language models, achieving significant accuracy gains on math reasoning benchmarks without extra sampling cost.

HS-STARHierarchical Samplingdifficulty estimation
0 likes · 10 min read
How Hierarchical Sampling Boosts Self‑Taught Reasoning in LLMs