Machine Learning Algorithms & Natural Language Processing
Apr 29, 2026 · Artificial Intelligence
Dual Engine for Training and Inference: How Princeton’s SD‑ZERO and AggAgent Redefine Complex Reasoning
The article reviews two recent Princeton papers—SD‑ZERO, which introduces self‑revision training and on‑policy self‑distillation to turn a model’s own error traces into dense supervision, and AggAgent, which actively aggregates parallel long‑horizon trajectories—showing how internal trajectory mining can cut compute costs and boost accuracy on challenging math and code benchmarks.
AggAgentOn-Policy Distillationcomplex reasoning
0 likes · 10 min read
