PaperAgent
May 20, 2026 · Artificial Intelligence
AutoTTS Shows How AI Agents Can Outperform Human‑Designed Test‑Time Scaling Strategies
The paper “LLMs Improving LLMs” introduces AutoTTS, an environment where a Claude‑based explorer agent automatically searches test‑time scaling policies, achieving up to 69.5% token savings and superior accuracy on unseen models, all for $39.9 and 160 minutes without any LLM calls during evaluation.
AutoTTSClaudeLLM agents
0 likes · 7 min read
