PaperAgent
Apr 20, 2026 · Artificial Intelligence

How 9 Parallel Claude Agents Surpassed Human Researchers in Weak‑to‑Strong Supervision

Anthropic’s Automated Weak‑to‑Strong Researcher (AAR) system runs nine parallel Claude Opus agents in place of human researchers. It reached a Performance Gap Recovered (PGR) of 0.97 in five days at a cost of about $18,000, a result indicating that AI‑driven automation can outperform humans on well‑defined alignment tasks.

AAR · AI alignment · Agentic AI