PaperAgent
May 2, 2026 · Artificial Intelligence
Can Harnesses Self‑Evolve? Fudan & Peking University’s Agentic Harness Engineering Breakthrough
The paper introduces Agentic Harness Engineering (AHE), showing that a 10‑round evolution improves Coding Agent pass@1 from 69.7% to 77.0% on Terminal‑Bench 2—outperforming Codex‑CLI—and that the evolved harness transfers zero‑shot to SWE‑bench and multiple model families, thanks to three observability pillars.
Ablation StudyAgentic AIBenchmark
0 likes · 11 min read
