AI Frontier Lectures
Jan 21, 2026 · Artificial Intelligence
How AP2O‑Coder Cuts LLM Code Errors by Up to 3% with Adaptive Preference Optimization
The paper introduces AP2O‑Coder, an adaptive progressive preference optimization framework that systematically captures error types, progressively refines LLM code generation, and dynamically adapts training data, achieving up to a 3% pass@k improvement across multiple open‑source models while reducing data requirements.
AP2O-CoderLLMcode generation
0 likes · 11 min read
