AntTech
Apr 1, 2025 · Artificial Intelligence
AReaL‑boba: Open‑Source Reinforcement Learning Training Framework v0.2 with SOTA Performance
The Ant Research Institute and Tsinghua University's Wu Yi team released AReaL‑boba 0.2, an open‑source reinforcement‑learning training framework that dramatically speeds up large‑scale model training, achieves state‑of‑the‑art mathematical reasoning results, and provides all code, data, and scripts for reproducible research.
AILarge ModelsPerformance
0 likes · 5 min read