Machine Heart
Jun 17, 2026 · Artificial Intelligence
Can a 3B Model Rival Opus 4.5 in Programming? Inside China's VibeThinker-3B
VibeThinker-3B, a 3-billion-parameter Chinese-built model, reportedly scores on par with top-tier models such as Opus 4.5 on programming benchmarks, with strong results on the AIME and HMMT math competitions, LiveCodeBench, and LeetCode contests. The results are attributed to its Spectrum-to-Signal training pipeline, Claim-Level reliability evaluation, and multi-stage SFT and RL refinement.
AI research · Claim-Level Reliability · Spectrum-to-Signal
