Tencent Technical Engineering
May 23, 2025 · Artificial Intelligence

Can a 3B Open‑Source Multimodal Model Beat GPT‑4V in Math? A Deep Dive into VLR1‑3B

The preview release of VLR1‑3B, a 3‑billion‑parameter open‑source multimodal model, reports state‑of‑the‑art reasoning on math benchmarks and outperforms many commercial closed‑source models. It also shows promising results on geometry, physics, and general vision tasks, while still exhibiting typical hallucination issues.

Open Source · VLR1-3B · benchmark