Network Intelligence Research Center (NIRC)
Aug 27, 2025 · Artificial Intelligence
Perception‑R1: A Rule‑Based RL Method that Elevates Multimodal Model Vision
Perception‑R1, a post‑training framework that applies rule‑based reinforcement learning to existing multimodal LLMs, dramatically improves visual perception tasks such as grounding, OCR, counting and object detection, as demonstrated by extensive benchmarks and ablation studies.
GRPOMultimodal LLMPerception Policy
0 likes · 10 min read
