Machine Heart
Jul 3, 2026 · Artificial Intelligence
ICML 2026: Enabling Multimodal Large Models to Reason Over Time with the Open‑Source TaRO Framework
The paper introduces the Temporal‑Aware Reasoning Optimization (TaRO) framework, which equips video multimodal large models with time‑aware reasoning through three components: template‑based exploration, a temporal‑sensitivity reward, and progressive curriculum learning. It reports state‑of‑the‑art zero‑shot performance on several video temporal grounding benchmarks, including long‑video datasets.
Multimodal Learning · TaRO · Temporal Reasoning
