Tagged articles

Video Temporal Grounding

1 article · Page 1 of 1
Machine Heart
Jul 3, 2026 · Artificial Intelligence

ICML 2026: Enabling Multimodal Large Models to Reason Over Time with the Open‑Source TaRO Framework

The paper introduces the Temporal‑Aware Reasoning Optimization (TaRO) framework, which equips multimodal video large models with time‑aware reasoning through three components: template‑based exploration, a temporal‑sensitivity reward, and progressive curriculum learning. It reports state‑of‑the‑art zero‑shot performance on several video temporal grounding benchmarks, including long‑video datasets.

Multimodal Learning · TaRO · Temporal Reasoning
9 min read