Network Intelligence Research Center (NIRC)
Network Intelligence Research Center (NIRC)
Apr 9, 2025 · Artificial Intelligence

Why Scaling Laws Fail for Video MLLMs: Uncovering the Temporal Hacking Problem

The article analyzes the anti‑scaling phenomenon in video large‑language models, identifies a “temporal hacking” shortcut where models focus on a few key frames, formalizes it via reward‑hacking theory, introduces the Temporal Perplexity (TPL) metric, and proposes an Unhackable Temporal Rewarding (UTR) framework to mitigate the issue.

Reinforcement LearningTemporal PerplexityUTR
0 likes · 14 min read
Why Scaling Laws Fail for Video MLLMs: Uncovering the Temporal Hacking Problem