Machine Heart
Jun 11, 2026 · Artificial Intelligence
MBench: Tsinghua and Tencent Define Long-Term Memory for Video World Models
MBench, a new benchmark from Tsinghua University and Tencent, systematically evaluates the long-term memory of streaming video generation models along three dimensions: entity, environment, and causal consistency. It introduces a trigger-conditioned scoring scheme and shows that memory remains a major bottleneck for current state-of-the-art models.
AI · benchmark · long-term consistency
8 min read
