Can World Models Truly Understand Interaction? Inside the Omni-WorldBench
Omni-WorldBench is a comprehensive benchmark that shifts world-model evaluation from visual fidelity to interactive response. This article covers its two-part suite, metric design, and extensive prompt taxonomy, along with experimental results that reveal current models' strengths and limitations in causal and temporal reasoning.
