Jun 15, 2026 · Artificial Intelligence

How Close Is Video Generation to Being Beautiful, Useful, Accurate? 1080‑Prompt, 7‑Model KIVI Benchmark

Researchers introduce KIVI, a knowledge‑intensive video generation benchmark with 1080 real‑world prompts. They evaluate seven models using two new metrics, FactP and HelpS, and find systematic errors such as entity mis‑depiction, procedural mistakes, and component misplacement, as well as a gap between human‑crafted and AI‑generated videos.

FactP · HelpS · KIVI

9 min read