Machine Heart
Jun 15, 2026 · Artificial Intelligence
How Close Is Video Generation to Being Beautiful, Useful, and Accurate? The 1080-Prompt, 7-Model KIVI Benchmark
Researchers introduce KIVI, a knowledge-intensive video generation benchmark built from 1080 real-world prompts. They evaluate seven models with two new metrics, FactP and HelpS. The evaluation reveals systematic errors such as entity mis-depiction, procedural mistakes, and component misplacement, and shows a gap between human-crafted and AI-generated videos.
FactP · HelpS · KIVI
