Gemini Omni Tested: Turn Sketches into Blockbuster Videos with a Single Prompt
At I/O, Google DeepMind unveiled Gemini Omni, a multimodal world model that combines reasoning and generation to edit videos via conversational prompts. It supports digital avatars, demonstrates emergent cross‑modal improvements, and incorporates safety measures such as Avatar Flow and dual watermarks, signaling a step toward AGI‑level video AI.
