Testing Gemini Omni: Turn Sketches into Cinematic Videos with One Prompt
At I/O, Google unveiled Gemini Omni, a multimodal world model that lets users edit videos by speaking a single sentence and turn simple sketches into cinematic clips. It also offers conversational editing and digital-twin avatars, along with emergent style-transfer and scene-continuation capabilities, all backed by a new multimodal training objective.
