Gemini Omni Review: Turning Sketches into Cinematic Videos with a Single Prompt
Google unveiled Gemini Omni at I/O, a multimodal world model that combines reasoning and generation to create realistic video, edit scenes via conversation, and demonstrate emergent abilities such as style transfer and scene continuation, while introducing safety cages like Avatar Flow and mandatory watermarks.
