Tagged articles

AI video

15 articles · Page 1 of 1

Jun 15, 2026 · Artificial Intelligence

Gemini Omni Tested: Turn Sketches into Blockbuster Videos with a Single Prompt

Google DeepMind unveiled Gemini Omni at I/O, a multimodal world model that combines reasoning and generation to edit videos via conversational prompts, supports digital avatars, demonstrates emergent cross‑modal improvements, and incorporates safety cages such as Avatar Flow and dual watermarks, signaling a step toward AGI‑level video AI.

AI videoGemini OmniMultimodal AI

0 likes · 10 min read

Gemini Omni Tested: Turn Sketches into Blockbuster Videos with a Single Prompt

JD Cloud Developers

Jun 11, 2026 · Artificial Intelligence

How JD’s Open‑Source JoyAI‑Echo Tackles the Three Big Challenges of Long‑Form Video Generation

JD’s newly open‑source JoyAI‑Echo framework addresses long‑video generation’s three major pain points—character inconsistency, unstable speaker timbre, and slow rendering—through a cross‑modal memory bank, memory‑driven training, a conversational Director Agent, and real‑time super‑resolution, delivering up to 7.5× speed gains and superior benchmark results.

AI videoBenchmarkJoyAI-Echo

0 likes · 6 min read

How JD’s Open‑Source JoyAI‑Echo Tackles the Three Big Challenges of Long‑Form Video Generation

Design Hub

May 25, 2026 · Artificial Intelligence

How Meituan’s Open‑Source Avatar Redefines Digital Human Voice‑Over Costs (Beyond HeyGen)

LongCat‑Video‑Avatar‑1.5, Meituan’s open‑source audio‑driven video generation model, upgrades its encoder, stability, multi‑character support and 8‑step distillation, provides a detailed workflow, benchmark evaluation, and examines its impact on designers, operators, e‑commerce and marketing while highlighting deployment and compliance challenges.

AI videoAudio-driven Video GenerationContent Automation

0 likes · 22 min read

Lao Guo's Learning Space

Apr 21, 2026 · Artificial Intelligence

HappyOyster: Build an Explorable Interactive World with a Single Prompt

Alibaba’s ATH team unveiled HappyOyster, a real‑time world‑model platform that lets users generate and explore interactive 3D environments from a single sentence or image, offering two modes—Wander for exploration and Direct for creation—while detailing its streaming architecture, multimodal foundation, competitive advantages, use cases, and current limitations.

AI videoGame DevelopmentGenerative AI

0 likes · 11 min read

HappyOyster: Build an Explorable Interactive World with a Single Prompt

Lao Guo's Learning Space

Apr 12, 2026 · Artificial Intelligence

Who Wins the AI Video Throne? HappyHorse-1.0 vs ByteDance Seedance 2.0

The article dissects the April 2026 showdown between the anonymous 15‑billion‑parameter HappyHorse‑1.0 and ByteDance’s two‑year‑old Seedance 2.0, detailing Elo score gaps, contrasting single‑stream versus dual‑branch Transformer designs, speed advantages, quality trade‑offs, and offering a decision tree for different production needs.

AI videoElo rankingMultimodal

0 likes · 11 min read

Who Wins the AI Video Throne? HappyHorse-1.0 vs ByteDance Seedance 2.0

Coder Circle

Apr 10, 2026 · Industry Insights

Fei‑Fei Li Becomes Alibaba Cloud CTO as AI Market Faces Disruption and Growth

The AI Daily for April 10, 2026 reports that Fei‑Fei Li has been appointed Alibaba Cloud CTO, World Labs raised $1 billion, U.S. SaaS stocks fell nearly 40% due to AI agents, Amazon unveiled a faster Trainium3 chip and plans to sell it, Florida opened an OpenAI probe, and China’s token economy and AI‑driven video content are reshaping the industry.

AI videoAlibaba CloudAmazon AI chips

0 likes · 7 min read

Fei‑Fei Li Becomes Alibaba Cloud CTO as AI Market Faces Disruption and Growth

Machine Heart

Apr 4, 2026 · Artificial Intelligence

Is AI Video Generation Shifting From Model Showcases to Integrated Workflows?

The article analyzes how AI video generation, after the launch of OpenAI's Sora, is moving from a focus on model performance to embedding video capabilities into existing platforms and business workflows, highlighting timeline shifts, key players, and emerging competitive criteria.

AI videoGenerative AIMarket Trends

0 likes · 7 min read

Is AI Video Generation Shifting From Model Showcases to Integrated Workflows?

AI Explorer

Mar 25, 2026 · Artificial Intelligence

Why OpenAI Shut Down Sora After 6 Months and Disney’s $1B Deal Crumbled

OpenAI announced the abrupt shutdown of its AI video app Sora, ending the iOS app, API and website, while Disney cancelled a planned $1 billion investment; the move stems from compute constraints, IPO pressures, deep‑fake controversies, and the product’s failure to find a sustainable market fit.

AI videoDisneyOpenAI

0 likes · 12 min read

Why OpenAI Shut Down Sora After 6 Months and Disney’s $1B Deal Crumbled

Kuaishou Tech

May 26, 2025 · Artificial Intelligence

CineMaster: A 3D‑Aware and Controllable Framework for Cinematic Text‑to‑Video Generation

Researchers introduce CineMaster, a SIGGRAPH‑2025 paper presenting a 3D‑aware, controllable text‑to‑video generation framework that lets users define target objects and camera motions via an interactive workflow, enabling cinematic video creation with high‑quality, user‑directed results.

3D-awareAI videoCineMaster

0 likes · 6 min read

CineMaster: A 3D‑Aware and Controllable Framework for Cinematic Text‑to‑Video Generation

Code Mala Tang

Apr 5, 2025 · Artificial Intelligence

Open-Source AI Video Models Are Redefining the Industry – China Leads the Charge

While most eyes remain on familiar AI giants, China’s Alibaba and DeepSeek are unveiling open‑source video and inference models that run on consumer GPUs, sparking a regulatory scramble and threatening the dominance of closed‑source AI, heralding a rapid, disruptive shift across the industry.

AI localizationAI regulationAI video

0 likes · 10 min read

Open-Source AI Video Models Are Redefining the Industry – China Leads the Charge

Meituan Technology Team

Feb 9, 2025 · Artificial Intelligence

NTIRE 2025 XGC AI-Generated Video Quality Assessment Challenge

The NTIRE 2025 XGC AI‑Generated Video Quality Assessment Challenge, hosted at the CVPR workshop, invites participants to build VQA models that predict mean opinion scores for 34,029 AI‑generated videos created from 4,689 prompts using 14 generation models, with training, validation, and test splits provided as JSON, and submissions evaluated by the average of PLCC and SROCC, while key dates run from February 5 to June 15 2025 and prize money up to $1,200 is offered.

AI videoCVPRNTIRE

0 likes · 6 min read

NTIRE 2025 XGC AI-Generated Video Quality Assessment Challenge

58UXD

Dec 18, 2024 · Artificial Intelligence

Transform Your Designs with AI: 5 Steps to Create Stunning Videos

Learn how designers can harness AI tools in five practical steps—from script generation and AI‑driven image creation to video synthesis, music production, and final editing—to craft compelling, high‑quality videos that boost creativity and efficiency.

AI toolsAI videocreative AI

0 likes · 4 min read

Transform Your Designs with AI: 5 Steps to Create Stunning Videos

Alibaba Cloud Big Data AI Platform

Jun 4, 2024 · Artificial Intelligence

EasyAnimate: High‑Resolution Video Generation via Diffusion Transformers

EasyAnimate, an open‑source DiT‑based video generation framework from Alibaba Cloud AI Platform PAI, offers a complete pipeline—including data preprocessing, VAE and DiT training, LoRA fine‑tuning, motion‑module integration, and scalable inference up to 768×768 resolution and 144 frames—leveraging Diffusion Transformers to produce longer, higher‑quality videos.

AI videoLoRAVAE

0 likes · 14 min read

EasyAnimate: High‑Resolution Video Generation via Diffusion Transformers

NewBeeNLP

Mar 20, 2024 · Artificial Intelligence

How Open‑Sora 1.0 Replicates Sora: Architecture, Training Pipeline & Performance Insights

This article provides a comprehensive technical walkthrough of Open‑Sora 1.0, covering its Diffusion‑Transformer architecture, three‑stage training strategy, data‑preprocessing scripts, generation quality, and the Colossal‑AI acceleration that together make Sora‑level video synthesis openly reproducible.

AI videoOpen-Soradiffusion transformer

0 likes · 12 min read

How Open‑Sora 1.0 Replicates Sora: Architecture, Training Pipeline & Performance Insights

Tencent Cloud Developer

Apr 26, 2018 · Industry Insights

How Tencent Video Cloud Is Shaping the Future of Streaming and AI‑Powered Video

The article reviews the evolution of the video industry, analyzes Tencent Cloud's VOD, live, real‑time communication, short‑video and AI video services, explains their technical architecture, and discusses market trends such as "live +" integration and AI‑driven enhancements.

AI videoLive StreamingVOD

0 likes · 21 min read

How Tencent Video Cloud Is Shaping the Future of Streaming and AI‑Powered Video