Tagged articles

AI video

15 articles · Page 1 of 1
Top Architect
Jun 15, 2026 · Artificial Intelligence

Gemini Omni Tested: Turn Sketches into Blockbuster Videos with a Single Prompt

Google DeepMind unveiled Gemini Omni at I/O, a multimodal world model that combines reasoning and generation to edit videos via conversational prompts. It supports digital avatars, demonstrates emergent cross‑modal improvements, and incorporates safety guardrails such as Avatar Flow and dual watermarks, signaling a step toward AGI‑level video AI.

AI video · Gemini Omni · Multimodal AI
0 likes · 10 min read
JD Cloud Developers
Jun 11, 2026 · Artificial Intelligence

How JD’s Open‑Source JoyAI‑Echo Tackles the Three Big Challenges of Long‑Form Video Generation

JD’s newly open‑sourced JoyAI‑Echo framework addresses three major pain points of long‑video generation (character inconsistency, unstable speaker timbre, and slow rendering) through a cross‑modal memory bank, memory‑driven training, a conversational Director Agent, and real‑time super‑resolution, delivering up to 7.5× speed gains and superior benchmark results.

AI video · Benchmark · JoyAI-Echo
0 likes · 6 min read
Design Hub
May 25, 2026 · Artificial Intelligence

How Meituan’s Open‑Source Avatar Redefines Digital Human Voice‑Over Costs (Beyond HeyGen)

LongCat‑Video‑Avatar‑1.5, Meituan’s open‑source audio‑driven video generation model, upgrades its encoder, stability, multi‑character support, and 8‑step distillation. The article provides a detailed workflow and benchmark evaluation, and examines the model’s impact on designers, operators, e‑commerce, and marketing while highlighting deployment and compliance challenges.

AI video · Audio-driven Video Generation · Content Automation
0 likes · 22 min read
Lao Guo's Learning Space
Apr 21, 2026 · Artificial Intelligence

HappyOyster: Build an Explorable Interactive World with a Single Prompt

Alibaba’s ATH team unveiled HappyOyster, a real‑time world‑model platform that lets users generate and explore interactive 3D environments from a single sentence or image. It offers two modes, Wander for exploration and Direct for creation. The article details its streaming architecture, multimodal foundation, competitive advantages, use cases, and current limitations.

AI video · Game Development · Generative AI
0 likes · 11 min read
Lao Guo's Learning Space
Apr 12, 2026 · Artificial Intelligence

Who Wins the AI Video Throne? HappyHorse-1.0 vs ByteDance Seedance 2.0

The article dissects the April 2026 showdown between the anonymous 15‑billion‑parameter HappyHorse‑1.0 and ByteDance’s two‑year‑old Seedance 2.0. It details the Elo score gaps, contrasts single‑stream and dual‑branch Transformer designs, covers speed advantages and quality trade‑offs, and offers a decision tree for different production needs.

AI video · Elo ranking · Multimodal
0 likes · 11 min read
Coder Circle
Apr 10, 2026 · Industry Insights

Fei‑Fei Li Becomes Alibaba Cloud CTO as AI Market Faces Disruption and Growth

The AI Daily for April 10, 2026 reports that Fei‑Fei Li has been appointed Alibaba Cloud CTO, World Labs raised $1 billion, U.S. SaaS stocks fell nearly 40% due to AI agents, Amazon unveiled a faster Trainium3 chip and plans to sell it, Florida opened an OpenAI probe, and China’s token economy and AI‑driven video content are reshaping the industry.

AI video · Alibaba Cloud · Amazon AI chips
0 likes · 7 min read
Machine Heart
Apr 4, 2026 · Artificial Intelligence

Is AI Video Generation Shifting From Model Showcases to Integrated Workflows?

The article analyzes how AI video generation, after the launch of OpenAI's Sora, is moving from a focus on model performance to embedding video capabilities into existing platforms and business workflows, highlighting timeline shifts, key players, and emerging competitive criteria.

AI video · Generative AI · Market Trends
0 likes · 7 min read
AI Explorer
Mar 25, 2026 · Artificial Intelligence

Why OpenAI Shut Down Sora After 6 Months and Disney’s $1B Deal Crumbled

OpenAI announced the abrupt shutdown of its AI video app Sora, ending the iOS app, API and website, while Disney cancelled a planned $1 billion investment; the move stems from compute constraints, IPO pressures, deep‑fake controversies, and the product’s failure to find a sustainable market fit.

AI video · Disney · OpenAI
0 likes · 12 min read
Code Mala Tang
Apr 5, 2025 · Artificial Intelligence

Open-Source AI Video Models Are Redefining the Industry – China Leads the Charge

While most eyes remain on familiar AI giants, China’s Alibaba and DeepSeek are unveiling open‑source video and inference models that run on consumer GPUs, sparking a regulatory scramble and threatening the dominance of closed‑source AI, heralding a rapid, disruptive shift across the industry.

AI localization · AI regulation · AI video
0 likes · 10 min read
Meituan Technology Team
Feb 9, 2025 · Artificial Intelligence

NTIRE 2025 XGC AI-Generated Video Quality Assessment Challenge

The NTIRE 2025 XGC AI‑Generated Video Quality Assessment Challenge, hosted at the CVPR workshop, invites participants to build VQA models that predict mean opinion scores for 34,029 AI‑generated videos created from 4,689 prompts using 14 generation models. Training, validation, and test splits are provided as JSON, and submissions are evaluated by the average of PLCC and SROCC. Key dates run from February 5 to June 15, 2025, and up to $1,200 in prize money is offered.

AI video · CVPR · NTIRE
0 likes · 6 min read
58UXD
Dec 18, 2024 · Artificial Intelligence

Transform Your Designs with AI: 5 Steps to Create Stunning Videos

Learn how designers can harness AI tools in five practical steps—from script generation and AI‑driven image creation to video synthesis, music production, and final editing—to craft compelling, high‑quality videos that boost creativity and efficiency.

AI tools · AI video · creative AI
0 likes · 4 min read
Alibaba Cloud Big Data AI Platform
Jun 4, 2024 · Artificial Intelligence

EasyAnimate: High‑Resolution Video Generation via Diffusion Transformers

EasyAnimate, an open‑source DiT‑based video generation framework from Alibaba Cloud AI Platform PAI, offers a complete pipeline—including data preprocessing, VAE and DiT training, LoRA fine‑tuning, motion‑module integration, and scalable inference up to 768×768 resolution and 144 frames—leveraging Diffusion Transformers to produce longer, higher‑quality videos.

AI video · LoRA · VAE
0 likes · 14 min read
NewBeeNLP
Mar 20, 2024 · Artificial Intelligence

How Open‑Sora 1.0 Replicates Sora: Architecture, Training Pipeline & Performance Insights

This article provides a comprehensive technical walkthrough of Open‑Sora 1.0, covering its Diffusion‑Transformer architecture, three‑stage training strategy, data‑preprocessing scripts, generation quality, and the Colossal‑AI acceleration that together make Sora‑level video synthesis openly reproducible.

AI video · Open-Sora · diffusion transformer
0 likes · 12 min read