Tagged articles
5 articles
Machine Heart
Apr 23, 2026 · Artificial Intelligence

UniLS: End-to-End Audio-Driven Framework Eliminates the ‘Poker Face’ in Digital Human Dialogue

UniLS, presented as the first end‑to‑end audio‑driven framework that jointly generates speaking and listening facial motions for digital humans, achieves state‑of‑the‑art speaking accuracy, improves listening naturalness by 44.1%, and runs at over 500 FPS, according to the quantitative evaluations and user studies in its CVPR 2026‑accepted paper.

CVPR 2026 · Speech synthesis · audio-driven animation
0 likes · 9 min read
AntTech
Nov 27, 2024 · Artificial Intelligence

EchoMimicV2: An End-to-End Audio‑Driven Semi‑Body Human Animation Framework

EchoMimicV2, an open‑source project from Ant Group's Alipay AI team, introduces an end‑to‑end audio‑driven framework that generates high‑quality semi‑body portrait videos by jointly coordinating audio, pose, and image inputs, while addressing challenges of condition complexity, model stability, and computational cost.

Digital Human · audio-driven animation · diffusion models
0 likes · 16 min read
Alipay Experience Technology
Nov 27, 2024 · Artificial Intelligence

EchoMimicV2: High‑Quality Audio‑Driven Half‑Body Human Animation with Simple Inputs

EchoMimicV2 is an open‑source digital‑human framework that generates high‑quality half‑body animation videos from a single reference image, an audio clip, and a hand‑gesture sequence. It addresses three challenges in audio‑driven animation: the restriction of earlier methods to facial portraits, complex condition injection, and inference latency.

AI research · Digital Human · Video Generation
0 likes · 18 min read
AntTech
Jul 24, 2024 · Artificial Intelligence

EchoMimic: An Open‑Source AIGC‑Driven Framework for 2D/3D Digital Human Generation

EchoMimic, an open‑source project from Ant Group, presents a flexible, audio‑ and pose‑driven digital human generation pipeline that combines 2D, 3D and AIGC techniques, reduces production costs, and achieves real‑time inference. The article covers its architecture in detail, analyzes related work, and outlines future research directions.

AIGC · Digital Human · audio-driven animation
0 likes · 18 min read