Tag

video diffusion

Amap Tech
May 8, 2025 · Artificial Intelligence

FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis

FantasyTalking generates high-fidelity, coherent talking portraits from a single static image. It aligns audio and video in two stages, first learning global motion at the segment level and then refining lip movement at the frame level. Face-centric cross-attention preserves identity, and a motion-intensity module lets users control expression and body movement. The authors report better realism and audio-visual synchronization than prior methods.

Deep Learning · audio-visual alignment · identity preservation
10 min read
Kuaishou Tech
Oct 31, 2023 · Artificial Intelligence

Kuaishou’s Nine Accepted Papers at ACM MM 2023: Summaries and Links

This article presents concise English summaries of nine Kuaishou research papers accepted at ACM MM 2023, covering topics such as no‑reference video quality assessment, adaptive video quality models, blind image super‑resolution, audio‑visual‑language transfer learning, motion‑aware video diffusion, large‑scale e‑commerce retrieval, and interactive segmentation.

Image Super-Resolution · AI · audio-visual language
18 min read