Tag

Digital Human

0 views collected around this technical thread.

Efficient Ops
Efficient Ops
Mar 16, 2025 · Artificial Intelligence

How AI Digital Humans Transform Banking Services: Architecture, Capabilities, and Use Cases

This article explains how AI-powered digital humans can modernize banking by offering modular, multi‑modal interaction, personalized multilingual service, 24‑hour availability, and risk‑aware automation, while detailing the underlying AI foundation, decision engine, visual rendering, and deployment strategies.

AICustomer ServiceDigital Human
0 likes · 7 min read
How AI Digital Humans Transform Banking Services: Architecture, Capabilities, and Use Cases
AntTech
AntTech
Nov 27, 2024 · Artificial Intelligence

EchoMimicV2: An End-to-End Audio‑Driven Semi‑Body Human Animation Framework

EchoMimicV2, an open‑source project from Ant Group's Alipay AI team, introduces an end‑to‑end audio‑driven framework that generates high‑quality semi‑body portrait videos by jointly coordinating audio, pose, and image inputs, while addressing challenges of condition complexity, model stability, and computational cost.

AI researchDigital HumanMultimodal Generation
0 likes · 16 min read
EchoMimicV2: An End-to-End Audio‑Driven Semi‑Body Human Animation Framework
DataFunSummit
DataFunSummit
Oct 10, 2024 · Artificial Intelligence

AIGC‑Assisted Marketing Material Generation at Shujia Technology

This article describes Shujia Technology's use of artificial intelligence to generate marketing images and videos, outlining the background, challenges of high-volume content production, detailed solutions for image and video assets—including layout models, diffusion models, and digital human synthesis—and future research directions.

AIGCDigital HumanImage Generation
0 likes · 12 min read
AIGC‑Assisted Marketing Material Generation at Shujia Technology
AntTech
AntTech
Jul 24, 2024 · Artificial Intelligence

EchoMimic: An Open‑Source AIGC‑Driven Framework for 2D/3D Digital Human Generation

EchoMimic, an open‑source project from Ant Group, presents a flexible, audio‑ and pose‑driven digital human generation pipeline that combines 2D, 3D and AIGC techniques, reduces production costs, achieves real‑time inference, and includes a detailed architecture, related work analysis, and future research directions.

AIGCComputer VisionDigital Human
0 likes · 18 min read
EchoMimic: An Open‑Source AIGC‑Driven Framework for 2D/3D Digital Human Generation
DaTaobao Tech
DaTaobao Tech
Jan 12, 2024 · Artificial Intelligence

AI‑Powered Photo‑to‑3D Avatar Generation in Taobao Life 2

Taobao Life 2’s new AI‑driven “photo‑face” feature automatically converts a single portrait into a stylized 3D avatar in under five seconds by using a 3D morphable model, lightweight MLP mapping, and fine‑grained attribute classification, cutting manual sculpting time from half an hour to seconds while preserving user‑specific details.

3D face reconstructionAIComputer Vision
0 likes · 13 min read
AI‑Powered Photo‑to‑3D Avatar Generation in Taobao Life 2
政采云技术
政采云技术
Oct 26, 2023 · Game Development

Creating and Driving a Digital Human with Unreal Engine MetaHuman and AI Face Swapping

This guide walks through building a 3D digital human using Unreal Engine’s MetaHuman Creator, driving it with live facial capture from an iPhone, and applying AI‑based face swapping (roop) to replace the character’s face with Mr. Bean, covering all required tools, setup, and export steps.

AI Face SwapDigital HumanMetaHuman
0 likes · 8 min read
Creating and Driving a Digital Human with Unreal Engine MetaHuman and AI Face Swapping
DataFunTalk
DataFunTalk
Oct 6, 2023 · Artificial Intelligence

Music‑Driven Digital Human: Algorithms, System Architecture, and Practical Applications

This article presents a comprehensive overview of the Music XR Maker framework, detailing how music‑driven AI techniques enable digital human creation, dance generation, lip‑sync, and expressive performance, and discusses data pipelines, model architectures, 3D rendering, product integration, and real‑time deployment within Tencent Music’s Tianqin Lab.

AI algorithmsDance GenerationDigital Human
0 likes · 15 min read
Music‑Driven Digital Human: Algorithms, System Architecture, and Practical Applications
DataFunSummit
DataFunSummit
May 15, 2023 · Artificial Intelligence

Music-Driven Digital Human: Algorithms and Practices

This article presents the Music XR Maker framework and its four core components—music-driven system architecture, dance generation, lip-sync driven by singing voice, and expressive singing facial animation—detailing data sources, AI generation pipelines, 3D rendering, product applications, and future research directions.

3D renderingAIDance Generation
0 likes · 15 min read
Music-Driven Digital Human: Algorithms and Practices
Kuaishou Large Model
Kuaishou Large Model
Mar 31, 2023 · Artificial Intelligence

How Kuaishou Elevates Video Quality and AI Performance at NVIDIA GTC 2023

At NVIDIA GTC 2023, Kuaishou engineers unveiled cutting‑edge solutions ranging from video quality assessment and enhancement, 3D digital‑human live streaming, a custom TensorRT‑based performance framework, large‑scale recommendation model acceleration, to multimodal massive‑model deployment for short‑video scenarios.

AI optimizationDigital HumanMultimodal Models
0 likes · 9 min read
How Kuaishou Elevates Video Quality and AI Performance at NVIDIA GTC 2023
Kuaishou Audio & Video Technology
Kuaishou Audio & Video Technology
Mar 30, 2023 · Artificial Intelligence

How Kuaishou Elevates Short‑Video Quality and AI Performance at NVIDIA GTC 2023

At NVIDIA GTC 2023, Kuaishou engineers presented cutting‑edge solutions ranging from video quality assessment and enhancement to digital‑human live streaming, custom performance‑optimization frameworks, large‑scale recommendation model acceleration, and multimodal massive‑model deployment for short‑video applications.

AI optimizationDigital Humanlarge recommendation models
0 likes · 9 min read
How Kuaishou Elevates Short‑Video Quality and AI Performance at NVIDIA GTC 2023
DataFunSummit
DataFunSummit
Mar 1, 2023 · Artificial Intelligence

Automating High-Fidelity Digital Human Creation: Scanning, Driving, and Remaining Challenges

The article details YINGMOU's research on automating the production of high‑fidelity digital humans, covering their rapid 3‑5‑day pipeline, extensive face‑asset database, advanced light‑field scanning, automatic topology reconstruction, AI‑driven rigging, dynamic mapping, and the unresolved issues of hair and cloth.

AI AutomationDigital HumanPBR materials
0 likes · 12 min read
Automating High-Fidelity Digital Human Creation: Scanning, Driving, and Remaining Challenges
DataFunSummit
DataFunSummit
Jan 13, 2023 · Artificial Intelligence

2022 Digital Human System Basic Capability Evaluation and Observations

This report presents the background, methodology, evaluation model, results, and key observations of the 2022 digital human system basic capability assessment, highlighting technical, engineering, and security challenges, industry standards development, and future work to advance digital human technologies.

Artificial IntelligenceCapability EvaluationDigital Human
0 likes · 12 min read
2022 Digital Human System Basic Capability Evaluation and Observations
DataFunTalk
DataFunTalk
Dec 20, 2022 · Artificial Intelligence

Baidu Smart Cloud Digital Human Platform: Development, Architecture, and Solution Overview

This article provides a comprehensive overview of Baidu's Smart Cloud Digital Human platform, detailing its evolution since 2019, core AI-driven architecture, platform components such as persona management and business orchestration, various industry solutions, and technical Q&A on rendering, latency, and deployment.

AI PlatformBaiduDigital Human
0 likes · 13 min read
Baidu Smart Cloud Digital Human Platform: Development, Architecture, and Solution Overview
DataFunSummit
DataFunSummit
Dec 19, 2022 · Artificial Intelligence

Multimodal Large‑Model Driven Virtual Digital Humans: Background, Methods, and Applications

This article introduces the rapid development of multimodal digital humans powered by large AI models, covering their background, current challenges, NeRF‑GAN based modeling methods, multimodal dialogue capabilities, and real‑world application cases such as virtual assistants, tourism guides, and sign‑language avatars.

AIGCDigital HumanHuman-Computer Interaction
0 likes · 14 min read
Multimodal Large‑Model Driven Virtual Digital Humans: Background, Methods, and Applications
DataFunSummit
DataFunSummit
Nov 26, 2022 · Artificial Intelligence

Multimodal Digital Human Driving: Motionverse Engine and Metaverse Applications

This article introduces the evolution of digital human technology, explains the five maturity levels (L1‑L5), describes the Motionverse multimodal motion‑generation platform and its large‑scale data and AI models, and outlines SDK integration strategies for diverse metaverse scenarios.

Digital HumanMetaversemotion generation
0 likes · 11 min read
Multimodal Digital Human Driving: Motionverse Engine and Metaverse Applications
DataFunSummit
DataFunSummit
Nov 10, 2022 · Artificial Intelligence

Voice‑Driven Facial Animation for Digital Humans: Techniques and OPPO XiaoBu Assistant Practice

This article introduces digital‑human voice‑driven facial animation technologies, compares motion‑capture, audio‑driven and key‑point methods, details OPPO XiaoBu Assistant's end‑side and cloud‑side Audio2Lip pipelines, explores BlendShape versus Mesh approaches, and discusses current challenges and future research directions.

Digital HumanOPPOReal-time Rendering
0 likes · 15 min read
Voice‑Driven Facial Animation for Digital Humans: Techniques and OPPO XiaoBu Assistant Practice
DataFunSummit
DataFunSummit
Oct 26, 2022 · Artificial Intelligence

Digital Human Technology on the Soul Platform: Architecture, Key Techniques, and Application Scenarios

This article introduces Soul's digital‑human solution, covering the platform’s social metaverse concept, the self‑developed N⋀W⋀ rendering engine, its AI‑driven head, half‑body and full‑body capture pipelines, rendering capabilities, design resources, practical use cases, and future research directions.

AIAR/VRDigital Human
0 likes · 10 min read
Digital Human Technology on the Soul Platform: Architecture, Key Techniques, and Application Scenarios
Baidu Geek Talk
Baidu Geek Talk
Sep 7, 2022 · Artificial Intelligence

Design and Architecture of AI Digital Human Live Streaming System

The paper presents a cloud‑native architecture for AI‑driven digital‑human live‑streaming, detailing three‑layer asset, interaction, and media modules, real‑time script and Q&A scheduling, fault‑tolerant rendering and control services, and demonstrates how virtual anchors can deliver continuous, lifelike 24/7 e‑commerce streams.

AIDigital HumanLive Streaming
0 likes · 21 min read
Design and Architecture of AI Digital Human Live Streaming System
DataFunSummit
DataFunSummit
Aug 3, 2022 · Artificial Intelligence

AliMe MKG: Multimodal Knowledge Graph for Live E‑commerce and Its Technical Exploration

This report presents AliMe MKG, a multimodal knowledge graph designed for live e‑commerce, detailing its business background, construction and application, the three types of multimodal knowledge (triples, sentences, and visual media), the underlying extraction techniques, and its deployment in digital‑human anchors and intelligent live‑room assistants.

AIDigital Humane-commerce
0 likes · 19 min read
AliMe MKG: Multimodal Knowledge Graph for Live E‑commerce and Its Technical Exploration
DataFunSummit
DataFunSummit
Apr 14, 2022 · Artificial Intelligence

Advances in Alibaba's Digital Human Technology: Construction, Performance, Interaction, and the MMTK Multimodal Algorithm Library

This article reviews Alibaba's digital‑human (virtual avatar) research over the past few years, covering the product’s evolution, a six‑stage pipeline for building digital humans, solutions to key challenges in realism, multimodal interaction, and the open‑source MMTK algorithm library.

Digital HumanEmotion ModelingSpeech Synthesis
0 likes · 12 min read
Advances in Alibaba's Digital Human Technology: Construction, Performance, Interaction, and the MMTK Multimodal Algorithm Library