Tagged articles
4 articles
Machine Heart
Apr 8, 2026 · Artificial Intelligence

Beyond Simple Motions: How SentiAvatar Redefines 3D Digital Human Action Generation

SentiAvatar introduces a two‑stage plan‑then‑infill framework that separates sentence‑level semantic planning from frame‑level, prosody‑driven motion infill. It builds on a 200K‑sequence Motion Foundation Model and the newly released 21K‑clip SuSuInterActs dataset to achieve state‑of‑the‑art, real‑time expressive 3D digital human animation.

3D digital humans · Motion Foundation Model · SentiAvatar
0 likes · 13 min read
Sohu Tech Products
Oct 29, 2025 · Information Security

Why a New Multimodal AI Security Dataset Is Essential for Detecting Deepfakes

As multimodal AI models become capable of generating realistic images, videos, and audio, the OpenMMSec benchmark provides a comprehensive, open‑source dataset and evaluation metrics that help researchers and developers detect and localize AI‑generated forgeries across all three modalities, addressing emerging security challenges.

AI security · Evaluation Metrics · OpenMMSec
0 likes · 18 min read
Kuaishou Tech
Sep 25, 2023 · Artificial Intelligence

LPR4M: A Large-Scale Multimodal Livestreaming Product Recognition Dataset and the RICE Cross‑View Semantic Alignment Model

This paper introduces LPR4M, a 4‑million‑pair multimodal dataset for livestreaming product recognition, and proposes the RICE model that combines instance‑level contrastive learning with patch‑level cross‑view semantic alignment, demonstrating state‑of‑the‑art performance on both LPR4M and MovingFashion benchmarks.

Deep Learning · cross-view alignment · livestreaming
0 likes · 19 min read