Tagged articles
5 articles
Page 1 of 1
HyperAI Super Neural
HyperAI Super Neural
Dec 23, 2025 · Artificial Intelligence

NeurIPS 2025‑Selected Multi‑Stream Control Framework Achieves Precise Audio‑Visual Sync via Audio Demixing

The paper introduces a NeurIPS 2025‑selected multi‑stream video generation framework that demixes audio into speech, effects, and music, using dedicated control streams and a multi‑stage training strategy to achieve markedly better lip‑sync, event timing, and overall visual quality than prior methods.

MTV frameworkNeurIPS 2025audio demixing
0 likes · 9 min read
NeurIPS 2025‑Selected Multi‑Stream Control Framework Achieves Precise Audio‑Visual Sync via Audio Demixing
DataFunSummit
DataFunSummit
Mar 30, 2025 · Artificial Intelligence

RLHF Techniques and Challenges in Large Language Models and Multimodal Applications

This article reviews reinforcement learning, RLHF, and related alignment techniques for large language models and multimodal systems, covering fundamentals, recent advances such as InstructGPT, Constitutional AI, RLAIF, Super Alignment, GPT‑4o, video LLMs, and experimental evaluations of proposed methods.

RLHFmultimodal alignmentpreference learning
0 likes · 26 min read
RLHF Techniques and Challenges in Large Language Models and Multimodal Applications
Architect
Architect
Mar 24, 2025 · Artificial Intelligence

How Multimodal Alignment Is Shaping the Future of Large Language Models

This article provides a systematic review of recent advances in multimodal alignment for large language models, covering key contributions, application scenarios, dataset construction, evaluation benchmarks, future challenges, and insights from LLM alignment research to guide both academia and industry.

AI SafetyMLLMdataset construction
0 likes · 26 min read
How Multimodal Alignment Is Shaping the Future of Large Language Models
AIWalker
AIWalker
Jan 13, 2025 · Artificial Intelligence

ArtCrafter: A Controllable, Diverse Style Transfer Framework from Tsinghua

ArtCrafter introduces a novel text‑image style transfer framework that leverages attention‑based style extraction, text‑image alignment enhancement, and explicit modulation to achieve controllable, diverse, and high‑fidelity visual results, outperforming existing methods in both qualitative and quantitative evaluations.

Attention MechanismStyle Transferdiffusion models
0 likes · 10 min read
ArtCrafter: A Controllable, Diverse Style Transfer Framework from Tsinghua
DataFunSummit
DataFunSummit
Aug 3, 2023 · Artificial Intelligence

Integrating Vector Databases with Large Language Models for Enterprise AI Applications

The article explains how combining vector databases with large language models can help governments and enterprises leverage massive private data for AI, covering semantic search, approximate nearest neighbor techniques, alignment challenges across modalities, and future directions for fine‑grained data integration.

AIapproximate nearest neighborlarge language model
0 likes · 7 min read
Integrating Vector Databases with Large Language Models for Enterprise AI Applications