AIWalker
AIWalker
Mar 5, 2026 · Artificial Intelligence

How ViDA-UGC Leverages Large Multimodal Models for Fine-Grained Visual Quality Assessment

The article introduces ViDA-UGC, a large‑scale UGC visual‑quality dataset and its companion benchmark ViDA‑Bench, explains the MILP‑driven sampling, expert annotation pipeline, and CoT‑based evaluation framework, and shows how fine‑tuning popular multimodal LLMs on this data markedly improves low‑level quality perception, grounding, and description capabilities.

benchmarkchain of thoughtdataset
0 likes · 12 min read
How ViDA-UGC Leverages Large Multimodal Models for Fine-Grained Visual Quality Assessment
Bilibili Tech
Bilibili Tech
Jan 16, 2024 · Industry Insights

How Bilibili Cuts Video Bandwidth: Theory and Practice of Transcoding Optimization

This article analyzes the fundamental goals of video transcoding, presents an information‑theoretic framework for bitrate reduction, compares traditional and deep‑learning codecs, and shares Bilibili's practical system design, parameter‑decision strategies, and visual‑quality‑aware optimizations that dramatically lower bandwidth consumption.

VMAFVideo Transcodingbandwidth optimization
0 likes · 24 min read
How Bilibili Cuts Video Bandwidth: Theory and Practice of Transcoding Optimization