Bilibili Tech
Author

Bilibili Tech

Provides introductions and tutorials on Bilibili-related technologies.

396
Articles
0
Likes
1.2k
Views
0
Comments
Recent Articles

Latest from Bilibili Tech

100 recent articles max
Bilibili Tech
Bilibili Tech
Aug 12, 2025 · Artificial Intelligence

How AI Recreates Original Voices in Multilingual Video Dubbing

This article explains the technical challenges and innovative AI solutions behind preserving speaker identity, emotion, and timing while translating video content into multiple languages, covering speech generation modeling, speaker segmentation, adversarial reinforcement learning, proper‑noun adaptation, and audio‑visual alignment techniques.

AI voice cloningaudio-visual alignmentcross-modal translation
0 likes · 22 min read
How AI Recreates Original Voices in Multilingual Video Dubbing
Bilibili Tech
Bilibili Tech
Aug 12, 2025 · Artificial Intelligence

How Bilibili Scaled AI Model Training with Alluxio Cache Acceleration

This article details Bilibili's multi-layer storage architecture and Alluxio‑based cache acceleration for large‑scale AI model training, covering challenges of high‑throughput, low‑latency file access, metadata scalability, fault tolerance, and the engineering solutions that boosted I/O performance up to ten‑fold.

AIAlluxioCaching
0 likes · 24 min read
How Bilibili Scaled AI Model Training with Alluxio Cache Acceleration
Bilibili Tech
Bilibili Tech
Aug 8, 2025 · Artificial Intelligence

Can Language‑Centric Tree Reasoning Transform Video Question Answering?

This article introduces a language‑centric tree reasoning (LTR) framework that recursively decomposes VideoQA queries into perceptual sub‑questions and performs bottom‑up logical inference with video assistance, achieving significantly higher accuracy and explainability across eleven benchmark datasets.

Artificial IntelligenceExplainabilityTree Reasoning
0 likes · 17 min read
Can Language‑Centric Tree Reasoning Transform Video Question Answering?
Bilibili Tech
Bilibili Tech
Aug 5, 2025 · Artificial Intelligence

How Bilibili’s IndexTTS2 Achieves Real‑Time, Emotion‑Rich Voice Translation

IndexTTS2 introduces a cross‑modal, multi‑language voice translation system that preserves speaker identity, acoustic space, and multi‑source timbre, while tackling challenges like voice personality loss, subtitle cognitive load, localization costs, multi‑speaker diarization, and cultural adaptation through novel time‑coding, adversarial RL, and diffusion‑based lip‑sync techniques.

adversarial reinforcement learningcross-language TTSlip sync
0 likes · 20 min read
How Bilibili’s IndexTTS2 Achieves Real‑Time, Emotion‑Rich Voice Translation
Bilibili Tech
Bilibili Tech
Aug 1, 2025 · Mobile Development

Revamping a Mobile Video Editor: MVVM + UDF Architecture and Redo/Undo Design

This article details the comprehensive redesign of a mobile video‑editing page, covering background challenges, requirement analysis, MVVM + UDF architectural choices, module decomposition, dependency‑injection implementation, and the design of a robust Redo/Undo system to improve maintainability and user experience.

MVVMUDFarchitecture
0 likes · 19 min read
Revamping a Mobile Video Editor: MVVM + UDF Architecture and Redo/Undo Design
Bilibili Tech
Bilibili Tech
Jul 25, 2025 · Big Data

How Unified Metadata Lineage Transforms Big Data Governance and Security

This article introduces the comprehensive design and evolution of a unified metadata lineage platform for big data, covering background, data processing chain, lineage models, system architecture, quality metrics, application scenarios, and future plans to enhance data governance, quality, and security.

Big DataData GovernanceData Security
0 likes · 27 min read
How Unified Metadata Lineage Transforms Big Data Governance and Security
Bilibili Tech
Bilibili Tech
Jul 18, 2025 · Frontend Development

How We Built a Unified Frontend Material Platform to Replace cnpm and Boost Efficiency

This article describes the design and implementation of a new frontend material platform that consolidates npm packages, UMD scripts, and icons, introduces a custom publishing tool (bpm), integrates with Nexus3, adds unified authentication, improves search with MeiliSearch, and outlines migration, classification, and documentation strategies.

Package ManagementSearchdocumentation
0 likes · 27 min read
How We Built a Unified Frontend Material Platform to Replace cnpm and Boost Efficiency
Bilibili Tech
Bilibili Tech
Jul 11, 2025 · Artificial Intelligence

IndexTTS2: Emotionally Expressive, Duration-Controlled Zero-Shot TTS

IndexTTS2 introduces a novel auto-regressive zero-shot text-to-speech model that achieves precise duration control and fine-grained emotional expression through a universal time‑encoding mechanism, decoupled voice‑style and emotion modeling, and a GPT‑style latent feature, outperforming state‑of‑the‑art baselines across multiple benchmarks.

duration controlemotional synthesisspeech generation
0 likes · 23 min read
IndexTTS2: Emotionally Expressive, Duration-Controlled Zero-Shot TTS
Bilibili Tech
Bilibili Tech
Jul 4, 2025 · Operations

Solving CPU Performance Layering in Heterogeneous Data Centers: A Practical Guide

This article explains why heterogeneous servers cause CPU performance layering, describes how to detect the issue using metrics such as NUMA hit/miss rates, cache miss ratios and frequency states, and provides step‑by‑step remediation techniques—including NUMA binding, cache isolation, recompilation and frequency locking—to improve resource pooling efficiency in modern data centers.

CPU performanceData CenterNUMA
0 likes · 24 min read
Solving CPU Performance Layering in Heterogeneous Data Centers: A Practical Guide