Bilibili Tech
Author

Bilibili Tech

Provides introductions and tutorials on Bilibili-related technologies.

396
Articles
0
Likes
1.2k
Views
0
Comments
Recent Articles

Latest from Bilibili Tech

100 recent articles max
Bilibili Tech
Bilibili Tech
Apr 14, 2026 · Artificial Intelligence

Can 10% of Instruction Data Match Full-Scale Fine-Tuning? The SPICE Solution

The SPICE method leverages Fisher Information Matrix submodularity and a novel gradient‑conflict penalty to select a small, high‑quality subset of instruction‑tuning data, achieving comparable or superior performance to full‑data fine‑tuning while dramatically reducing training cost.

Fisher informationGradient ConflictInstruction Tuning
0 likes · 13 min read
Can 10% of Instruction Data Match Full-Scale Fine-Tuning? The SPICE Solution
Bilibili Tech
Bilibili Tech
Mar 31, 2026 · Artificial Intelligence

Can AI Generate Real‑Time, Editable Motion Graphics? Inside Neon Vibe Motion

This article examines Neon Vibe Motion, an open‑source platform that lets users describe motion effects in natural language, uses LLMs to generate executable Canvas/WebGL code with adjustable parameters, and details the architecture, workflow, prompt engineering, and export options that enable real‑time, controllable motion graphics.

AI motion graphicsCanvas 2DLLM code generation
0 likes · 25 min read
Can AI Generate Real‑Time, Editable Motion Graphics? Inside Neon Vibe Motion
Bilibili Tech
Bilibili Tech
Mar 12, 2026 · Frontend Development

Turning Canvas Snapshots into Real Video Cards with ProseMirror

This article recounts Bilibili’s evolution from using Canvas‑based screenshot tricks to render video cards in the Quill editor to adopting ProseMirror with Tiptap, detailing the technical debt, the new schema and NodeView architecture, performance optimizations, batch link validation, and strategies for backward compatibility and real‑time interactive components.

PerformanceProseMirrorfrontend
0 likes · 17 min read
Turning Canvas Snapshots into Real Video Cards with ProseMirror
Bilibili Tech
Bilibili Tech
Feb 13, 2026 · Artificial Intelligence

Self-Forcing: Turning Global Video Diffusion into Causal Streaming for Long-Form Generation

This article examines the Wan2.1 video diffusion model, identifies its scalability bottlenecks for long and real‑time video generation, and introduces the Self‑Forcing causal framework together with sequence‑parallel and RoPE optimizations that achieve sub‑second latency and up to 1.5× speed‑up on modern GPUs.

GPU Optimizationcausal inferencelarge video generation
0 likes · 14 min read
Self-Forcing: Turning Global Video Diffusion into Causal Streaming for Long-Form Generation
Bilibili Tech
Bilibili Tech
Jan 28, 2026 · Artificial Intelligence

Boosting Video Generation Inference: Full Graph Compilation with torch.compile

This article examines the challenges of optimizing video generation model inference, moving from operator-level tweaks to full-graph compilation using torch.compile, and details systematic strategies to eliminate Graph Breaks, handle dynamic shapes, KV-Cache indexing, and Python-side caches, achieving a 47.6% speedup on a 14B model without accuracy loss.

AIInference AccelerationVideo Generation
0 likes · 14 min read
Boosting Video Generation Inference: Full Graph Compilation with torch.compile
Bilibili Tech
Bilibili Tech
Dec 19, 2025 · Artificial Intelligence

SABER: Switchable and Balanced Training for Efficient LLM Reasoning

SABER introduces a reinforcement‑learning framework that lets large language models dynamically switch among four token‑budgeted reasoning modes, dramatically cutting inference length while preserving or improving accuracy across math, code, and logic tasks.

Budgeted ComputationEfficient ReasoningLLM
0 likes · 13 min read
SABER: Switchable and Balanced Training for Efficient LLM Reasoning
Bilibili Tech
Bilibili Tech
Dec 12, 2025 · Artificial Intelligence

Turning a Simple JS Function into a Cross‑Platform AI Tool with MCP

This article details how we built an AI‑tool ecosystem by evolving a basic online JS cloud‑function platform into a unified, reusable capability layer that integrates with Flowise, LangChain StructuredTool, and the Model Context Protocol (MCP) to provide secure, cross‑platform tool calls for agents.

AI toolsLangChainMCP
0 likes · 20 min read
Turning a Simple JS Function into a Cross‑Platform AI Tool with MCP
Bilibili Tech
Bilibili Tech
Nov 28, 2025 · Artificial Intelligence

How We Built an LLM‑Powered AI Hub to Read and Analyze Community Chats

This article details the design and deployment of a multi‑layer LLM system that automatically reads massive creator group chats, extracts structured insights, mitigates hallucinations with dual‑model verification, uses few‑shot prompting for stable output, and delivers real‑time risk alerts and operational reports.

AI OperationsFew‑Shot LearningLLM
0 likes · 14 min read
How We Built an LLM‑Powered AI Hub to Read and Analyze Community Chats
Bilibili Tech
Bilibili Tech
Nov 27, 2025 · Artificial Intelligence

Mastering Agentic Systems with Blades: Concepts, Code, and Workflow Patterns

This article explains what an AI Agent is, distinguishes it from traditional workflows, and demonstrates how to build and customize agents using the Go‑based Blades framework, covering core concepts, code examples, five workflow patterns, best‑practice guidelines, and reference resources.

AIAgentBlades
0 likes · 11 min read
Mastering Agentic Systems with Blades: Concepts, Code, and Workflow Patterns
Bilibili Tech
Bilibili Tech
Nov 21, 2025 · Backend Development

How Bilibili Scaled Its Private Messaging System to Handle 10× Traffic

This article analyzes the current bottlenecks of Bilibili's private messaging service, explains the technical challenges of massive data volume and traffic spikes, and presents a comprehensive multi‑layer architecture upgrade—including cache strategies, BFF refactoring, database sharding, and consistency mechanisms—to ensure scalability and reliability.

BFFConsistencyMessaging
0 likes · 16 min read
How Bilibili Scaled Its Private Messaging System to Handle 10× Traffic