Tagged articles
4 articles
Page 1 of 1
Bilibili Tech
Bilibili Tech
Aug 13, 2024 · Big Data

How Bilibili Re‑engineered Its Search Indexing with Distributed Storage and Spark

This article details Bilibili's transformation of its search offline indexing pipeline, moving from manual MySQL‑based processes to a high‑capacity, distributed KV store and Spark‑driven builds, addressing performance, maintenance, and scalability challenges while improving resource efficiency and iteration speed.

Big DataBilibiliKV Store
0 likes · 24 min read
How Bilibili Re‑engineered Its Search Indexing with Distributed Storage and Spark
Alibaba Cloud Developer
Alibaba Cloud Developer
Jun 16, 2020 · Databases

How Youku Scales Billions of Video Nodes with Real‑Time Graph Databases

Facing billions of video entities and edges, Youku’s engineering team replaced traditional relational stores with a graph‑based knowledge platform, leveraging Alibaba’s Blink streaming engine and Lindorm to enable real‑time, incremental updates, unified UDF logic, and scalable feature computation for search and recommendation.

Big DataGraph DatabaseReal-time Streaming
0 likes · 10 min read
How Youku Scales Billions of Video Nodes with Real‑Time Graph Databases
Java Backend Technology
Java Backend Technology
May 12, 2019 · Big Data

How 58.com Scales 10 B Posts with 10 K Attributes: Architecture Secrets

58.com tackles the challenge of storing and searching billions of heterogeneous posts by employing a unified post center, a category‑attribute service, and an external search engine, using vertical table splitting, JSON‑based extensible fields, compressed keys, and horizontally sharded indexes to achieve massive scalability and high throughput.

Service Architecturehorizontal scalinglarge-scale storage
0 likes · 12 min read
How 58.com Scales 10 B Posts with 10 K Attributes: Architecture Secrets
Qunar Tech Salon
Qunar Tech Salon
Feb 16, 2016 · Backend Development

Evolution and Scaling Strategies for Large Websites: Architecture, Session Management, and Database Optimization

The article reviews the evolution of large‑scale website architecture, explaining how business complexity, multi‑server deployment, session handling, load balancing, database read/write separation, caching, and search indexing together address availability, concurrency, and performance challenges in modern web systems.

Read-Write SeparationSession Managementcaching
0 likes · 16 min read
Evolution and Scaling Strategies for Large Websites: Architecture, Session Management, and Database Optimization