Tagged articles

search indexing

4 articles · Page 1 of 1

Aug 13, 2024 · Big Data

How Bilibili Re‑engineered Its Search Indexing with Distributed Storage and Spark

This article details Bilibili's transformation of its search offline indexing pipeline, moving from manual MySQL‑based processes to a high‑capacity, distributed KV store and Spark‑driven builds, addressing performance, maintenance, and scalability challenges while improving resource efficiency and iteration speed.

Big DataBilibiliDistributed storage

0 likes · 24 min read

How Bilibili Re‑engineered Its Search Indexing with Distributed Storage and Spark

Alibaba Cloud Developer

Jun 16, 2020 · Databases

How Youku Scales Billions of Video Nodes with Real‑Time Graph Databases

Facing billions of video entities and edges, Youku’s engineering team replaced traditional relational stores with a graph‑based knowledge platform, leveraging Alibaba’s Blink streaming engine and Lindorm to enable real‑time, incremental updates, unified UDF logic, and scalable feature computation for search and recommendation.

Big DataReal-time Streaminggraph database

0 likes · 10 min read

How Youku Scales Billions of Video Nodes with Real‑Time Graph Databases

Java Backend Technology

May 12, 2019 · Big Data

How 58.com Scales 10 B Posts with 10 K Attributes: Architecture Secrets

58.com tackles the challenge of storing and searching billions of heterogeneous posts by employing a unified post center, a category‑attribute service, and an external search engine, using vertical table splitting, JSON‑based extensible fields, compressed keys, and horizontally sharded indexes to achieve massive scalability and high throughput.

Service Architecturehorizontal scalinglarge-scale storage

0 likes · 12 min read

How 58.com Scales 10 B Posts with 10 K Attributes: Architecture Secrets

Qunar Tech Salon

Feb 16, 2016 · Backend Development

Evolution and Scaling Strategies for Large Websites: Architecture, Session Management, and Database Optimization

The article reviews the evolution of large‑scale website architecture, explaining how business complexity, multi‑server deployment, session handling, load balancing, database read/write separation, caching, and search indexing together address availability, concurrency, and performance challenges in modern web systems.

CachingRead‑Write Separationdatabase scaling

0 likes · 16 min read

Evolution and Scaling Strategies for Large Websites: Architecture, Session Management, and Database Optimization

search indexing

How Bilibili Re‑engineered Its Search Indexing with Distributed Storage and Spark

How Youku Scales Billions of Video Nodes with Real‑Time Graph Databases

How 58.com Scales 10 B Posts with 10 K Attributes: Architecture Secrets

Evolution and Scaling Strategies for Large Websites: Architecture, Session Management, and Database Optimization

How 58.com Scales 10 B Posts with 10 K Attributes: Architecture Secrets