Databases 33 min read

Design and Application of High‑Performance Database Clusters: Read/Write Separation, Sharding, and NoSQL

This article explains the principles and practical designs of high‑performance database clusters, covering read/write separation, master‑slave replication lag, distribution mechanisms, sharding (both vertical and horizontal), business‑level partitioning, and the role of NoSQL technologies such as key‑value stores, document databases, columnar databases, and full‑text search engines.

Top Architecture Tech Stack

Nov 20, 2023

Design and Application of High‑Performance Database Clusters: Read/Write Separation, Sharding, and NoSQL

Read/Write Separation Principle

The basic idea of read/write separation is to split database read and write operations onto different nodes, typically a master‑slave cluster where the master handles writes and the slaves handle reads. The master replicates data to each slave, and application servers direct write requests to the master and read requests to the slaves.

Key points to note: the terminology "master‑slave" differs from "master‑backup"; slaves provide read access while backups only provide redundancy.

Replication Lag

In MySQL, master‑slave replication delay can reach seconds or even minutes under heavy load. If an application reads immediately after a write, it may hit a stale slave and see outdated data (e.g., a newly registered user cannot log in because the read goes to a lagging slave).

Common mitigation methods:

Send post‑write reads to the master.

Retry on slave failure by reading from the master ("second read").

Direct critical read/write paths to the master while using separation for non‑critical operations.

Distribution Mechanism

Two main ways to route read/write traffic:

Programmatic encapsulation : Implement a data‑access layer in code (e.g., using Hibernate) that decides which datasource to use.

Middleware encapsulation : Deploy a separate middleware (e.g., Taobao TDDL, MySQL Router, 360 Atlas) that presents a unified SQL endpoint and internally routes queries to the appropriate master or slave.

Business‑Level Partitioning (Sharding)

When data volume reaches tens of millions of rows, a single database becomes a bottleneck. Business partitioning splits data by functional modules (e.g., user, product, order) onto separate databases, but introduces join and transaction challenges.

Vertical Sharding

Moves rarely used, large columns to a separate table, reducing the amount of data scanned for common queries.

Horizontal Sharding

Splits rows of a single table across multiple tables/databases. Routing strategies include range routing, hash routing, and configuration‑table routing, each with trade‑offs in scalability, uniformity, and maintenance complexity.

Implementation Methods

Both read/write separation and sharding can be realized via programmatic encapsulation or middleware. Middleware must parse SQL to identify target tables, routing rules, and handle operations like COUNT(), ORDER BY, and GROUP BY across shards.

High‑Performance NoSQL

Relational databases have limitations (rigid schema, high I/O for large data, weak full‑text search). NoSQL solutions complement SQL by addressing specific pain points.

Key‑Value Store

Redis provides fast in‑memory structures (string, hash, list, set, sorted set, bitmap, hyperloglog). Example list operations:

LPOP key   // pop leftmost element
LINDEX key index   // get element by index
LLEN key   // get list length
RPOP key   // pop rightmost element

Document Database

MongoDB stores schema‑less JSON/BSON documents, allowing flexible fields and nested structures. Example document:

{
  "id": 10000,
  "name": "James",
  "sex": "male",
  "hobbies": ["football", "playing", "singing"],
  "email": "[email protected]",
  "address": {
    "province": "GuangDong",
    "city": "GuangZhou",
    "district": "Tianhe",
    "detail": "PingYun Road 163"
  },
  "education": [
    {"begin": "2000-09-01", "end": "2004-07-01", "school": "UESTC", "major": "Computer Science & Technology"},
    {"begin": "2004-09-01", "end": "2007-07-01", "school": "SCUT", "major": "Computer Science & Technology"}
  ]
}

Columnar Database

Column‑oriented stores (e.g., HBase) excel at analytical workloads that read a few columns from massive tables, offering higher compression and lower I/O than row‑oriented databases.

Full‑Text Search Engine

Elasticsearch builds inverted indexes on JSON documents, enabling fast keyword and fuzzy searches that relational indexes cannot handle efficiently.

Conclusion

The article covered read/write separation, its latency and routing complexities, business‑level sharding (both vertical and horizontal), and introduced NoSQL technologies that address relational database shortcomings. Properly combining these techniques can build scalable, high‑performance data architectures.

Original Source

Signed-in readers can open the original source through BestHub's protected redirect.

Republication Notice

This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactand we will review it promptly.

Performance Database Sharding Read‑Write Separation NoSQL

Written by

Top Architecture Tech Stack

Sharing Java and Python tech insights, with occasional practical development tool tips.

0 followers

Reader feedback

How this landed with the community

Rate this article

Was this worth your time?

Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.