Tagged articles
16 articles
Page 1 of 1
Mike Chen's Internet Architecture
Mike Chen's Internet Architecture
Aug 13, 2025 · Databases

Mastering Data Sharding: Vertical & Horizontal Splitting Strategies for Scalable Systems

This article explains essential data sharding techniques for large-scale architectures, emphasizing the priority of vertical splitting by business modules, followed by horizontal partitioning of hot tables, and outlines stable sharding rules, hotspot avoidance, and minimizing cross‑shard transactions to ensure optimal performance.

Database designdata shardinghorizontal partitioning
0 likes · 4 min read
Mastering Data Sharding: Vertical & Horizontal Splitting Strategies for Scalable Systems
Cognitive Technology Team
Cognitive Technology Team
Mar 26, 2025 · Game Development

Designing Scalable Game Leaderboards with Redis: Core Requirements, Data Structures, and Architecture

This article analyzes the essential requirements of massive‑scale game leaderboards, explains how Redis sorted sets and hash tables provide fast ranking and lookup, and presents a multi‑layered architecture—including hot‑key sharding, dynamic partitioning, tiered storage, read/write separation, pipeline batching, and hybrid persistence—to achieve real‑time, billion‑user performance.

Game DevelopmentPersistenceScalability
0 likes · 5 min read
Designing Scalable Game Leaderboards with Redis: Core Requirements, Data Structures, and Architecture
MaGe Linux Operations
MaGe Linux Operations
Jan 17, 2025 · Databases

Understanding Redis Cluster: Architecture, Data Distribution, and Fault Tolerance

Redis Cluster provides a scalable, fault‑tolerant distributed Redis solution, explaining why it’s needed, its architecture, virtual slot partitioning, data distribution methods, limitations, smart client optimization, and automatic failover mechanisms, while highlighting key operational considerations for high‑performance deployments.

ClusterVirtual Slotsdata sharding
0 likes · 11 min read
Understanding Redis Cluster: Architecture, Data Distribution, and Fault Tolerance
Architect's Guide
Architect's Guide
Oct 31, 2024 · Databases

Designing an Efficient Pipeline for Importing One Billion Records into MySQL

This article presents a comprehensive engineering guide for importing one billion 1 KB unstructured log records stored in HDFS or S3 into MySQL, covering data sizing, B‑tree limits, batch insertion strategies, storage‑engine choices, sharding, file‑reading techniques, concurrency control, and reliable task coordination using Redis, Redisson, and Zookeeper.

Batch InsertPerformance OptimizationZooKeeper
0 likes · 17 min read
Designing an Efficient Pipeline for Importing One Billion Records into MySQL
dbaplus Community
dbaplus Community
Dec 10, 2023 · Databases

How to Import 1 Billion Records into MySQL in Record Time

This article outlines a comprehensive strategy for rapidly loading one billion 1 KB log records stored in HDFS or S3 into MySQL, covering data partitioning, B‑tree limits, batch inserts, storage‑engine selection, sharding, concurrent write control, efficient file reading, task coordination, fault‑tolerant progress tracking with Redis, and distributed locking with Redisson.

Batch InsertFile Readingdata sharding
0 likes · 20 min read
How to Import 1 Billion Records into MySQL in Record Time
macrozheng
macrozheng
Apr 7, 2023 · Backend Development

Designing Scalable Order Systems: Architecture, ID Strategies, and Distributed Transactions

This article explores the comprehensive design of order business systems, covering background considerations, order lifecycle management, technical solutions such as ID generation, parallel and asynchronous processing, timeout handling, distributed transactions, and data strategies like sharding and synchronization.

Backend ArchitectureDistributed TransactionsID generation
0 likes · 12 min read
Designing Scalable Order Systems: Architecture, ID Strategies, and Distributed Transactions
StarRing Big Data Open Lab
StarRing Big Data Open Lab
Feb 24, 2023 · Big Data

What Makes MPP Databases the Powerhouse Behind Modern Data Analytics?

MPP (Massive Parallel Processing) databases, designed for large‑scale analytical workloads, use distributed, shared‑nothing architectures with multiple control and compute nodes, offering high scalability, diverse data‑sharding strategies, and powerful SQL compatibility, as illustrated by vendors like Teradata, Vertica, Greenplum, and emerging open‑source solutions.

Big DataGreenplumMPP
0 likes · 15 min read
What Makes MPP Databases the Powerhouse Behind Modern Data Analytics?
21CTO
21CTO
Nov 30, 2022 · Big Data

Mastering Data Sharding: Hash, Range, and Consistent Hash Techniques

This article explains core data sharding concepts and models—including hash‑based, range‑based, and consistent hashing—detailing their mappings, routing strategies, scalability considerations, and practical implementation examples for handling massive datasets in distributed systems.

Big DataHashingconsistent hashing
0 likes · 11 min read
Mastering Data Sharding: Hash, Range, and Consistent Hash Techniques
dbaplus Community
dbaplus Community
Oct 15, 2022 · Backend Development

Why Unitized Architecture Is the Key to Scalable Financial IT Systems

The article explains why large financial institutions need a unitized (Set‑based) architecture to improve resource utilization, achieve city‑level disaster recovery, and support massive user traffic, then defines the concept and outlines a four‑step process for building such a system.

data shardingdisaster recoveryfinancial IT
0 likes · 8 min read
Why Unitized Architecture Is the Key to Scalable Financial IT Systems
DeWu Technology
DeWu Technology
Jun 22, 2022 · Databases

Rainbow Bridge: A ShardingSphere‑Based Database Middleware for Scalability and Reliability

To support the rapidly growing DeWu App, the team created Rainbow Bridge—a ShardingSphere‑based middleware that unifies horizontal sharding, read‑write separation, shadow‑DB routing, rate limiting, and active‑active traffic correction, while adding a graphical console, stability enhancements, fine‑grained observability, hybrid JDBC/Proxy switching, and ongoing open‑source contributions.

Database MiddlewarePerformance OptimizationShardingSphere
0 likes · 18 min read
Rainbow Bridge: A ShardingSphere‑Based Database Middleware for Scalability and Reliability
macrozheng
macrozheng
Nov 16, 2021 · Backend Development

Master Enterprise Data Management with MyBatis‑Mate: Sharding, Encryption & Auditing

This article introduces MyBatis‑Mate, an official MyBatis‑Plus extension that provides enterprise‑grade features such as sharding, data auditing, field encryption, sensitive data masking, dictionary binding, dynamic DDL maintenance, and flexible data‑source switching, complete with usage examples and code snippets.

Data AuditingField EncryptionJava
0 likes · 10 min read
Master Enterprise Data Management with MyBatis‑Mate: Sharding, Encryption & Auditing
dbaplus Community
dbaplus Community
Mar 31, 2019 · Databases

Unlocking Distributed Databases with Apache ShardingSphere: Features and JD.com Case Study

This article introduces Apache ShardingSphere’s ecosystem, core sharding and governance capabilities, access endpoints such as Sharding-JDBC, Sharding-Proxy and Sharding-Sidecar, and details JD.com’s real‑world implementation, including data‑sharding strategies, distributed primary keys, hint routing, performance optimizations, and future roadmap.

Apache ShardingSphereDatabase MiddlewareJD.com
0 likes · 18 min read
Unlocking Distributed Databases with Apache ShardingSphere: Features and JD.com Case Study
Architecture Digest
Architecture Digest
Mar 9, 2016 · Databases

Overview of Distributed Database Systems: Concepts, Classification, Features, and Data Sharding

This article provides a comprehensive overview of distributed database systems, covering their evolution from centralized architectures, classifications, key characteristics, advantages and disadvantages, data sharding methods, allocation strategies, architectural models, and management considerations, highlighting the benefits and challenges of distributed data management.

DDBMSDatabase Architecturedata allocation
0 likes · 9 min read
Overview of Distributed Database Systems: Concepts, Classification, Features, and Data Sharding