Tagged articles
1414 articles
Page 8 of 15
IT Architects Alliance
IT Architects Alliance
Jun 20, 2021 · Databases

Master‑Slave Replication Pitfalls and Deep Dive into Redis Sentinel

This article examines the limitations of Redis master‑slave replication, such as manual failover and single‑node bottlenecks, and provides an in‑depth exploration of Redis Sentinel’s architecture, configuration parameters, detection mechanisms, automatic failover process, and best‑practice recommendations for achieving high availability.

Replicationdatabasefailover
0 likes · 11 min read
Master‑Slave Replication Pitfalls and Deep Dive into Redis Sentinel
Programmer DD
Programmer DD
Jun 19, 2021 · Databases

How Redis Read/Write Separation Boosts Performance and Cuts Costs

This article explains how Redis read/write separation introduces a read‑only replica role, compares star and chain replication architectures, and outlines the transparent compatibility, high availability, and performance benefits that help users reduce costs while handling read‑heavy workloads.

Database ArchitectureRead-Write SeparationReplication
0 likes · 9 min read
How Redis Read/Write Separation Boosts Performance and Cuts Costs
ITFLY8 Architecture Home
ITFLY8 Architecture Home
Jun 18, 2021 · Fundamentals

What Makes Distributed File Systems Tick? Design Principles and Architecture Explained

This article examines the core concepts, requirements, architectural models, persistence strategies, scalability, high‑availability mechanisms, performance optimizations, security models, and practical considerations of distributed file systems such as HDFS, GFS, and Ceph, offering a comprehensive guide for engineers and researchers.

Data ConsistencyDistributed File SystemScalability
0 likes · 21 min read
What Makes Distributed File Systems Tick? Design Principles and Architecture Explained
IT Architects Alliance
IT Architects Alliance
Jun 17, 2021 · Operations

What Makes a System Highly Available? 6 Proven Architecture Patterns

This article explains the essential characteristics of high‑availability architectures—master‑slave failover, load balancing, and horizontal scaling—and reviews six practical solutions including LVS + Keepalive, NGINX, Zookeeper, client‑side strategies, service‑level replication, and middleware approaches.

LVSNginxZooKeeper
0 likes · 7 min read
What Makes a System Highly Available? 6 Proven Architecture Patterns
Volcano Engine Developer Services
Volcano Engine Developer Services
Jun 16, 2021 · Backend Development

How ByteDance’s Video Processing Platform Achieves Billion‑Scale High Availability

This article explains how ByteDance’s Volcano Engine video platform handles the entire video lifecycle—from client‑side capture to cloud processing, delivery, and playback—by employing a multi‑plane architecture, scalable workflow system, function compute platform, and the dynamic BMF framework to meet massive scale, ensure high availability, improve user experience, and reduce costs.

Function ComputeVideo processinghigh availability
0 likes · 19 min read
How ByteDance’s Video Processing Platform Achieves Billion‑Scale High Availability
Aikesheng Open Source Community
Aikesheng Open Source Community
Jun 16, 2021 · Databases

High‑Availability Strategies for Redis: Sentinel Service Discovery, VIP Binding, Keepalived VRRP, and Proxy Approaches

The article explains various Redis high‑availability solutions—including Sentinel service discovery, VIP binding, keepalived VRRP, and proxy middleware—for both master‑slave and cluster architectures, and shows how clients can reliably locate the current master node after failover.

JedisProxyhigh availability
0 likes · 7 min read
High‑Availability Strategies for Redis: Sentinel Service Discovery, VIP Binding, Keepalived VRRP, and Proxy Approaches
21CTO
21CTO
Jun 11, 2021 · Cloud Native

How Meituan’s Shepherd API Gateway Boosts Efficiency and Reliability

This article introduces Meituan’s Shepherd unified API gateway, covering its background, key technical designs—including architecture, high‑availability, usability, and extensibility—and outlines future cloud‑native evolution plans, offering insights for developers and operations engineers.

Cloud NativeMicroservicesService Orchestration
0 likes · 21 min read
How Meituan’s Shepherd API Gateway Boosts Efficiency and Reliability
IT Architects Alliance
IT Architects Alliance
Jun 10, 2021 · Cloud Native

Designing High‑Availability Stateless Services: Load Balancing, Scaling, and Deployment Strategies

This article explains how to achieve high availability for stateless services by employing redundancy, vertical and horizontal scaling, various load‑balancing algorithms (random, round‑robin, weighted, least‑connections, source‑hash), and automatic scaling techniques in cloud‑native environments, while also covering performance monitoring and CDN/OSS usage.

Auto ScalingCloud NativeVertical Scaling
0 likes · 10 min read
Designing High‑Availability Stateless Services: Load Balancing, Scaling, and Deployment Strategies
Qingyun Technology Community
Qingyun Technology Community
Jun 7, 2021 · Databases

Why Xenon Is the Modern MySQL HA Alternative to MHA

This article compares the legacy MHA solution with the modern open‑source Xenon tool, explaining Xenon's Raft‑based architecture, automatic leader election, GTID integration, and enterprise‑grade features that make it a superior high‑availability option for MySQL deployments.

GTIDMHARaft
0 likes · 7 min read
Why Xenon Is the Modern MySQL HA Alternative to MHA
IT Architects Alliance
IT Architects Alliance
Jun 6, 2021 · Fundamentals

Design Principles and Architecture of Distributed File Systems

This article provides a comprehensive overview of distributed file systems, covering their historical evolution, essential requirements, architectural models with and without central nodes, persistence strategies, scalability, high availability, performance optimization, security mechanisms, and additional considerations such as space allocation, file deletion, small‑file handling, and deduplication.

ConsistencyDistributed File SystemScalability
0 likes · 21 min read
Design Principles and Architecture of Distributed File Systems
IT Architects Alliance
IT Architects Alliance
Jun 1, 2021 · Backend Development

Design and Implementation of Zhihu's High‑Throughput Read Service Using TiDB

This article presents a detailed case study of Zhihu's read‑service architecture, describing its high‑availability, high‑performance, and easy‑scalability design goals, key components such as Proxy, Cache and TiDB storage, performance metrics, and the migration from MySQL to TiDB with practical lessons learned.

Backend ArchitectureTiDBhigh availability
0 likes · 18 min read
Design and Implementation of Zhihu's High‑Throughput Read Service Using TiDB
ITFLY8 Architecture Home
ITFLY8 Architecture Home
May 31, 2021 · Backend Development

How Zhihu Scales Read Filtering: A Deep Dive into High‑Performance Backend Architecture

This article explains how Zhihu built a highly available, low‑latency read‑filtering service for its homepage, detailing the system’s design goals, architecture components such as proxy, cache and storage, massive data scale, migration from MySQL to TiDB, and performance results after adopting TiDB 3.0.

BackendScalabilityTiDB
0 likes · 21 min read
How Zhihu Scales Read Filtering: A Deep Dive into High‑Performance Backend Architecture
ITFLY8 Architecture Home
ITFLY8 Architecture Home
May 30, 2021 · Backend Development

Master Nginx: Reverse Proxy, Load Balancing, and High‑Availability Guide

This comprehensive guide explains Nginx’s core concepts—including reverse and forward proxy, load balancing, static‑dynamic separation, common commands, configuration blocks, and high‑availability setup with keepalived—providing step‑by‑step examples, diagrams, and practical tips for deploying scalable, high‑performance web services.

BackendConfigurationhigh availability
0 likes · 12 min read
Master Nginx: Reverse Proxy, Load Balancing, and High‑Availability Guide
Full-Stack Internet Architecture
Full-Stack Internet Architecture
May 29, 2021 · Backend Development

Peer-to-Peer Decentralized Architecture and High‑Availability Configuration for Eureka Server Cluster

This article explains how to build a peer‑to‑peer, decentralized Eureka service‑registry cluster, covering the underlying architecture, CAP trade‑offs, Maven and YAML configurations for multiple peers, client load‑balancing, and a step‑by‑step test deployment.

Spring Cloudbackend-developmenthigh availability
0 likes · 9 min read
Peer-to-Peer Decentralized Architecture and High‑Availability Configuration for Eureka Server Cluster
Aikesheng Open Source Community
Aikesheng Open Source Community
May 28, 2021 · Databases

Comprehensive MySQL Inspection Checklist and Command Reference

This guide presents a detailed MySQL inspection checklist covering operating‑system metrics, critical MySQL parameters, status queries, replication health, high‑availability components, and useful SQL scripts, enabling DBAs to efficiently monitor performance, detect issues, and maintain reliable database services.

Replicationhigh availabilityinspection
0 likes · 11 min read
Comprehensive MySQL Inspection Checklist and Command Reference
Efficient Ops
Efficient Ops
May 27, 2021 · Big Data

Why a Kafka Broker Crash Can Halt All Consumers – The Hidden Offset Pitfall

This article explains Kafka’s high‑availability design, including multi‑replica redundancy, ISR synchronization, leader election, and the critical role of the __consumer_offset internal topic, showing why a single broker failure can render the whole cluster unusable and how to configure replication factors and ack settings to prevent it.

Consumer OffsetISRReplication
0 likes · 9 min read
Why a Kafka Broker Crash Can Halt All Consumers – The Hidden Offset Pitfall
Efficient Ops
Efficient Ops
May 25, 2021 · Operations

Master HAProxy: From Basics to High-Availability L7/L4 Load Balancing

This comprehensive guide explains what HAProxy is, its core functions and key features, then walks through installing, configuring, and running both L7 and L4 load balancers on Linux, adding logging, log rotation, monitoring, and achieving high availability with Keepalived.

ConfigurationHAProxyL4
0 likes · 30 min read
Master HAProxy: From Basics to High-Availability L7/L4 Load Balancing
ITFLY8 Architecture Home
ITFLY8 Architecture Home
May 22, 2021 · Operations

How Active‑Active Data Centers Boost Resilience and Resource Efficiency

The article explains hot standby, cold standby, and active‑active (dual‑active) data center architectures, compares their advantages and drawbacks, outlines deployment challenges, and highlights the role of cloud computing and automation in achieving high availability and optimal resource utilization.

Active-Activecloud computingdisaster recovery
0 likes · 12 min read
How Active‑Active Data Centers Boost Resilience and Resource Efficiency
Meituan Technology Team
Meituan Technology Team
May 20, 2021 · Cloud Native

How Meituan’s Shepherd API Gateway Achieves High‑Performance, Cloud‑Native Scalability

This article examines the background, design principles, high‑availability techniques, extensibility, and future roadmap of Meituan’s internally built Shepherd API gateway, showing how it streamlines microservice API management, improves developer productivity, and evolves toward a cloud‑native architecture.

Cloud NativeMicroservicesService Orchestration
0 likes · 23 min read
How Meituan’s Shepherd API Gateway Achieves High‑Performance, Cloud‑Native Scalability
ITPUB
ITPUB
May 19, 2021 · Databases

Mastering SQL Server AlwaysOn: Enterprise‑Ready High Availability Architecture

This article explains SQL Server's evolution from legacy high‑availability solutions to the modern AlwaysOn architecture, detailing its data‑synchronization process, synchronous and asynchronous commit modes, failover scenarios, and practical deployment recommendations for enterprises handling both moderate and terabyte‑scale workloads.

AlwaysOnDatabase ReplicationSQL Server
0 likes · 8 min read
Mastering SQL Server AlwaysOn: Enterprise‑Ready High Availability Architecture
Big Data Technology & Architecture
Big Data Technology & Architecture
May 16, 2021 · Big Data

Kuaishou's Data Service Platform: Architecture, Key Technologies, and Future Directions

This article introduces Kuaishou's data service platform, outlining the background challenges of data development, the platform's architecture and service models, key technologies such as configuration‑driven development, multi‑mode APIs, data acceleration, high‑availability mechanisms, and discusses its current performance and future development directions.

Data AccelerationService Architecturehigh availability
0 likes · 11 min read
Kuaishou's Data Service Platform: Architecture, Key Technologies, and Future Directions
Architects' Tech Alliance
Architects' Tech Alliance
May 16, 2021 · Operations

Interconnection Solutions for Distributed Data Centers: Storage, Layer‑2, and Layer‑3 Networks

This article examines distributed data center deployment, detailing three interconnection approaches—storage network, layer‑2 network, and layer‑3 network—along with the technical requirements and challenges of VM cross‑center migration and high‑availability server clusters in modern enterprise environments.

distributed data centerhigh availabilitylayer2 interconnect
0 likes · 12 min read
Interconnection Solutions for Distributed Data Centers: Storage, Layer‑2, and Layer‑3 Networks
Architecture Digest
Architecture Digest
May 15, 2021 · Backend Development

Design and Migration of Zhihu's Read Service: High Availability, Performance, and TiDB Adoption

This article details Zhihu's read‑service architecture, covering its business requirements, high‑availability and high‑performance design goals, key components such as Proxy, Cache and Storage, extensive performance metrics, the migration from MySQL to TiDB, and the benefits brought by TiDB 3.0 features.

Backend ArchitectureScalabilityTiDB
0 likes · 18 min read
Design and Migration of Zhihu's Read Service: High Availability, Performance, and TiDB Adoption
Architects' Tech Alliance
Architects' Tech Alliance
May 14, 2021 · Industry Insights

Why Distributed Active/Active Data Centers Are the Future of Enterprise IT

The article examines how enterprises are moving from traditional primary‑backup and two‑site‑three‑center architectures toward distributed active/active data centers, outlining the concepts of distribution and multi‑activity, the technical challenges involved, and the operational benefits of higher availability and resource efficiency.

Active-ActiveIT Operationscloud computing
0 likes · 9 min read
Why Distributed Active/Active Data Centers Are the Future of Enterprise IT
JD Retail Technology
JD Retail Technology
May 13, 2021 · Big Data

Evolution and Architecture of JD.com Self‑Operated Rebate Platform

The article details the development, challenges, and redesign of JD.com’s self‑operated rebate system, describing its early monolithic architecture, data‑intensive processing pipeline, migration to a modular, high‑availability platform built on Spark, Hive, and Elasticsearch, and the resulting performance and operational improvements.

Big DataETLSpark
0 likes · 16 min read
Evolution and Architecture of JD.com Self‑Operated Rebate Platform
Full-Stack Internet Architecture
Full-Stack Internet Architecture
May 13, 2021 · Databases

Database High‑Availability Architectures: Master‑Slave, Master‑Master, and Automatic Failover

This article explains common database high‑availability designs—including master‑slave, master‑master, and automatic failover architectures—their topologies, advantages, disadvantages, and practical considerations such as replication lag, manual intervention, and data consistency challenges.

Master‑SlaveReplicationdatabase
0 likes · 7 min read
Database High‑Availability Architectures: Master‑Slave, Master‑Master, and Automatic Failover
IT Architects Alliance
IT Architects Alliance
May 12, 2021 · Operations

Designing High‑Availability, High‑Performance, Scalable and Secure Web Application Architecture

This article explains how to build a highly available, high‑performance, easily extensible, scalable and secure web application system by describing the evolution of large‑site architectures, common patterns such as layering, clustering, caching, asynchronous processing, and the core architectural factors of performance, availability, scalability, extensibility and security.

Distributed SystemsScalabilityarchitecture
0 likes · 20 min read
Designing High‑Availability, High‑Performance, Scalable and Secure Web Application Architecture
Full-Stack Internet Architecture
Full-Stack Internet Architecture
May 12, 2021 · Databases

Database Scaling Strategies: From Single Instance to Smooth Horizontal Expansion

The article explains how to evolve a simple single‑instance database into a highly available, horizontally sharded system, detailing offline scaling, online dual‑write migration, and a smooth four‑node expansion process with configuration changes, data migration, and cleanup steps.

Configuration Managementdatabase scalinghigh availability
0 likes · 6 min read
Database Scaling Strategies: From Single Instance to Smooth Horizontal Expansion
dbaplus Community
dbaplus Community
May 10, 2021 · Backend Development

How JD.com’s Order Fulfillment Scales: Data Heterogeneity & Complexity‑Driven Architecture

This talk explains JD.com’s order‑fulfillment workflow, the underlying storage stack of Redis, MySQL and Elasticsearch, the challenges of high‑traffic alert‑sound queries, the adoption of Canal for data heterogeneity, and a 4R‑based complexity‑oriented design methodology for building resilient backend systems.

Backend ArchitectureCanalElasticsearch
0 likes · 18 min read
How JD.com’s Order Fulfillment Scales: Data Heterogeneity & Complexity‑Driven Architecture
Code Ape Tech Column
Code Ape Tech Column
May 10, 2021 · Industry Insights

Why Kafka Beats Redis List: A Deep Dive into Modern Messaging Middleware

This article compares Redis list, Kafka, and Pulsar as messaging middleware, explaining their architectures, strengths, and weaknesses—including queue fundamentals, partitioning, cursor management, consumer groups, high‑availability mechanisms, storage strategies, and consumption models—to help readers choose the right solution for large‑scale systems.

Distributed SystemsKafkaMessaging
0 likes · 30 min read
Why Kafka Beats Redis List: A Deep Dive into Modern Messaging Middleware
HelloTech
HelloTech
May 9, 2021 · Cloud Native

Distributed Message Governance and Microservice High‑Availability Practices

The guide details how to build a distributed message‑governance platform for the Hello mobility service, covering unified SDK design, RocketMQ pitfalls, client and cluster health monitoring, risk mitigation, and a tiered microservice high‑availability architecture that uses circuit‑breaking, rate‑limiting, and pre‑heating to ensure resilient traffic handling.

Distributed SystemsMessage QueueRocketMQ
0 likes · 14 min read
Distributed Message Governance and Microservice High‑Availability Practices
JD Cloud Developers
JD Cloud Developers
May 8, 2021 · Cloud Native

How to Build a High‑Availability Jenkins Distributed Cluster with Kubernetes and LVS

This article explains how to design and implement a highly available Jenkins distributed architecture using master‑slave nodes, label‑based scheduling, containerized deployment on Kubernetes, and LVS load balancing to improve resource utilization, scalability, and fault tolerance for large‑scale CI pipelines.

Distributed BuildJenkinsKubernetes
0 likes · 9 min read
How to Build a High‑Availability Jenkins Distributed Cluster with Kubernetes and LVS
Code Ape Tech Column
Code Ape Tech Column
May 7, 2021 · Industry Insights

How to Design a Scalable, High‑Performance Distributed E‑Commerce Architecture

This article provides a comprehensive technical overview of large‑scale distributed website architecture, covering characteristics, goals, common patterns, high‑performance and high‑availability designs, scalability, extensibility, security, agility, a seven‑layer reference model, and the evolutionary steps of modern e‑commerce systems.

Distributed SystemsMicroservicesScalability
0 likes · 26 min read
How to Design a Scalable, High‑Performance Distributed E‑Commerce Architecture
IT Architects Alliance
IT Architects Alliance
May 6, 2021 · Industry Insights

Designing High‑Performance, Highly‑Available Large‑Scale Web Architectures

This article provides a comprehensive technical overview of large‑scale distributed website architecture, covering characteristics, goals, common patterns, high‑performance and high‑availability designs, scalability, extensibility, security, agility, a seven‑layer reference model, and a detailed e‑commerce case study with practical optimization steps.

Distributed SystemsScalabilitycaching
0 likes · 26 min read
Designing High‑Performance, Highly‑Available Large‑Scale Web Architectures
Architects' Tech Alliance
Architects' Tech Alliance
May 4, 2021 · Backend Development

Technical Summary of Large‑Scale Distributed Website Architecture

This article provides a comprehensive technical overview of large‑scale distributed website architecture, covering characteristics, design goals, architectural patterns, performance, high availability, scalability, extensibility, security, agility, evolution stages, and a detailed e‑commerce case study.

Scalabilitycachinghigh availability
0 likes · 24 min read
Technical Summary of Large‑Scale Distributed Website Architecture
Architect
Architect
May 1, 2021 · Backend Development

Technical Summary of Large‑Scale Distributed Website Architecture

This article provides a comprehensive technical overview of large‑scale distributed website architecture, covering characteristics, design goals, high‑performance, high‑availability, scalability, security, and detailed patterns such as layered design, caching, clustering, load balancing, database sharding, CDN, and service‑oriented decomposition.

Distributed Systemscachinghigh availability
0 likes · 23 min read
Technical Summary of Large‑Scale Distributed Website Architecture
Full-Stack Internet Architecture
Full-Stack Internet Architecture
Apr 24, 2021 · Databases

Deep Dive into Redis Cluster Architecture and Principles

This article provides a comprehensive analysis of Redis Cluster, covering node and slot assignment, command execution, resharding, redirection, fault‑tolerance, gossip communication, scaling strategies, configuration limits, and practical code examples for building and operating a high‑availability sharded Redis deployment.

ClusterGossip Protocolfailover
0 likes · 21 min read
Deep Dive into Redis Cluster Architecture and Principles
ITPUB
ITPUB
Apr 23, 2021 · Backend Development

How to Set Up a High‑Availability RocketMQ Cluster on Two Servers (2‑Master‑2‑Slave Async)

This guide walks through the complete process of installing RocketMQ 4.8.0, configuring a two‑master‑two‑slave asynchronous high‑availability cluster across two Linux servers, setting up the web console, and troubleshooting common pitfalls such as missing JDK, memory limits, firewall rules, and network address issues.

LinuxMessage QueueRocketMQ
0 likes · 20 min read
How to Set Up a High‑Availability RocketMQ Cluster on Two Servers (2‑Master‑2‑Slave Async)
Intelligent Backend & Architecture
Intelligent Backend & Architecture
Apr 23, 2021 · Backend Development

Why Message Queues Are Essential: Benefits, Pitfalls, and Best Practices

This article explains the role of message queues in handling traffic spikes, decoupling services, and ensuring reliability, compares popular MQ solutions such as RabbitMQ, Kafka, RocketMQ and ActiveMQ, and discusses their architectures, advantages, drawbacks, idempotency, ordering, high‑availability and scaling strategies.

IdempotencyKafkaMQ
0 likes · 38 min read
Why Message Queues Are Essential: Benefits, Pitfalls, and Best Practices
ITFLY8 Architecture Home
ITFLY8 Architecture Home
Apr 22, 2021 · Operations

Designing Highly Available Stateless Services: Load Balancing and Scaling Strategies

This article explains how to build highly available stateless services by using redundant deployment, vertical and horizontal scaling, various load‑balancing algorithms, and automatic recovery mechanisms, while also covering monitoring, high‑concurrency identification, and the role of CDN and OSS in resilient architecture.

CDNOSShigh availability
0 likes · 10 min read
Designing Highly Available Stateless Services: Load Balancing and Scaling Strategies
vivo Internet Technology
vivo Internet Technology
Apr 21, 2021 · Operations

System Health Check: Principles and Implementation

System health checks, akin to medical exams, are vital for modern IT infrastructure, using active and passive monitoring, failover strategies, and tools like Spring Boot Actuator to detect hardware, network, load, or software issues, prevent single points of failure, and ensure continuous high‑availability service operation.

Network ReliabilityRocketMQSpring Boot Actuator
0 likes · 12 min read
System Health Check: Principles and Implementation
Full-Stack Internet Architecture
Full-Stack Internet Architecture
Apr 20, 2021 · Backend Development

Understanding Seata: Principles, Deployment, and Real‑World Issues in Distributed Transactions

This article introduces Seata's background, explains the causes of data inconsistency in micro‑service architectures, details Seata's three core roles, transaction flow, isolation levels, common pitfalls with code examples, and provides guidance on high‑availability deployment, multi‑node setups, disaster recovery, and hands‑on demos.

Distributed TransactionsMicroservicesSeata
0 likes · 15 min read
Understanding Seata: Principles, Deployment, and Real‑World Issues in Distributed Transactions
DataFunTalk
DataFunTalk
Apr 20, 2021 · Databases

Meituan's Graph Database Selection and Platform Construction

This article presents Meituan's comprehensive evaluation of open‑source graph databases, the rationale for selecting NebulaGraph, and the design of a high‑availability, high‑throughput graph database platform that supports multi‑hop queries, massive data ingestion, real‑time synchronization, and visualization for various business scenarios.

Graph DatabaseMeituanNebulaGraph
0 likes · 21 min read
Meituan's Graph Database Selection and Platform Construction
Java Interview Crash Guide
Java Interview Crash Guide
Apr 16, 2021 · Databases

Mastering MySQL Replication: Types, Setup, and Performance Tips

An in‑depth guide to MySQL master‑slave replication covering replication modes, deployment prerequisites, core mechanisms, common issues with solutions such as semi‑synchronous and parallel replication, configuration commands, monitoring techniques, and error handling to ensure high availability and performance.

databasehigh availabilitymysql
0 likes · 7 min read
Mastering MySQL Replication: Types, Setup, and Performance Tips
Programmer DD
Programmer DD
Apr 13, 2021 · Big Data

What Makes HDFS the Backbone of Big Data? Overview, Architecture & Key Features

This article provides a comprehensive overview of HDFS—including its design goals, core components, data read/write workflows, high‑availability mechanisms, federation, storage policies, colocation benefits, and practical usage scenarios—explaining why it is the foundational distributed file system for large‑scale data processing.

Big DataFederationHDFS
0 likes · 17 min read
What Makes HDFS the Backbone of Big Data? Overview, Architecture & Key Features
Top Architect
Top Architect
Apr 12, 2021 · Databases

Designing a High‑Availability Redis Service with Sentinel

This article explains how to build a highly available Redis service by analyzing common failure scenarios, evaluating single‑instance, master‑slave with one or multiple Sentinel processes, and ultimately recommending a three‑Sentinel architecture combined with a virtual IP for seamless client usage.

Master‑Slavefailoverhigh availability
0 likes · 11 min read
Designing a High‑Availability Redis Service with Sentinel
Xueersi Online School Tech Team
Xueersi Online School Tech Team
Apr 9, 2021 · Databases

Redis Overview: Data Types, Persistence, High Availability, and Common Cache Issues

This article provides a comprehensive guide to Redis, covering its role as a non‑relational database, detailed data types and commands, persistence mechanisms (RDB and AOF), memory eviction strategies, high‑availability features such as replication, Sentinel and clustering, and solutions to typical cache problems like penetration, avalanche, and hot‑key handling.

NoSQLPersistencecaching
0 likes · 20 min read
Redis Overview: Data Types, Persistence, High Availability, and Common Cache Issues
Java Interview Crash Guide
Java Interview Crash Guide
Apr 6, 2021 · Backend Development

Mastering Redis: From Basics to High Availability in Java Backend

This article walks through what Redis is, its data types, caching usage with Spring Boot, performance advantages, eviction policies, persistence mechanisms, master‑slave replication, Sentinel high‑availability, and practical interview‑style Q&A, providing a comprehensive guide for backend developers.

Spring Bootdatabasehigh availability
0 likes · 33 min read
Mastering Redis: From Basics to High Availability in Java Backend
MaGe Linux Operations
MaGe Linux Operations
Apr 3, 2021 · Operations

Designing a Scalable, High‑Availability Monitoring System with Prometheus & Thanos

This article explores the challenges of building a reliable monitoring platform, compares open‑source solutions such as Elasticsearch, Nagios, Zabbix and Prometheus, and details how to achieve high availability and horizontal scaling using Prometheus, Thanos, sharding, remote‑write, and Kubernetes orchestration.

Thanoshigh availabilityobservability
0 likes · 22 min read
Designing a Scalable, High‑Availability Monitoring System with Prometheus & Thanos
21CTO
21CTO
Apr 2, 2021 · Databases

Choosing the Right Database Architecture: Principles, Schemes, and Consistency Solutions

This article outlines core database architecture principles, compares four common deployment schemes (master‑slave, dual‑master, master‑slave with read/write separation, and hybrid), and presents practical consistency solutions for both primary‑replica and DB‑cache mismatches, followed by personal evolution insights.

Consistencyarchitecturedatabase
0 likes · 11 min read
Choosing the Right Database Architecture: Principles, Schemes, and Consistency Solutions
HelloTech
HelloTech
Apr 2, 2021 · Backend Development

Traffic Governance and Protection: Threshold Configuration

Effective traffic governance and protection rely on properly configuring thresholds—either a fixed global limit that stays constant regardless of node count, or a per‑machine allocation that scales the total capacity as nodes are added or removed—to prevent sudden surges from overwhelming services and ensure high availability.

Cluster Managementbackend-developmenthigh availability
0 likes · 4 min read
Traffic Governance and Protection: Threshold Configuration
Full-Stack Internet Architecture
Full-Stack Internet Architecture
Apr 2, 2021 · Operations

Understanding Redis Sentinel: High‑Availability Mechanism and Automatic Failover

This article explains how Redis Sentinel provides high‑availability for Redis by continuously monitoring master and replica nodes, detecting failures through subjective and objective down states, electing a new master via quorum‑based voting, and notifying clients of the failover using Pub/Sub events.

Replicationfailoverhigh availability
0 likes · 19 min read
Understanding Redis Sentinel: High‑Availability Mechanism and Automatic Failover
ITPUB
ITPUB
Apr 2, 2021 · Databases

Choosing the Right MySQL Architecture: Master‑Standby, Dual‑Master, and Consistency Strategies

This article examines core MySQL architecture principles—high availability, performance, consistency, and scalability—then compares four common deployment patterns, analyzes their trade‑offs, and presents practical consistency solutions for both master‑slave replication and cache synchronization, ending with personal insights on evolution and sharding.

ConsistencyDatabase ArchitectureRead-Write Separation
0 likes · 12 min read
Choosing the Right MySQL Architecture: Master‑Standby, Dual‑Master, and Consistency Strategies
Programmer DD
Programmer DD
Apr 2, 2021 · Databases

3 Proven Paths to Master MySQL Replication and What the New Book Reveals

Discover three effective strategies for mastering a technology, illustrated with MySQL replication, and learn how the new book 'Deep Understanding of MySQL Master‑Slave Principles' combines code analysis, comparative study, and expert insight to guide readers through GTID, binary‑log events, and high‑availability architectures.

GTIDReplicationhigh availability
0 likes · 7 min read
3 Proven Paths to Master MySQL Replication and What the New Book Reveals
Meituan Technology Team
Meituan Technology Team
Apr 1, 2021 · Databases

Meituan's Graph Database Selection and Platform Construction

Meituan evaluated open‑source distributed graph databases against strict latency, scale, and import criteria, selected NebulaGraph for its superior multi‑hop query and bulk‑load performance, and built a four‑layer, highly available platform that ingests petabyte‑scale data in real time, supports diverse business use cases, and provides interactive visualization.

Distributed SystemsGraph DatabaseNebulaGraph
0 likes · 21 min read
Meituan's Graph Database Selection and Platform Construction
Ops Development Stories
Ops Development Stories
Apr 1, 2021 · Operations

Zookeeper Leader Election Explained: Cluster Architecture & Code Walkthrough

This article provides a comprehensive overview of Zookeeper's cluster deployment, explains the four server states, details the leader election process—including initialization, voting, and decision logic—and presents key source code snippets to help developers understand and implement Zookeeper's high‑availability mechanisms.

Cluster ManagementDistributed SystemsZooKeeper
0 likes · 10 min read
Zookeeper Leader Election Explained: Cluster Architecture & Code Walkthrough
Tencent Cloud Developer
Tencent Cloud Developer
Mar 31, 2021 · Backend Development

Design and Implementation of a Distributed Timer Service at Tencent

The article details Tencent’s design and implementation of a distributed timer service—using a multi‑level time‑wheel stored in a KV system, a scheduler daemon, broker, and notifier—to support reliable, low‑latency delayed tasks such as order closure and SMS activation, while addressing isolation, rate limiting, disaster recovery, and delivery guarantees.

KV storageTime Wheeldistributed timer
0 likes · 15 min read
Design and Implementation of a Distributed Timer Service at Tencent
Senior Brother's Insights
Senior Brother's Insights
Mar 29, 2021 · Backend Development

Why Does a Single Kafka Broker Failure Break All Consumers?

A Kafka broker outage can halt consumer consumption despite remaining brokers, due to replication settings, ISR mechanics, and the internal __consumer_offsets topic’s default replication factor, which this article explains and resolves with practical configuration steps.

ACKBackendConsumer Offsets
0 likes · 11 min read
Why Does a Single Kafka Broker Failure Break All Consumers?
Alibaba Cloud Native
Alibaba Cloud Native
Mar 25, 2021 · Cloud Native

How ChaosBlade‑Box Empowers Cloud‑Native High Availability with Chaos Engineering

The article introduces ChaosBlade‑Box, an open‑source cloud‑native chaos‑engineering console that builds on Alibaba’s ChaosBlade tool, explains the high‑availability challenges of cloud‑native systems, details the platform’s design, features, multi‑language support, deployment workflow, example experiments, and future roadmap for resilient architectures.

Cloud NativeKuberneteschaos engineering
0 likes · 12 min read
How ChaosBlade‑Box Empowers Cloud‑Native High Availability with Chaos Engineering
21CTO
21CTO
Mar 24, 2021 · Backend Development

Mastering Backend High Availability: From Cold Backups to Multi‑Active Deployments

This article examines stateful backend services and compares various high‑availability strategies—including cold backup, dual‑machine hot standby, same‑city and cross‑city active‑active, and multi‑active architectures—highlighting their benefits, drawbacks, and practical implementation considerations.

Backend Architecturecold backupdisaster recovery
0 likes · 14 min read
Mastering Backend High Availability: From Cold Backups to Multi‑Active Deployments
Baidu Geek Talk
Baidu Geek Talk
Mar 22, 2021 · Operations

How Baidu Achieved 99.999% Uptime for Its Massive Feed Recommendation System

This article details Baidu's Feed recommendation system architecture, explaining how a combination of dynamic retry scheduling, real‑time stop‑loss mechanisms, multi‑recall frameworks, ranking layer fallbacks, and IDC‑level multi‑master designs collectively ensure five‑nine availability across billions of daily requests.

Distributed SystemsMicroservicesOperations
0 likes · 18 min read
How Baidu Achieved 99.999% Uptime for Its Massive Feed Recommendation System
Open Source Linux
Open Source Linux
Mar 22, 2021 · Backend Development

Master Nginx: Reverse Proxy, Load Balancing, and High‑Availability Made Simple

This guide walks you through Nginx's high‑performance architecture, explains forward and reverse proxy concepts, demonstrates load‑balancing and static‑dynamic separation techniques, shows step‑by‑step installation and common commands, and details a high‑availability setup with Keepalived, all illustrated with practical examples and diagrams.

LinuxNginxWeb server
0 likes · 13 min read
Master Nginx: Reverse Proxy, Load Balancing, and High‑Availability Made Simple
JavaEdge
JavaEdge
Mar 21, 2021 · Databases

From Single MySQL to Cluster: Master‑Slave Replication, High Availability, and Scaling Strategies

This article explains why growing MySQL workloads require moving from a single instance to a clustered architecture, details the mechanics of master‑slave replication, asynchronous, semi‑synchronous and group replication, and evaluates various high‑availability solutions and read‑write splitting techniques.

Group ReplicationMHAOrchestrator
0 likes · 18 min read
From Single MySQL to Cluster: Master‑Slave Replication, High Availability, and Scaling Strategies
Aikesheng Open Source Community
Aikesheng Open Source Community
Mar 15, 2021 · Databases

Improving MHA Network Tolerance: Testing ping_interval and secondary_check_script

This article analyzes how adjusting the MHA ping_interval and enabling secondary_check_script can increase MySQL high‑availability cluster tolerance to network packet loss and corruption, presenting test setups, parameter configurations, log observations, and conclusions on their impact on failover behavior.

Database operationsMHANetwork Tolerance
0 likes · 16 min read
Improving MHA Network Tolerance: Testing ping_interval and secondary_check_script
ITFLY8 Architecture Home
ITFLY8 Architecture Home
Mar 15, 2021 · Operations

How Meituan Scales Instant Delivery with a Distributed Architecture

Meituan's instant logistics platform evolved over five years, adopting distributed, fault‑tolerant systems, AI‑driven optimization, and multi‑IDC strategies to handle massive order volumes, extreme traffic spikes, and stringent real‑time reliability requirements while continuously improving scalability and cost efficiency.

AI OptimizationDistributed SystemsMicroservices
0 likes · 10 min read
How Meituan Scales Instant Delivery with a Distributed Architecture
Full-Stack Internet Architecture
Full-Stack Internet Architecture
Mar 11, 2021 · Databases

Understanding Redis Master‑Slave Replication: Architecture, Setup, and Mechanisms

This article provides a comprehensive guide to Redis high‑availability through master‑slave replication, covering the underlying architecture, step‑by‑step setup, full and partial synchronization processes, buffer management, practical pitfalls, and how the mechanism fits into broader Redis HA solutions.

Full SyncMaster‑SlavePartial Sync
0 likes · 18 min read
Understanding Redis Master‑Slave Replication: Architecture, Setup, and Mechanisms
MaGe Linux Operations
MaGe Linux Operations
Mar 8, 2021 · Operations

How to Build a Highly Available etcd Cluster with SSL Security

This guide explains the fundamentals of etcd, its Raft‑based architecture, cluster planning, secure certificate generation, installation steps, service configuration, and verification commands to deploy a reliable, SSL‑protected etcd cluster for service discovery and configuration management.

ClusterConfiguration ManagementRaft
0 likes · 16 min read
How to Build a Highly Available etcd Cluster with SSL Security
Liangxu Linux
Liangxu Linux
Mar 8, 2021 · Operations

Mastering Nginx: Reverse Proxy, Load Balancing, and High‑Availability Setup

This guide explains Nginx’s core concepts—including forward and reverse proxy, load‑balancing strategies, static‑dynamic separation, installation commands, configuration file anatomy, practical reverse‑proxy and load‑balancing examples, and a Keepalived high‑availability solution—providing step‑by‑step instructions and essential code snippets for reliable backend deployment.

BackendConfigurationNginx
0 likes · 11 min read
Mastering Nginx: Reverse Proxy, Load Balancing, and High‑Availability Setup
NetEase Yanxuan Technology Product Team
NetEase Yanxuan Technology Product Team
Mar 4, 2021 · Backend Development

How We Built a High‑Availability Distributed ID Service with Leaf

This article details the motivation, design choices, architecture, performance optimizations, and operational lessons learned while implementing a distributed ID generation system using Leaf's segment mode to achieve global uniqueness, high availability, and low latency for large‑scale e‑commerce services.

Backend ArchitectureLeafdistributed-id
0 likes · 13 min read
How We Built a High‑Availability Distributed ID Service with Leaf
vivo Internet Technology
vivo Internet Technology
Mar 3, 2021 · Backend Development

Design and Implementation of a Hundred-Million-Level User Center System

The article outlines a microservice‑based user‑center architecture for handling over one hundred million users, detailing gateway, core, and async services, interface design, vertical and horizontal database sharding, token graceful degradation, multi‑layer password security, event‑driven async processing, and comprehensive monitoring for high availability, performance, and security.

Backend ArchitectureMicroservicesSystem Design
0 likes · 14 min read
Design and Implementation of a Hundred-Million-Level User Center System
MaGe Linux Operations
MaGe Linux Operations
Mar 1, 2021 · Backend Development

Mastering High Availability: From Cold Backup to Multi‑Active Architecture

This article examines high‑availability strategies for stateful backend services, covering cold backup, dual‑machine hot standby, same‑city active‑active, and remote multi‑active solutions, while discussing their benefits, trade‑offs, and architectural patterns for resilient distributed systems.

Backend Architectureactive standbycold backup
0 likes · 14 min read
Mastering High Availability: From Cold Backup to Multi‑Active Architecture
Efficient Ops
Efficient Ops
Feb 23, 2021 · Operations

Mastering High Availability: From Cold Backups to Multi‑Region Active‑Active

This article examines backend service high‑availability strategies, comparing cold backups, hot standby, same‑city and cross‑city active‑active designs, and explains the trade‑offs, architectural patterns, and practical considerations for building resilient distributed systems.

active standbycold backupcross‑region active‑active
0 likes · 16 min read
Mastering High Availability: From Cold Backups to Multi‑Region Active‑Active
ITFLY8 Architecture Home
ITFLY8 Architecture Home
Feb 21, 2021 · Databases

How Ant Financial Scales to 540k TPS: Inside LDC Architecture, Unitization, and CAP Analysis

This article explains how Ant Financial’s payment system grew from 20,000 transactions per minute in 2010 to 540,000 TPS in 2019 by adopting logical data centers (LDC), unitized architecture (RZone, GZone, CZone), OceanBase’s Paxos‑based consensus, and sophisticated traffic steering and disaster‑recovery strategies.

CAP theoremDistributed SystemsOceanBase
0 likes · 41 min read
How Ant Financial Scales to 540k TPS: Inside LDC Architecture, Unitization, and CAP Analysis
Architecture Digest
Architecture Digest
Feb 17, 2021 · Operations

Step‑by‑Step Installation and Configuration of a 2‑Master‑2‑Slave Asynchronous RocketMQ Cluster

This article provides a comprehensive guide to installing and configuring Apache RocketMQ, covering its history, core features, high‑availability architectures, detailed port planning, downloading, editing broker configuration files, creating data directories, starting NameServer and brokers, troubleshooting common issues, and deploying the web console for monitoring.

InstallationMessage QueueRocketMQ
0 likes · 18 min read
Step‑by‑Step Installation and Configuration of a 2‑Master‑2‑Slave Asynchronous RocketMQ Cluster
ITFLY8 Architecture Home
ITFLY8 Architecture Home
Feb 2, 2021 · Big Data

How Kuaishou Scaled Data Services with a Config‑Driven Big Data Platform

This article explains how Kuaishou’s data‑service platform tackles high development barriers and repetitive work by introducing a self‑service, configuration‑driven architecture, multi‑mode APIs, efficient data acceleration, and robust high‑availability mechanisms, while outlining its evolution and future roadmap.

Data AccelerationData PlatformService Architecture
0 likes · 14 min read
How Kuaishou Scaled Data Services with a Config‑Driven Big Data Platform
Architecture Digest
Architecture Digest
Feb 1, 2021 · Databases

Comparing NewSQL Distributed Databases with Middleware‑Based Sharding: Advantages, Trade‑offs, and Application Scenarios

This article objectively compares NewSQL distributed databases with traditional middleware‑plus‑sharding solutions, examining their architectural differences, distributed transaction handling, high‑availability mechanisms, scaling and sharding strategies, SQL support, storage engines, and maturity to help practitioners decide which approach best fits their workload and operational constraints.

Database ArchitectureNewSQLScalability
0 likes · 17 min read
Comparing NewSQL Distributed Databases with Middleware‑Based Sharding: Advantages, Trade‑offs, and Application Scenarios
Architecture Digest
Architecture Digest
Jan 28, 2021 · Databases

Practical Guide to Setting Up and Scaling a Redis Cluster (Redis 6.0+)

This article provides a step‑by‑step tutorial on building a Redis Cluster on a single server, covering node configuration, cluster handshaking, slot assignment, master‑slave replication, command routing, failover handling, and practical scaling operations such as adding and removing nodes using redis‑cli.

ClusterRedis CLIdatabase
0 likes · 22 min read
Practical Guide to Setting Up and Scaling a Redis Cluster (Redis 6.0+)
Top Architect
Top Architect
Jan 24, 2021 · Backend Development

Why Use Message Queues? Benefits, Drawbacks, and Design Guidelines

This article explains why message queues are essential for decoupling systems, enabling asynchronous processing and peak‑shaving, outlines their advantages and disadvantages, compares popular MQ products, and provides practical guidance on high availability, reliability, ordering, and architectural design.

DecouplingMessage Queuearchitecture
0 likes · 12 min read
Why Use Message Queues? Benefits, Drawbacks, and Design Guidelines