Tagged articles
1414 articles
Page 10 of 15
Programmer DD
Programmer DD
Aug 7, 2020 · Databases

Database Architecture Guide: Principles, Patterns, and Consistency Solutions

This article outlines core database architecture principles—high availability, performance, scalability, and consistency—examines four common deployment patterns (primary‑standby, dual‑primary, primary‑replica with read/write separation, and hybrid), and presents practical consistency solutions and personal insights for selecting and evolving database designs.

Database ArchitectureRead-Write SeparationScalability
0 likes · 11 min read
Database Architecture Guide: Principles, Patterns, and Consistency Solutions
Xiao Lou's Tech Notes
Xiao Lou's Tech Notes
Aug 2, 2020 · Backend Development

Service Registry Guide: Concepts, Features, and Choosing Zookeeper vs Nacos

This article explains what a service registry is, outlines its essential capabilities such as high availability, horizontal scaling, health checking, routing, and multi‑datacenter support, and compares popular open‑source solutions like Zookeeper and Nacos to help you select the right one for your stack.

Nacoshigh availabilityhorizontal scaling
0 likes · 11 min read
Service Registry Guide: Concepts, Features, and Choosing Zookeeper vs Nacos
Top Architect
Top Architect
Aug 1, 2020 · Cloud Computing

Design Analysis of Netflix’s Cloud‑Based Microservices Architecture

This article examines Netflix’s cloud‑based microservices architecture, detailing its client, backend, CDN components, design goals such as high availability, low latency, scalability, and the trade‑offs, resilience mechanisms, and scalability strategies employed on AWS to support millions of global streaming users.

AWSMicroservicesNetflix
0 likes · 22 min read
Design Analysis of Netflix’s Cloud‑Based Microservices Architecture
Architecture Digest
Architecture Digest
Jul 30, 2020 · Cloud Computing

Design Analysis of Netflix’s Cloud‑Based Microservices Architecture

The article examines Netflix’s migration to AWS and its micro‑service‑based cloud architecture, detailing client, backend, and CDN components, the design goals of high availability, low latency, scalability, and resilience, and how these goals are achieved through services such as EC2, S3, DynamoDB, Cassandra, Zuul, Hystrix, and Open Connect.

AWSNetflixcloud computing
0 likes · 25 min read
Design Analysis of Netflix’s Cloud‑Based Microservices Architecture
Programmer DD
Programmer DD
Jul 30, 2020 · Databases

Understanding Redis Cluster: Sharding, High Availability, and Hash Slot Mechanics

This article explains Redis Cluster’s automatic sharding, built‑in high availability, the role of the cluster bus ports, the limitations of traditional hash algorithms, and how consistent hashing with virtual nodes and the 16384‑slot hash slot mechanism enable efficient data distribution and low‑cost node scaling.

ClusterHash Slotconsistent hashing
0 likes · 3 min read
Understanding Redis Cluster: Sharding, High Availability, and Hash Slot Mechanics
21CTO
21CTO
Jul 29, 2020 · Backend Development

How JD.com Scaled Its Order Search with Elasticsearch: From Chaos to Real‑Time Dual Clusters

This article details how JD.com’s order center migrated from a MySQL‑only design to a high‑performance Elasticsearch cluster, evolving through isolation, replica tuning, master‑slave adjustments, and real‑time dual‑cluster architecture to achieve billions of documents, hundreds of millions of daily queries, and robust fault tolerance.

Backend EngineeringElasticsearchSearch Architecture
0 likes · 12 min read
How JD.com Scaled Its Order Search with Elasticsearch: From Chaos to Real‑Time Dual Clusters
IT Architects Alliance
IT Architects Alliance
Jul 25, 2020 · Fundamentals

What Makes Distributed File Systems Tick? Design Principles and Trade‑offs

This article examines the core concepts, architectural models, scalability, persistence, high availability, performance optimization, and security considerations of distributed file systems, comparing centralized and decentralized designs such as GFS and Ceph to guide future system design decisions.

ConsistencyDistributed File SystemScalability
0 likes · 21 min read
What Makes Distributed File Systems Tick? Design Principles and Trade‑offs
Selected Java Interview Questions
Selected Java Interview Questions
Jul 10, 2020 · Backend Development

Message Queue Interview Questions and Technical Guide

This article provides a comprehensive overview of message queue concepts, covering usage scenarios, advantages, drawbacks, technology selection, high‑availability architectures, duplicate handling, data loss prevention, ordering guarantees, latency management, and design principles, supplemented with interview‑style questions and code examples.

KafkaMQMessage Queue
0 likes · 20 min read
Message Queue Interview Questions and Technical Guide
Efficient Ops
Efficient Ops
Jul 7, 2020 · Information Security

Securing Operations with JD Cloud’s Bastion Host: Design, Challenges & Deployment

This article reviews JD Cloud’s bastion host solution presented at the 2020 Global New‑Generation Software Engineering Summit, detailing the security challenges of modern operations, compliance requirements, design principles, cloud‑native architecture, and real‑world deployments in enterprises and finance.

Bastion HostCloud NativeIdentity Management
0 likes · 15 min read
Securing Operations with JD Cloud’s Bastion Host: Design, Challenges & Deployment
JD Retail Technology
JD Retail Technology
Jul 7, 2020 · Operations

JD.com 618 Shopping Festival Technical Preparation: Resource Optimization, High Availability, and System Reliability

The 2020 JD.com 618 shopping festival showcased a record-breaking sales volume and demonstrated how meticulous resource fine‑tuning, container scaling, high‑fidelity load testing, and cross‑team coordination enabled stable, cost‑effective system performance without adding new compute servers.

618JD.comResource Optimization
0 likes · 7 min read
JD.com 618 Shopping Festival Technical Preparation: Resource Optimization, High Availability, and System Reliability
Architect's Alchemy Furnace
Architect's Alchemy Furnace
Jul 6, 2020 · Databases

Mastering MySQL HA: Step‑by‑Step MHA Setup and Failover Guide

This article walks through preparing a four‑node MySQL environment, configuring password‑less SSH, setting up master‑slave replication with GTID, installing and configuring MHA, testing connectivity, running the manager, verifying automatic failover, and highlighting the key advantages of using MHA for high availability.

Database operationsGTIDMHA
0 likes · 24 min read
Mastering MySQL HA: Step‑by‑Step MHA Setup and Failover Guide
MaGe Linux Operations
MaGe Linux Operations
Jul 4, 2020 · Databases

How to Build a High‑Availability MongoDB Sharded Replica Set Cluster on Ubuntu

This step‑by‑step tutorial shows how to set up a highly available MongoDB cluster with sharding and replica sets on Ubuntu 18.04, covering key concepts, environment preparation, configuration of config servers, shard replica sets, the mongos router, enabling sharding, testing data distribution, and final deployment tips.

Database ClusterMongoDBReplica Set
0 likes · 19 min read
How to Build a High‑Availability MongoDB Sharded Replica Set Cluster on Ubuntu
iQIYI Technical Product Team
iQIYI Technical Product Team
Jul 3, 2020 · Backend Development

Restructuring of Voting Service for 'You Are My Youth 2' to Enhance Scalability and Maintainability

The voting service for 'You Are My Youth 2' was re‑architected using Docker‑based QAE and the Skywalker microservices platform, adding containerized one‑click scaling, cross‑data‑center MySQL/Couchbase/HBase high availability, and Hive/Impala real‑time processing, which doubled performance, cut preparation from 30 days to 12 hours, and incorporated third‑party audit verification.

MicroservicesScalabilityVoting Service
0 likes · 12 min read
Restructuring of Voting Service for 'You Are My Youth 2' to Enhance Scalability and Maintainability
Full-Stack Internet Architecture
Full-Stack Internet Architecture
Jul 3, 2020 · Databases

PostgreSQL Q&A: Use Cases, Oracle Comparison, Features, Migration and Best Practices

This article compiles a series of questions and answers from a recent PostgreSQL live session, covering recommended scenarios, differences with Oracle, backup and recovery, clustering, PostGIS, stored procedures, migration tips, version recommendations, performance tuning, high‑availability, and real‑world adoption across industries.

Oracle migrationdatabasehigh availability
0 likes · 10 min read
PostgreSQL Q&A: Use Cases, Oracle Comparison, Features, Migration and Best Practices
Meituan Technology Team
Meituan Technology Team
Jul 2, 2020 · Databases

Meituan's Million‑Level KV Storage Architecture and Practices: Squirrel (In‑Memory) and Cellar (Persistent)

Meituan’s KV platform evolved from Memcached and Redis to its own high‑throughput in‑memory Squirrel and durable Cellar systems, each partitioning keys into 16 384 slots, using ZooKeeper routing, Kubernetes scaling, intelligent migration, cross‑region Raft replication, multi‑queue latency isolation, and hotspot management, while planning further optimizations such as Redis‑Gossip tuning, kernel‑bypass I/O, and FPGA acceleration.

CellarKV storageMeituan
0 likes · 26 min read
Meituan's Million‑Level KV Storage Architecture and Practices: Squirrel (In‑Memory) and Cellar (Persistent)
Full-Stack Internet Architecture
Full-Stack Internet Architecture
Jun 18, 2020 · Big Data

Kafka Interview Questions: High Availability, Reliability, Consistency, Performance, and Usage Rationale

This article explains common Kafka interview questions by analyzing the system's high‑availability design, reliability mechanisms, consistency model, performance tricks such as sequential writes and zero‑copy, and the reasons for using Kafka and message queues, providing both conceptual insight and practical details.

ConsistencyDistributed SystemsKafka
0 likes · 12 min read
Kafka Interview Questions: High Availability, Reliability, Consistency, Performance, and Usage Rationale
ITPUB
ITPUB
Jun 17, 2020 · Databases

Why Go All‑in on MongoDB? Architecture, HA, Sharding & Schema Design Explained

This article explains why a fast‑growing e‑commerce platform chose MongoDB, covering its high‑availability replica‑set architecture, Raft‑based election algorithm, replica‑set size limits, write‑concern trade‑offs, sharding components and load‑balancing, as well as the flexible document schema with practical code examples.

Database ArchitectureMongoDBhigh availability
0 likes · 13 min read
Why Go All‑in on MongoDB? Architecture, HA, Sharding & Schema Design Explained
Architecture Digest
Architecture Digest
Jun 12, 2020 · Databases

How to Build a Redis Cluster with Docker: Step‑by‑Step Guide

This article provides a comprehensive, step‑by‑step tutorial on creating a Redis Cluster using Docker, covering the preparation of a base Redis image, building node images, configuring cluster settings, assigning hash slots, adding replica nodes for high availability, testing failover, and solving network connectivity issues for production deployments.

ClusterDockerReplication
0 likes · 26 min read
How to Build a Redis Cluster with Docker: Step‑by‑Step Guide
Top Architect
Top Architect
May 29, 2020 · Databases

Redis Scaling Strategies: Partitioning, Master‑Slave Replication, Sentinel, and Cluster

This article explains how to extend Redis beyond single‑node limits by using partitioning, master‑slave replication, Sentinel for automatic failover, and Redis Cluster with hash slots, detailing their usage, advantages, drawbacks, and configuration examples for building high‑availability and scalable in‑memory data stores.

ClusterPartitioningReplication
0 likes · 11 min read
Redis Scaling Strategies: Partitioning, Master‑Slave Replication, Sentinel, and Cluster
Aikesheng Open Source Community
Aikesheng Open Source Community
May 19, 2020 · Databases

DBLE 2.20.04.0 Release Notes and Detailed Analysis

The article presents a comprehensive overview of DBLE version 2.20.04.0, including project background, new features such as larger MySQL packet support and redesigned failover, a full list of enhancements, bug fixes, backward‑compatibility changes, and links to official documentation and source repositories.

DBLEDatabase MiddlewareFeature Updates
0 likes · 11 min read
DBLE 2.20.04.0 Release Notes and Detailed Analysis
JavaEdge
JavaEdge
May 17, 2020 · Databases

Mastering MySQL High Availability: Backup, Binlog, and Replication Strategies

This guide explains how to achieve MySQL high availability and scalability through full and incremental backups, binlog-based point‑in‑time recovery, and both asynchronous and synchronous replication, while highlighting trade‑offs, best practices, and practical command examples.

BackupBinlogReplication
0 likes · 11 min read
Mastering MySQL High Availability: Backup, Binlog, and Replication Strategies
Xiaokun's Architecture Exploration Notes
Xiaokun's Architecture Exploration Notes
May 15, 2020 · Backend Development

Mastering Distributed System Design: Key Principles, Techniques, and Best Practices

This comprehensive guide explains why distributed systems are needed, outlines design goals, explores essential technologies and architectural patterns, and provides practical strategies for scalability, high availability, service governance, DevOps automation, and monitoring to help engineers build robust distributed architectures.

Distributed SystemsScalabilityhigh availability
0 likes · 22 min read
Mastering Distributed System Design: Key Principles, Techniques, and Best Practices
ITPUB
ITPUB
May 11, 2020 · Operations

Scaling JD.com Order Search: Real‑Time Dual‑Cluster Elasticsearch Architecture

This article details how JD.com’s order center evolved its Elasticsearch deployment from a single, default‑configured cluster to a real‑time, dual‑cluster architecture with replica tuning, master‑slave failover, version upgrades, and optimized data synchronization to handle billions of documents and hundreds of millions of daily queries.

Cluster ArchitectureElasticsearchdata synchronization
0 likes · 13 min read
Scaling JD.com Order Search: Real‑Time Dual‑Cluster Elasticsearch Architecture
Tencent Cloud Developer
Tencent Cloud Developer
May 7, 2020 · Databases

TDSQL Horizontal Scaling: Design Principles and Practice for Non‑Disruptive Linear Expansion

TDSQL enables Tencent Meeting’s non‑disruptive, linear horizontal scaling by using a stateless SQL engine, shard‑key selection, two‑phase distributed transactions, and one‑click set expansion, delivering near‑linear performance growth, high availability, and seamless capacity increase for millions of daily users worldwide.

TDSQLcloud video conferencedistributed database
0 likes · 17 min read
TDSQL Horizontal Scaling: Design Principles and Practice for Non‑Disruptive Linear Expansion
ITPUB
ITPUB
Apr 30, 2020 · Databases

10 Essential Redis Tips to Boost Performance and Reliability

This guide presents ten practical Redis best‑practice tips—including avoiding KEYS *, using SCAN, analyzing command stats, leveraging hashes, setting expirations, choosing eviction policies, handling errors, scaling with clusters, understanding threading limits, and configuring Sentinel for high availability—to help you run Redis efficiently and safely.

best practicesdatabaseshigh availability
0 likes · 9 min read
10 Essential Redis Tips to Boost Performance and Reliability
Tencent Cloud Developer
Tencent Cloud Developer
Apr 29, 2020 · Cloud Computing

Large-Scale Task Scheduling Architecture of Tencent Meeting and VStation

The talk explains how Tencent’s self‑developed VStation scheduler, integrated with TKE and using a hybrid sharding‑plus‑master‑worker architecture, enabled Tencent Meeting to scale to over 100 000 hosts and one million CPU cores, cutting provisioning time to under ten seconds while handling thousands of tasks per minute through DAG‑driven automation and fault‑tolerant mechanisms.

Distributed SystemsTencent MeetingVStation
0 likes · 24 min read
Large-Scale Task Scheduling Architecture of Tencent Meeting and VStation
DataFunTalk
DataFunTalk
Apr 29, 2020 · Databases

High Availability Architecture for SQL Server, MySQL, and Redis at Ctrip

This article explains Ctrip's high‑availability designs for SQL Server, MySQL, and Redis, describing multi‑replica strategies, evolution from legacy mirroring to AlwaysOn, multi‑node MHA management, and Redis sentinel‑based failover, while emphasizing fault‑tolerance across data centers.

Database ArchitectureSQL Serverhigh availability
0 likes · 12 min read
High Availability Architecture for SQL Server, MySQL, and Redis at Ctrip
UCloud Tech
UCloud Tech
Apr 28, 2020 · Cloud Native

How We Built a Highly Available Kubernetes Platform for Multi‑Cluster Deployments

This article explains why Kubernetes was chosen, describes the overall architecture, high‑availability master design, multi‑IDC cluster deployment, logging, monitoring, service exposure, image building, lifecycle hooks, CI/CD, multi‑cluster management, encountered challenges, and future plans for operators and automated scaling.

KubernetesMulti-Clusterci/cd
0 likes · 11 min read
How We Built a Highly Available Kubernetes Platform for Multi‑Cluster Deployments
360 Tech Engineering
360 Tech Engineering
Apr 21, 2020 · Backend Development

Using ETCD for Leader Election and High Availability: Architecture, Installation, and Go Implementation

This article explains ETCD's role as a distributed key‑value store, details its architecture and leader election mechanism, provides step‑by‑step cluster deployment on CentOS with systemd, and demonstrates a Go implementation of leader election to achieve high availability.

Distributed SystemsSystemdhigh availability
0 likes · 10 min read
Using ETCD for Leader Election and High Availability: Architecture, Installation, and Go Implementation
Architecture Digest
Architecture Digest
Apr 19, 2020 · Databases

Sharding (Partitioning) vs NewSQL Databases: A Comparative Analysis

This article objectively compares traditional sharding‑plus‑middleware architectures with modern NewSQL distributed databases, examining their design principles, distributed transaction handling, high‑availability mechanisms, scaling and sharding strategies, SQL support, storage engines, and overall maturity to help practitioners choose the most suitable solution for their workloads.

Database ArchitectureNewSQLhigh availability
0 likes · 21 min read
Sharding (Partitioning) vs NewSQL Databases: A Comparative Analysis
iQIYI Technical Product Team
iQIYI Technical Product Team
Apr 17, 2020 · Mobile Development

Building iQIYI's Mobile Middle Platform: Architecture, Decoupling, and SaaS Enablement

iQIYI’s Mobile Middle Platform decouples its multiple apps into a reusable, SaaS‑enabled architecture that centralizes services through the QMAS portal, provides ready‑made scaffolding and cross‑platform frameworks, and ensures high‑availability via comprehensive monitoring and a custom network foundation, dramatically accelerating development and unifying user experience.

Component DecouplingMobile DevelopmentSDK management
0 likes · 13 min read
Building iQIYI's Mobile Middle Platform: Architecture, Decoupling, and SaaS Enablement
Architects' Tech Alliance
Architects' Tech Alliance
Apr 16, 2020 · Databases

Database Application Architecture Principles and Common Schemes

This article outlines core database architecture principles—high availability, high performance, consistency, and scalability—examines four common deployment schemes (primary‑standby, dual‑primary, primary‑replica with read/write separation, and a combined dual‑primary/replica model), and presents multiple consistency and cache‑integration solutions.

Database ArchitectureRead-Write Separationcaching
0 likes · 12 min read
Database Application Architecture Principles and Common Schemes
dbaplus Community
dbaplus Community
Apr 15, 2020 · Operations

How to Diagnose and Fix a Dual‑Leader ZooKeeper Cluster

This article walks through a real‑world ZooKeeper incident where a five‑node cluster showed two leaders, explains the election rules, analyzes log and configuration mismatches, assesses business impact, and provides a step‑by‑step recovery plan to restore normal service without data loss.

ClusterOperationsZooKeeper
0 likes · 10 min read
How to Diagnose and Fix a Dual‑Leader ZooKeeper Cluster
Architects' Tech Alliance
Architects' Tech Alliance
Apr 14, 2020 · Operations

How Server NIC Teaming Eliminates Single Points of Failure in High‑Availability Data Centers

This article explains the principles, configurations, and real‑world testing of server NIC teaming (Team) technology—covering Intel and Broadcom adapters, switch‑independent and switch‑dependent modes, load‑balancing and fault‑tolerance mechanisms, and step‑by‑step setup procedures to ensure continuous network access for mission‑critical securities systems.

BroadcomIT OperationsIntel
0 likes · 28 min read
How Server NIC Teaming Eliminates Single Points of Failure in High‑Availability Data Centers
Top Architect
Top Architect
Apr 14, 2020 · Databases

Designing a High‑Availability Redis Service with Sentinel

This article explains how to build a highly available Redis service by analyzing failure scenarios, comparing single‑instance, master‑slave with one or multiple Sentinel processes, and finally presenting a three‑Sentinel architecture that ensures continuous service despite node or network outages.

architecturefailoverhigh availability
0 likes · 11 min read
Designing a High‑Availability Redis Service with Sentinel
Tencent Cloud Developer
Tencent Cloud Developer
Apr 10, 2020 · Backend Development

High‑Availability Architecture for a Flash‑Sale System in the Weishi Spring Festival Card‑Collect Event

The article details a high‑availability flash‑sale architecture for Weishi’s Spring Festival card‑collect event, describing three design models, a funnel‑style traffic‑filtering approach, product and client strategies, layered rate limiting, sharding, asynchronous order handling, multi‑region DB redundancy, and a three‑level degradation plan to sustain extreme concurrency.

Distributed Systemsflash salehigh availability
0 likes · 9 min read
High‑Availability Architecture for a Flash‑Sale System in the Weishi Spring Festival Card‑Collect Event
21CTO
21CTO
Apr 6, 2020 · Operations

How Alipay Achieved Near‑Zero Downtime with Multi‑Datacenter Failover Architecture

This article explains the evolution of Alipay's high‑availability and disaster‑recovery architecture—from a simple single‑datacenter design to a multi‑datacenter, unit‑based system with failover and blue‑green deployment—highlighting the challenges, solutions, and operational benefits that enable continuous service during massive traffic spikes.

Alipay architectureBlue‑Green deploymentDistributed Systems
0 likes · 17 min read
How Alipay Achieved Near‑Zero Downtime with Multi‑Datacenter Failover Architecture
Java Backend Technology
Java Backend Technology
Apr 4, 2020 · Databases

10 Essential Redis Tips for Faster, Safer Data Management

This article presents ten practical Redis best‑practice tips—including avoiding KEYS *, using INFO commandstats, leveraging hashes, setting key expirations, selecting proper eviction policies, handling errors, scaling with clusters, and ensuring high availability—to help developers optimize performance and reliability of their in‑memory data stores.

best practicescachinghigh availability
0 likes · 9 min read
10 Essential Redis Tips for Faster, Safer Data Management
Cloud Native Technology Community
Cloud Native Technology Community
Mar 30, 2020 · Cloud Native

Building a Cloud‑Native Large‑Scale Distributed Monitoring System with Prometheus

This article explains how to design and implement a cloud‑native, large‑scale distributed monitoring system using Prometheus, covering its limitations, service‑level sharding, centralized storage, federation, and high‑availability strategies to overcome scaling challenges in Kubernetes environments.

Cloud NativeFederationPrometheus
0 likes · 12 min read
Building a Cloud‑Native Large‑Scale Distributed Monitoring System with Prometheus
FunTester
FunTester
Mar 30, 2020 · Operations

How Virtualization Transforms Software Testing: Benefits, Types, and Common Pitfalls

The article explains what virtualization is, outlines its main types, and shows how it enables efficient software testing by consolidating servers, improving disaster recovery, saving time, increasing availability, reducing complexity, and protecting data, while also noting potential driver, memory, and performance issues.

Software TestingVirtualizationdisaster recovery
0 likes · 7 min read
How Virtualization Transforms Software Testing: Benefits, Types, and Common Pitfalls
dbaplus Community
dbaplus Community
Mar 29, 2020 · Databases

Mastering GaussDB T: Planning Single‑Node, HA, and Distributed Architectures

This article outlines a comprehensive plan for deploying GaussDB T, covering the software architecture, component roles, and practical guidelines for single‑node, primary‑standby, and distributed setups, while highlighting common pitfalls and operational considerations for production environments.

Database ArchitectureGaussDBPlanning
0 likes · 13 min read
Mastering GaussDB T: Planning Single‑Node, HA, and Distributed Architectures
Alibaba Cloud Developer
Alibaba Cloud Developer
Mar 27, 2020 · Databases

How OceanBase Delivers Cloud‑Native Distributed Relational Database Performance and Availability

This article explains OceanBase's public‑cloud deployment, its unique architecture without a central controller, horizontal scaling via partition groups, LSM‑Tree storage design, advanced SQL engine features, ACID‑plus‑Availability guarantees, and real‑world performance records, illustrating why it suits high‑availability financial workloads.

Cloud NativeLSM‑TreeOceanBase
0 likes · 12 min read
How OceanBase Delivers Cloud‑Native Distributed Relational Database Performance and Availability
Liangxu Linux
Liangxu Linux
Mar 23, 2020 · Databases

10 Essential Redis Best Practices Every Developer Should Follow

This article presents ten practical Redis best‑practice guidelines—from avoiding the costly KEYS * command and leveraging SCAN, to using INFO for command statistics, employing hashes, setting expirations, selecting eviction policies, handling errors, scaling with clusters, understanding threading limits, and configuring Sentinel for high availability.

best practicescachingdatabase
0 likes · 9 min read
10 Essential Redis Best Practices Every Developer Should Follow
Top Architect
Top Architect
Mar 10, 2020 · Backend Development

Overview of Alipay System Architecture and the Open‑Source Distributed Messaging Middleware Metamorphosis (MetaQ)

This article presents an overview of Alipay’s comprehensive system architecture and introduces MetaQ (Metamorphosis), an open‑source Java‑based distributed messaging middleware, detailing its features, advantages over Kafka, and suitable high‑throughput, reliable messaging scenarios.

AlipayBackendDistributed Messaging
0 likes · 5 min read
Overview of Alipay System Architecture and the Open‑Source Distributed Messaging Middleware Metamorphosis (MetaQ)
MaGe Linux Operations
MaGe Linux Operations
Feb 27, 2020 · Cloud Native

Mastering Harbor: Install, Configure, and Scale a Secure Docker Registry

This guide walks you through Harbor's purpose, core features, architecture components, step‑by‑step installation, testing procedures, and high‑availability setup with Nginx, providing a comprehensive tutorial for building a secure, enterprise‑grade Docker image registry.

Cloud NativeContainer SecurityDocker Registry
0 likes · 11 min read
Mastering Harbor: Install, Configure, and Scale a Secure Docker Registry
Big Data Technology Architecture
Big Data Technology Architecture
Feb 24, 2020 · Operations

Evolution and Optimization of JD.com Order Center Elasticsearch Cluster Architecture

This article details how JD.com’s order center migrated its Elasticsearch cluster through multiple architectural stages—initial deployment, isolation, replica tuning, master‑slave adjustments, and real‑time dual‑cluster backup—while addressing data synchronization, scaling, and performance pitfalls to achieve high availability and query stability.

Cluster ArchitectureElasticsearchJD.com
0 likes · 13 min read
Evolution and Optimization of JD.com Order Center Elasticsearch Cluster Architecture
Alibaba Cloud Developer
Alibaba Cloud Developer
Feb 18, 2020 · Cloud Native

Why Do Your Apps Crash? Alibaba’s High‑Availability Architecture Playbook

This article explains why online applications experience crashes during traffic spikes, outlines the complexity of modern cloud‑based service architectures, and shares Alibaba engineers’ practical notes on high‑availability design, capacity planning, full‑link stress testing, monitoring, traffic control, routine inspections, and chaos‑engineering drills using tools such as AHAS, PTS, Sentinel and Advisor.

Alibaba Cloudcapacity planningchaos engineering
0 likes · 12 min read
Why Do Your Apps Crash? Alibaba’s High‑Availability Architecture Playbook
Tencent Cloud Developer
Tencent Cloud Developer
Feb 13, 2020 · Cloud Computing

Cloud Architecture Design Principles: A Free Live Course by Tencent Cloud University

Tencent Cloud University, together with the TVP team, offers a free live course taught by certified instructor Zhao Baomin that explains why cloud architecture design matters and introduces its core principles—high availability, scalability, security, and low cost—while providing a weekly schedule, interactive gifts for participants, a discussion group, and related reading.

TVPTencent Cloudarchitecture design principles
0 likes · 3 min read
Cloud Architecture Design Principles: A Free Live Course by Tencent Cloud University
Efficient Ops
Efficient Ops
Feb 10, 2020 · Big Data

How Tencent Scales Elasticsearch for Massive Log, Search, and Time‑Series Workloads

Tencent leverages Elasticsearch at massive scale across log analytics, search services, and time‑series monitoring, addressing challenges of high availability, low cost, and high performance through kernel optimizations, resource‑aware throttling, cold‑data merging, rollup, caching, and open‑source contributions.

Cost OptimizationElasticsearchLog Analytics
0 likes · 20 min read
How Tencent Scales Elasticsearch for Massive Log, Search, and Time‑Series Workloads
dbaplus Community
dbaplus Community
Feb 5, 2020 · Databases

Tencent’s Secrets to Scaling Elasticsearch for Trillion‑Level Data

Tencent shares how it leverages Elasticsearch at trillion‑scale across logging, search, and time‑series workloads, detailing the challenges of high availability, low cost, and high performance, and describing concrete kernel‑level optimizations, resource‑limiting strategies, storage tiering, rollup, cache, and merge techniques that enable robust, efficient operation.

Cost OptimizationElasticsearchLarge‑Scale Search
0 likes · 18 min read
Tencent’s Secrets to Scaling Elasticsearch for Trillion‑Level Data
Efficient Ops
Efficient Ops
Feb 4, 2020 · Databases

How Multi‑Active Database Architecture Is Redefining Bank Disaster Recovery

In this interview, a senior database expert from Huaxia Bank shares twelve years of experience and explains how moving from traditional replication to multi‑active, real‑time consistent data centers, combined with automation and mobile remote operations, is transforming banking database reliability and security.

Database operationsdisaster recoveryhigh availability
0 likes · 9 min read
How Multi‑Active Database Architecture Is Redefining Bank Disaster Recovery
dbaplus Community
dbaplus Community
Feb 2, 2020 · Databases

JDHBase Multi‑Active Disaster Recovery: Replication, Auto‑Failover & Consistency

JDHBase, JD.com’s large‑scale KV store, powers billions of daily reads and writes across 7,000 nodes, and this article details its multi‑active, cross‑region architecture—including HBase replication fundamentals, Fox Manager routing, automatic failover policies, dynamic replication tuning, and serial replication to ensure strong consistency.

Database ArchitectureHBaseReplication
0 likes · 15 min read
JDHBase Multi‑Active Disaster Recovery: Replication, Auto‑Failover & Consistency
Big Data Technology & Architecture
Big Data Technology & Architecture
Jan 19, 2020 · Big Data

Tencent's Elasticsearch Practices: Application Scenarios, Challenges, Optimizations, and Future Directions

This article details how Tencent leverages Elasticsearch for log analysis, search services, and time‑series data, outlines the specific challenges faced in high‑availability and cost‑efficiency, and presents the comprehensive optimization techniques and future open‑source contributions that improve performance, scalability, and reliability.

Big DataCost OptimizationElasticsearch
0 likes · 16 min read
Tencent's Elasticsearch Practices: Application Scenarios, Challenges, Optimizations, and Future Directions
Big Data Technology & Architecture
Big Data Technology & Architecture
Jan 16, 2020 · Big Data

Kafka Interview Guide: Core Concepts, Architecture, and Practical Tips

This article compiles essential Kafka interview material, covering its role as a message queue, usage scenarios, architectural components, storage mechanisms, consumer group rebalancing, high‑availability features, replication details, ordering guarantees, producer/consumer client design, topic management, log retention, performance optimizations, and key monitoring metrics.

Big DataDistributed SystemsKafka
0 likes · 16 min read
Kafka Interview Guide: Core Concepts, Architecture, and Practical Tips
Sohu Tech Products
Sohu Tech Products
Jan 8, 2020 · Databases

Understanding Distributed Database Scenarios and Data Partitioning

This article explains the primary use cases for distributed databases, contrasts them with traditional databases, and describes how data partitioning and metadata enable clients to locate data without scanning all nodes, highlighting both external user benefits and internal implementation challenges.

Data PartitioningScalabilitydistributed databases
0 likes · 3 min read
Understanding Distributed Database Scenarios and Data Partitioning
Architects Research Society
Architects Research Society
Jan 8, 2020 · Cloud Native

Running a Kubernetes Cluster Across Multiple Zones

This article explains how Kubernetes 1.2 enables a single cluster to operate across multiple zones within the same cloud provider, detailing automatic zone labeling, pod and volume scheduling behavior, and the key limitations and considerations for multi‑zone deployments.

ClusterKubernetesPersistent Volume
0 likes · 5 min read
Running a Kubernetes Cluster Across Multiple Zones
JD Retail Technology
JD Retail Technology
Jan 6, 2020 · Backend Development

JDHBase Multi‑Active Architecture and Replication Practices

This article describes JDHBase’s large‑scale KV storage deployment, its HBase‑based asynchronous replication mechanism, the multi‑active architecture with active‑standby clusters, client interaction via Fox Manager, automatic failover strategies, dynamic replication tuning, and serial replication techniques to ensure data consistency across data centers.

Cluster ManagementHBaseReplication
0 likes · 13 min read
JDHBase Multi‑Active Architecture and Replication Practices
Ops Development Stories
Ops Development Stories
Dec 27, 2019 · Databases

Step-by-Step Guide to Building a Secure OpenLDAP Server with High Availability

This comprehensive tutorial walks you through preparing the environment, installing and configuring OpenLDAP on CentOS, securing it with TLS/SSL, setting up phpldapadmin, defining ACLs, enabling master‑slave replication, and implementing Keepalived for VIP‑based high availability, all with ready‑to‑use scripts and configuration files.

LDAP setupLinuxOpenLDAP
0 likes · 68 min read
Step-by-Step Guide to Building a Secure OpenLDAP Server with High Availability
Qunar Tech Salon
Qunar Tech Salon
Dec 24, 2019 · Operations

Cache Governance and High‑Availability Practices for Redis in a Hotel Quoting System

The article describes how a hotel quoting team identified recurring Redis‑related outages, designed a comprehensive cache governance plan—including fast recovery, multi‑replica, manual downgrade, and parameter tuning—and executed a structured process that improved system reliability and operational readiness.

Cache Governancehigh availabilityredis
0 likes · 8 min read
Cache Governance and High‑Availability Practices for Redis in a Hotel Quoting System
Cloud Native Technology Community
Cloud Native Technology Community
Dec 23, 2019 · Databases

MongoDB Architect’s Talk on Cloud‑Native Data Challenges and Solutions at CNBPS 2019

In his CNBPS 2019 keynote, MongoDB architect Song Zhiqi discusses how the shift to cloud‑native architectures has amplified data volume and variety, outlines the key challenges such as rapid rollout, downtime, and unstructured data, and explains how MongoDB’s flexible, high‑availability document model addresses these issues while supporting modern micro‑service and Kubernetes ecosystems.

Cloud NativeMongoDBNoSQL
0 likes · 8 min read
MongoDB Architect’s Talk on Cloud‑Native Data Challenges and Solutions at CNBPS 2019
Youku Technology
Youku Technology
Dec 18, 2019 · Operations

Technical Strategies and Achievements of Alibaba Entertainment 2019 Double‑11 Cat Night Live Streaming

Alibaba Entertainment’s 2019 Double‑11 Cat Night live stream combined FPGA‑based H.265 transcoding, smart‑profile QoE adaptation, Dolby Atmos audio and frame‑aligned multi‑view video to deliver ultra‑HD 4K60 streams at 35% lower bandwidth cost, achieving zero failures, zero degradations and over 99.99% service availability.

Dolby AtmosFPGAVideo Encoding
0 likes · 19 min read
Technical Strategies and Achievements of Alibaba Entertainment 2019 Double‑11 Cat Night Live Streaming
ITPUB
ITPUB
Dec 17, 2019 · Backend Development

From Single Server to Cloud‑Native: 12 Steps of Scaling an E‑Commerce Backend

The article walks through the evolution of a high‑traffic e‑commerce backend—from a single‑machine setup to distributed databases, load‑balancing, micro‑services, and finally cloud‑native deployment—highlighting the technical challenges and design principles at each stage.

Cloud NativeDistributed SystemsMicroservices
0 likes · 20 min read
From Single Server to Cloud‑Native: 12 Steps of Scaling an E‑Commerce Backend
Java Architect Essentials
Java Architect Essentials
Dec 15, 2019 · Backend Development

Designing Ultra‑High‑Performance Flash‑Sale Systems: Architecture, Consistency, and Availability

This article analyzes the core challenges of building flash‑sale (秒杀) systems—high concurrency reads and writes, strict consistency, and ultra‑high availability—and presents a layered architectural approach covering dynamic/static separation, hotspot optimization, database tuning, and comprehensive high‑availability strategies.

Backend ArchitectureConsistencyOperations
0 likes · 28 min read
Designing Ultra‑High‑Performance Flash‑Sale Systems: Architecture, Consistency, and Availability
DataFunTalk
DataFunTalk
Dec 13, 2019 · Databases

Lindorm: High‑Performance Distributed NoSQL Database for Big Data

Lindorm, an Alibaba‑derived distributed NoSQL database built on HBase, delivers multi‑model hybrid storage, five‑fold throughput gains, sub‑millisecond latency, advanced indexing, cloud‑native elasticity, strong/adjustable consistency, and comprehensive security and multi‑tenant features for massive data workloads.

Big DataNoSQLdistributed database
0 likes · 25 min read
Lindorm: High‑Performance Distributed NoSQL Database for Big Data
Programmer DD
Programmer DD
Dec 8, 2019 · Backend Development

Why Does Dubbo Keep Calling When Nacos Is Down? Uncovering the Cache Mechanism

This article analyzes a real‑world incident where Dubbo services continued to operate despite Nacos registry failures, explains the role of local provider caches, examines the namingLoadCacheAtStart configuration, and provides reproduction steps and best‑practice recommendations for high‑availability service discovery.

CacheDubboNacos
0 likes · 11 min read
Why Does Dubbo Keep Calling When Nacos Is Down? Uncovering the Cache Mechanism
21CTO
21CTO
Dec 7, 2019 · Backend Development

Master Flash Sale Systems: Boost Performance, Ensure Consistency & High Availability

This article explains how to design a flash‑sale (秒杀) system that handles massive concurrent requests by applying dynamic‑static separation, hotspot optimization, database tuning, and traffic‑shaping techniques to achieve high performance, strong consistency, and robust high‑availability.

BackendConsistencySystem Design
0 likes · 26 min read
Master Flash Sale Systems: Boost Performance, Ensure Consistency & High Availability
Ziru Technology
Ziru Technology
Dec 7, 2019 · Backend Development

How Distributed Delayed Scheduling Guarantees High Availability with Redis and RabbitMQ

This article explains how a distributed delayed‑execution component solves business scenarios that require timed actions, ensuring high availability and data consistency by leveraging Redis sorted sets and RabbitMQ dead‑letter queues, while providing a simple Java API and optional DB‑backed reliability.

Distributed SchedulingRabbitMQdelay queue
0 likes · 9 min read
How Distributed Delayed Scheduling Guarantees High Availability with Redis and RabbitMQ
Java High-Performance Architecture
Java High-Performance Architecture
Dec 2, 2019 · Databases

How Redis Sentinel Ensures Automatic Failover and High Availability

Redis Sentinel provides an automated high‑availability solution for Redis by monitoring master health, broadcasting SDOWN/ODOWN messages, electing a new master based on priority, offset and runid, and allowing clients to discover the current master via sentinel commands, all explained with configuration examples and diagrams.

Configurationhigh availabilitymonitoring
0 likes · 6 min read
How Redis Sentinel Ensures Automatic Failover and High Availability
Qunar Tech Salon
Qunar Tech Salon
Nov 28, 2019 · Databases

Qunar Redis High‑Availability Architecture Design, Security Mechanisms, and Automated Operations

This article details the design principles, components, client implementation, data sharding, security mechanisms, high‑risk command blocking, configuration optimizations, and automated operational workflows of Qunar's Redis high‑availability cluster, including code modifications, deployment scripts, and platform‑based management for large‑scale production environments.

Database Architectureautomationhigh availability
0 likes · 18 min read
Qunar Redis High‑Availability Architecture Design, Security Mechanisms, and Automated Operations
AntTech
AntTech
Nov 27, 2019 · Databases

OceanBase Technical Overview: Distributed Transaction Processing and TPC‑C Benchmark

The article presents a detailed technical overview of OceanBase, explaining why traditional centralized databases cannot meet modern OLTP and OLAP demands, how OceanBase achieves distributed transaction processing with Paxos‑based replication and majority voting for high availability, and how it validates its capabilities through the TPC‑C benchmark.

OceanBaseTPC-Chigh availability
0 likes · 18 min read
OceanBase Technical Overview: Distributed Transaction Processing and TPC‑C Benchmark
Efficient Ops
Efficient Ops
Nov 26, 2019 · Backend Development

How to Build a High‑Performance, Consistent, and Highly Available Flash‑Sale System

This article explores the architectural design of a flash‑sale (秒杀) system, covering high‑performance techniques, consistency guarantees, high‑availability strategies, hotspot optimization, and practical implementation details such as static‑dynamic separation, caching, data integration, and database tuning.

BackendConsistencySystem Design
0 likes · 33 min read
How to Build a High‑Performance, Consistent, and Highly Available Flash‑Sale System
Amap Tech
Amap Tech
Nov 21, 2019 · Industry Insights

Gaode’s Unit‑Based Architecture: Scaling Services with Smart Routing and Data Sync

This article details Gaode's practical experience in building a unit‑based service architecture, covering challenges like request routing, unit isolation, and data synchronization, and explains the design choices, deployment strategies, performance metrics, and future optimization plans.

BackendService Architecturedata synchronization
0 likes · 15 min read
Gaode’s Unit‑Based Architecture: Scaling Services with Smart Routing and Data Sync
Architecture Digest
Architecture Digest
Nov 20, 2019 · Backend Development

Design Principles and Optimization Strategies for High‑Concurrency Flash‑Sale (Seckill) Systems

This article examines the architectural design of flash‑sale (seckill) systems, covering high performance through dynamic‑static separation, hotspot optimization, consistency handling for inventory deduction, and high‑availability techniques such as traffic shaping, queuing, and fallback plans.

ConsistencySeckillarchitecture
0 likes · 30 min read
Design Principles and Optimization Strategies for High‑Concurrency Flash‑Sale (Seckill) Systems
High Availability Architecture
High Availability Architecture
Nov 19, 2019 · Blockchain

How Coinbase Builds and Deploys Blockchain Nodes with Snapchain

The article explains Coinbase’s unique security and infrastructure requirements for blockchain nodes, describes the challenges of blue‑green deployments, and details the Snapchain system built on AWS that enables fast, reliable snapshot‑based node provisioning, upgrades, and high‑availability scaling.

AWSBlockchainInfrastructure
0 likes · 7 min read
How Coinbase Builds and Deploys Blockchain Nodes with Snapchain
System Architect Go
System Architect Go
Nov 17, 2019 · Databases

Handling Single Point Failures and Disaster Recovery in InfluxDB

To mitigate the inherent single‑point‑failure risk of the open‑source InfluxDB community edition, the article proposes deploying multiple InfluxDB instances with concurrent client writes, tracking failed writes, temporarily storing them, and using custom workers to replay data, while addressing timeout, data consistency, and storage considerations.

Data ConsistencyInfluxDBTime Series Database
0 likes · 3 min read
Handling Single Point Failures and Disaster Recovery in InfluxDB
Efficient Ops
Efficient Ops
Nov 12, 2019 · Operations

Building High‑Availability Systems in Securities: Practices and Tech Choices

This article examines the unique regulatory and operational characteristics of the securities industry and explains how careful technology selection—ranging from native database log replay to third‑party replication and big‑data platforms—enables robust high‑availability architectures, illustrated with real‑world practices from Dongfang Securities.

Database ReplicationSystem Architecturehigh availability
0 likes · 14 min read
Building High‑Availability Systems in Securities: Practices and Tech Choices
Aikesheng Open Source Community
Aikesheng Open Source Community
Nov 11, 2019 · Databases

Will MySQL Be Subject to “Bottleneck” Restrictions? An Analysis of US and Chinese Export Controls and Technical Viability

The article examines whether MySQL, especially the Community Edition, faces commercial or regulatory “bottleneck” risks by reviewing US export control rules, Chinese policy guidance, and the technical feasibility of MySQL’s high‑availability architectures, concluding that current policies pose no significant threat.

Database ArchitectureExport controlshigh availability
0 likes · 8 min read
Will MySQL Be Subject to “Bottleneck” Restrictions? An Analysis of US and Chinese Export Controls and Technical Viability
dbaplus Community
dbaplus Community
Nov 10, 2019 · Backend Development

Mastering Redis at Scale: Real‑World Use Cases, Performance Tweaks, and High‑Availability Strategies

This technical guide compiled by Tencent engineers explores common Redis data‑structure use cases, latency and memory considerations, compares distributed deployment options, and provides practical optimization, high‑availability, and troubleshooting techniques for large‑scale Redis (Codis) deployments.

Codishigh availabilityredis
0 likes · 15 min read
Mastering Redis at Scale: Real‑World Use Cases, Performance Tweaks, and High‑Availability Strategies
DevOps Cloud Academy
DevOps Cloud Academy
Nov 9, 2019 · Operations

Configuring Jenkins High Availability with HAProxy and NFS

This guide explains how to achieve Jenkins high availability by deploying two Jenkins master nodes behind HAProxy, sharing Jenkins home via NFS, and configuring HAProxy load balancing and health checks, including detailed host setup, NFS and Jenkins installation steps, and test results.

DevOpsHAProxyInfrastructure
0 likes · 10 min read
Configuring Jenkins High Availability with HAProxy and NFS
Architecture Digest
Architecture Digest
Nov 8, 2019 · Backend Development

Evolution of Taobao Backend Architecture from Hundred to Ten Million Concurrent Users

This article uses Taobao’s backend as a case study to illustrate how server architecture evolves through ten stages—from a single‑machine setup handling hundreds of requests to a distributed, containerized, cloud‑native system supporting tens of millions of concurrent users—detailing the technologies and design principles at each step.

Backend ArchitectureScalabilitycloud computing
0 likes · 19 min read
Evolution of Taobao Backend Architecture from Hundred to Ten Million Concurrent Users
Efficient Ops
Efficient Ops
Nov 6, 2019 · Operations

Mastering Load Balancing: Types, Architectures, and Algorithms Explained

This article explains why a single server eventually hits a performance ceiling, introduces high‑performance clustering and the role of load balancers, compares DNS, hardware, and software load‑balancing solutions, outlines typical multi‑layer architectures, and reviews common load‑balancing algorithms with their pros and cons.

algorithmhigh availabilityload balancing
0 likes · 18 min read
Mastering Load Balancing: Types, Architectures, and Algorithms Explained