Tagged articles
1414 articles
dbaplus Community
Sep 20, 2016 · Backend Development

Zero‑Intrusion Data Fallback with Nginx + Lua: A Practical Guide

This article explains how to design a robust, zero‑intrusion data fallback component for high‑traffic web services using Nginx, Lua, and AOP principles, covering problem definition, architectural options, detailed execution steps, configuration snippets, storage choices, and performance monitoring.

Data Fallback, Lua, Nginx
0 likes · 11 min read
ITPUB
Sep 19, 2016 · Databases

How Oracle Data Guard’s Maximum Protection Mode Handles Failures and Recovery

This article demonstrates how to enable Oracle Data Guard’s Maximum Protection mode, examines the behavior of primary and standby databases when the standby connection is broken, and walks through the resulting errors, forced shutdown, and step‑by‑step recovery back to normal operation.

Data Guard, Database Replication, Maximum Protection
0 likes · 12 min read
Meituan Technology Team
Sep 9, 2016 · Backend Development

Evolution and Scaling of Meituan's Food Delivery Order System

Meituan’s food‑delivery order platform evolved from a simple monolithic prototype in 2013 to a distributed, highly available service that now handles tens of millions of orders daily, using asynchronous processing, sharding, multi‑data‑center deployment, and automated operations to achieve sub‑150 ms transaction latency and 99.9999% uptime.

Backend, Meituan, Performance Optimization
0 likes · 20 min read
Alibaba Cloud Developer
Sep 5, 2016 · Operations

Avoid Common Pitfalls in Geo-Active High-Availability Design

This article examines common misconceptions in designing geo-distributed active-active systems, explains why striving for perfect real-time data sync is unrealistic, and offers practical strategies—such as prioritizing core services, reducing distance, limiting data replication, and combining storage sync with messaging—to achieve reliable high-availability.

Active-Active, Data Consistency, System Design
0 likes · 17 min read
Efficient Ops
Aug 31, 2016 · Databases

Can WeChat’s Open‑Source PhxSQL Deliver True High‑Availability MySQL?

The article reviews WeChat’s newly open‑sourced PhxSQL, a MySQL clustering solution that promises high availability and strong consistency, examines its complex architecture with added Phxbinlogsvr and Phxsqlproxy components, and discusses practical concerns such as deployment difficulty, replication latency, lack of multi‑master writes, semi‑synchronous behavior, and monitoring challenges.

PhxSQL, database clustering, high availability
0 likes · 9 min read
ITPUB
Aug 30, 2016 · Operations

Build a Dual‑Master HAProxy + Keepalived Load Balancer for Zero Downtime

This guide walks through setting up a high‑availability HAProxy and Keepalived dual‑master configuration, detailing the experiment environment, installation steps, keepalived scripts, and testing procedures that ensure continuous service even if one node fails.

HAProxy, Server Setup, high availability
0 likes · 6 min read
WeChat Backend Team
Aug 30, 2016 · Databases

How PhxSQL Achieves MySQL‑Compatible High Availability with Paxos‑Based Replication

PhxSQL is an open‑source, MySQL‑compatible relational database cluster that uses a Paxos‑based log store and a three‑layer architecture (Phxbinlogsvr, Phxsqlproxy, Phxsync) to provide automatic master failover, strong data consistency, and improved write performance compared to MySQL semi‑synchronous replication.

Paxos, Replication, database clustering
0 likes · 10 min read
High Availability Architecture
Aug 30, 2016 · Operations

Evolution of Meizu Flyme Operations Architecture and High‑Availability Practices

The article details Meizu's Flyme operations platform evolution—from a single‑cabinet setup in 2011 to a multi‑IDC, 6000‑server infrastructure—highlighting challenges, architectural upgrades, monitoring, cost control, automation, and future high‑availability directions for large‑scale internet services.

Infrastructure, cost control, high availability
0 likes · 13 min read
High Availability Architecture
Aug 26, 2016 · Backend Development

Evolution of Weibo Cache Service: From Bare Memcache to Multi‑Layered Service Architecture

The article details how Weibo’s cache infrastructure progressed from simple Memcache deployments to a sophisticated, service‑oriented architecture featuring multi‑layer caching, proxy layers, dynamic configuration, monitoring, and automated scaling to meet massive read‑write demands and high availability requirements.

Backend, Cache, Scalability
0 likes · 19 min read
dbaplus Community
Aug 24, 2016 · Databases

Choosing the Right MySQL HA Architecture: Master‑Slave, Dual‑Master, Heartbeat+DRBD, and GTID

This article examines common MySQL high‑availability designs—including single master‑slave, dual‑master with multiple slaves, Heartbeat + DRBD, MySQL Cluster, semi‑synchronous replication, and GTID—explaining their benefits, drawbacks, and practical considerations for reliable failover and scalability.

DRBD, Database Architecture, GTID
0 likes · 9 min read
ITPUB
Aug 23, 2016 · Databases

Designing a Zookeeper‑MHA MySQL High‑Availability Architecture: Key Insights

This talk explains how Lianjia redesigned its MySQL high‑availability setup by replacing VIP‑based failover with a Zookeeper‑driven naming service, detailing the original MHA architecture, its shortcomings, the new components, workflow, operational challenges, and a DNS‑based refinement.

DNS, Database Architecture, MHA
0 likes · 14 min read
Architecture Digest
Aug 18, 2016 · Backend Development

Architecture and Scaling Practices of the Miaopai Short Video Platform

The article details the evolution, system architecture, backend technologies, high‑availability designs, data handling, upload and playback workflows, and operational monitoring of the Miaopai short‑video service, illustrating how it scales to hundreds of millions of users under heavy load.

Scalability, System Architecture, cloud storage
0 likes · 12 min read
Architecture Digest
Aug 17, 2016 · Backend Development

Design and Optimization of Bilibili Live Chat (GOIM) System

The article presents a detailed overview of Bilibili's GOIM live chat architecture, covering its high‑stability, high‑availability, low‑latency design, component breakdown, memory and module optimizations, network improvements, and performance testing results to achieve scalable real‑time messaging.

Backend Architecture, Go, Kafka
0 likes · 13 min read
ITPUB
Aug 16, 2016 · Databases

Achieving Seamless MySQL HA with Pacemaker and MHA: Lessons from DTCC 2016

This article details a MySQL high‑availability solution built on Pacemaker, Corosync and MHA, explains why earlier keepalived‑based designs suffered split‑brain issues, and walks through the architecture, quorum handling, resource agents, failover workflow, testing methodology, and practical lessons learned.

MHA, Operations, Pacemaker
0 likes · 16 min read
dbaplus Community
Aug 9, 2016 · Databases

Step-by-Step Guide to Deploying MHA for MySQL High Availability

This tutorial walks through preparing a CentOS 6.6 environment, creating MHA users and SSH keys, configuring hosts, installing MHA Node and Manager, verifying configurations, and testing master failover to achieve reliable MySQL high‑availability using MHA.

Database Administration, MHA, high availability
0 likes · 6 min read
Architecture Digest
Aug 7, 2016 · Operations

Designing a Three‑Dimensional High‑Availability Architecture for Alibaba's Game Integration System

The article describes how Alibaba's game integration platform achieved business‑oriented high availability by abandoning traditional system‑centric designs and implementing a three‑dimensional architecture that combines clear HA goals, multi‑active deployment, client‑side retries, functional isolation, automated monitoring, and rapid fault recovery, ultimately meeting a 3‑minute issue‑location and 5‑minute business‑recovery target.

Operations, System Architecture, business‑oriented HA
0 likes · 21 min read
ITPUB
Aug 2, 2016 · Databases

How to Build a Highly Available MySQL Cluster with HAProxy and Keepalived on OpenStack

This guide walks through configuring a Percona XtraDB Cluster on three OpenStack nodes, disabling SELinux and firewalls, installing and tuning MySQL, setting up HAProxy for load balancing, deploying Keepalived for failover, and verifying high‑availability with practical test steps.

HAProxy, OpenStack, Percona XtraDB Cluster
0 likes · 13 min read
MaGe Linux Operations
Aug 1, 2016 · Operations

Mastering MHA Core Parameters: Complete Guide to MySQL HA Configuration

This article provides a detailed walkthrough of MHA's core configuration parameters—including server scopes, connection settings, candidate master rules, failover scripts, and monitoring options—explaining where each setting belongs and how to fine‑tune MySQL high‑availability behavior.

Configuration, MHA, Operations
0 likes · 11 min read
Architecture Digest
Jul 31, 2016 · Game Development

Maintaining Game World State with Data Services: Architecture, Sharding, and High Availability

This article examines how to maintain game world state using data services, discussing traditional DB proxy processes, the role of Redis as a cache and database, stateless versus stateful services, sharding, high‑availability mechanisms such as Sentinel, Zookeeper, and the overall architecture for scalable MMO back‑ends.

Data Service, game server, high availability
0 likes · 32 min read
dbaplus Community
Jul 21, 2016 · Databases

How MHA Delivers Fast, Zero‑Data‑Loss MySQL High Availability

This article explains MHA’s architecture, failover workflow, comparison with other MySQL HA solutions, and its six key advantages, showing how it can switch masters within seconds while preserving data consistency without altering MySQL settings or adding many servers.

MHA, failover, high availability
0 likes · 9 min read
Architecture Digest
Jul 1, 2016 · Fundamentals

Key Architectural Practices and Insights for Scalable, High‑Availability Systems

This article collection reviews essential architectural concepts—including scalable web development, business‑centric high‑availability design, front‑end/back‑end separation with REST, large‑scale instant‑messaging platforms, industrial IoT frameworks, software‑defined storage, and the personal growth required of architects—providing practical guidance for building robust, adaptable systems.

IoT, Scalability, Software-Defined Storage
0 likes · 7 min read
dbaplus Community
Jun 29, 2016 · Cloud Native

How to Build a Highly Available Kubernetes Master on Private Cloud

This guide walks through constructing a Kubernetes master high‑availability environment on a private‑cloud Ubuntu 14.04 platform, covering full component containerization, external load‑balancer setup with HAProxy and Keepalived, static pod configuration, privileged mode requirements, flannel networking, common pitfalls, and future HA master developments.

Flannel, HAProxy, Kubernetes
0 likes · 16 min read
ITFLY8 Architecture Home
Jun 19, 2016 · Backend Development

How to Tackle Common Cache Problems in Distributed Systems

This article explores typical cache challenges in distributed systems—including data consistency, high availability, cache avalanche, and cache penetration—explaining their causes, real‑world scenarios, and practical mitigation strategies to ensure reliable and efficient caching.

Cache, Data Consistency, Distributed Systems
0 likes · 9 min read
Architecture Digest
Jun 15, 2016 · Operations

Design and Implementation of Multi‑Site Active‑Active Disaster Recovery for Call Centers

The article describes how a large‑scale call center evolves to a multi‑site architecture and implements system‑level active‑active disaster recovery using Ctrip's contact‑center and unified login platforms, detailing the login flow, fault‑detection logic, key features, and future extensions.

Active-Active, System Architecture, call center
0 likes · 6 min read
ITFLY8 Architecture Home
Jun 9, 2016 · Operations

Mastering Load Balancing: Principles, Types, and Practical Tips

This article explains how load balancing solves high‑traffic, high‑concurrency challenges by distributing requests across multiple servers, covering its core principles, vertical and horizontal scaling, various classification methods such as DNS, IP, link‑layer, and hybrid approaches, along with their advantages, drawbacks, and deployment recommendations.

Distributed Systems, high availability, horizontal scaling
0 likes · 10 min read
ITFLY8 Architecture Home
Jun 9, 2016 · Fundamentals

Distributed vs Cluster: Key Differences and When to Use Each

This article explains the core distinctions between distributed systems and clusters, covering their architectures, efficiency goals, typical use cases, and examples such as Hadoop MapReduce and load‑balancing clusters, while also detailing cluster types, high‑availability, load balancing, and high‑performance computing.

Cluster Computing, Distributed Systems, High‑performance computing
0 likes · 10 min read
dbaplus Community
Jun 1, 2016 · Backend Development

Mastering RabbitMQ: Architecture, Optimization, and Real-World Cases in Microservices

This article explores microservice architecture fundamentals, compares synchronous and asynchronous communication, details RabbitMQ’s AMQP model, optimization techniques, high‑availability configurations, flow‑control mechanisms, and shares practical case studies from NetEase’s Hive platform, offering actionable insights for reliable, scalable message‑queue deployments.

Message Queue, Microservices, Performance Optimization
0 likes · 18 min read
Tencent Music Tech Team
May 27, 2016 · Databases

Cross‑IDC Disaster Recovery Solution for KV Storage via a Proxy Layer

The article proposes a cross‑IDC disaster‑recovery architecture for key‑value stores that uses a storage proxy layer to record write logs in the primary data center, forward them to a read‑only backup center for replay, and route reads via load‑balancing, while noting current limitations such as complex configuration, low synchronization concurrency, and lack of multi‑write support.

KV storage, cross-IDC, database
0 likes · 4 min read
ITPUB
May 25, 2016 · Databases

Mastering Redis Replication: Features, Mechanics, and Deployment Guide

Redis replication offers high‑availability through master‑slave syncing, supporting multiple slaves, cascading replication, non‑blocking data transfer, read‑write separation, and optional disk‑less sync, with detailed steps for configuration, safety considerations, partial resynchronization, and practical deployment examples.

Master‑Slave, Replication, database
0 likes · 8 min read
21CTO
May 13, 2016 · Backend Development

How to Refactor a Live Backend System Without Stopping the Business

The article shares practical experiences and lessons learned from three large‑scale backend architecture refactorings, explaining why doing business‑critical changes while keeping services online is the toughest challenge and how to identify and solve the most critical problems first.

Backend, high availability, refactoring
0 likes · 8 min read
21CTO
May 8, 2016 · Databases

Which MySQL High‑Availability Architecture Is Right for You? A Comprehensive Guide

The article reviews common MySQL high‑availability solutions—including shared‑storage SAN, DRBD disk replication, keepalived/heartbeat, MHA, ZooKeeper‑based HA, Galera/PXC clusters, and proxy middleware—detailing their architectures, advantages, limitations, and suitability for different business and operational requirements.

Cluster, Database Replication, HA Architecture
0 likes · 19 min read
Architecture Digest
May 5, 2016 · Fundamentals

Understanding Distributed Systems and Cluster Architectures: Concepts, Examples, and Strategies

This article explains the fundamental differences between distributed systems and clusters, illustrates them with real‑world examples such as Dubbo, Elasticsearch and Git, and details various cluster types, high‑availability and load‑balancing strategies, as well as fault‑tolerance mechanisms.

Distributed Systems, Dubbo, Elasticsearch
0 likes · 12 min read
dbaplus Community
May 4, 2016 · Databases

Choosing Percona, MariaDB or MySQL and Proven MySQL Tuning & HA Strategies

This guide compares Percona Server, MariaDB, and MySQL, outlines practical MySQL performance tuning steps across hardware, storage, filesystem, and server parameters, and presents several high‑availability architectures such as master‑slave replication, MMM/MHA, Heartbeat/SAN, and Heartbeat/DRBD with real‑world deployment examples.

Database Tuning, MariaDB, Percona
0 likes · 9 min read
21CTO
May 4, 2016 · Operations

Mastering Load Balancing: Principles, Types, and Algorithms Explained

This article provides a comprehensive overview of load balancing, covering its core principles, the distinction between vertical and horizontal scaling, various classifications such as DNS, HTTP, IP, link‑layer, and hybrid approaches, as well as common algorithms and hardware considerations for high‑availability, high‑performance systems.

Distributed Systems, high availability, load balancing
0 likes · 11 min read
High Availability Architecture
May 3, 2016 · Databases

Designing and Scaling a Social Platform with Redis

This article explains how to build a micro‑blogging social system using Redis, covering core data models, read‑through replication, Sentinel failover, and write‑side sharding techniques that enable the platform to handle hundreds of millions of users and massive traffic.

Social network, high availability, read replication
0 likes · 19 min read
21CTO
Apr 23, 2016 · Operations

How to Achieve Nginx High Availability with Keepalived on Linux

This guide explains how to use Keepalived and the VRRP protocol to build a two‑node high‑availability setup for Nginx, covering installation, health‑check scripts, configuration options, testing procedures, and troubleshooting tips for seamless failover.

Nginx, VRRP, high availability
0 likes · 16 min read
21CTO
Apr 19, 2016 · Backend Development

How Alibaba’s Notify and MetaQ Power Massive E‑Commerce Messaging

This article explains the design principles, architecture, and performance optimizations of Alibaba's Notify and MetaQ message middleware, illustrating how they achieve reliable asynchronous communication, high scalability, and low latency for billions of messages during peak e‑commerce events like Double 11.

Backend Architecture, Distributed Systems, Message Queue
0 likes · 14 min read
Art of Distributed System Architecture Design
Apr 15, 2016 · Cloud Computing

How Meizu Optimized Cloud Sync: Architecture, Protocols, and Performance Insights

This article provides an in‑depth technical overview of Meizu's cloud sync service, covering its business scenarios, the evolution of its core SyncML‑based protocols, modular architecture across multiple data centers, performance optimizations such as traffic reduction and connection handling, as well as high‑availability, security, and future SDK‑API extensions.

Distributed Systems, Performance Optimization, cloud sync
0 likes · 25 min read
21CTO
Apr 13, 2016 · Operations

Designing a Highly Available Transaction System: Real‑World Evolution

This article examines how a large‑scale e‑commerce transaction platform achieved high availability through iterative architectural evolution—from early .NET monoliths to vertical and horizontal micro‑service splits—highlighting practical strategies for fault detection, rapid recovery, scaling, and operational best‑practices.

Microservices, Operations, System Architecture
0 likes · 15 min read
Qunar Tech Salon
Mar 31, 2016 · Operations

Technical Summary of Large‑Scale Distributed Website Architecture

This article provides a comprehensive technical summary of large‑scale distributed website architecture, outlining characteristics, architectural goals, patterns, and detailed strategies for high performance, high availability, scalability, extensibility, security, and agility, supplemented with illustrative examples and practical insights.

Scalability, architecture, high availability
0 likes · 12 min read
dbaplus Community
Mar 24, 2016 · Databases

Step‑by‑Step Migration of a 2‑Node Oracle 11gR2 RAC to New Hardware with Active Data Guard

This guide details the complete process of moving a 2‑node Oracle 11gR2 RAC database to new servers and storage using Active Physical Standby Data Guard, covering architecture, pre‑implementation preparation, detailed implementation steps, switchover procedures, impact assessment, rollback measures, and lessons learned.

Data Guard, Oracle, Physical Standby
0 likes · 10 min read
Efficient Ops
Mar 19, 2016 · Operations

From Zero to Senior Ops Engineer: My Linux, Automation & High‑Availability Journey

Freeman, a 1988‑born O2O senior operations engineer from Henan now in Shanghai, shares his personal learning saga—from clueless newcomer to mastering Linux, Nagios, iptables, load‑balancing, MySQL replication, and high‑availability architectures—highlighting the challenges, training, and mindset needed to thrive in modern IT operations.

career, high availability, linux
0 likes · 14 min read
High Availability Architecture
Mar 18, 2016 · Backend Development

High‑Availability Architecture of Upyun Image Processing Service

The article details Upyun's high‑availability image processing architecture, covering workload‑aware system design, custom GmServer implementation, task scheduling, current strengths and limitations, and future directions such as a ServiceServer‑based queue and Docker‑driven dynamic scaling.

Backend Architecture, Cloud Services, Docker
0 likes · 15 min read
21CTO
Mar 15, 2016 · Backend Development

Why Multi‑Datacenter Architecture Is Essential for High‑Availability Services

The article explains how multi‑datacenter architectures prevent total service loss, improve latency by placing services near users, and balance the CAP trade‑offs through models like CA, CP, and AP, while outlining practical design, sharding, monitoring, and failover strategies for large‑scale backend systems.

CAP theorem, Data Consistency, Distributed Systems
0 likes · 14 min read
ITPUB
Mar 15, 2016 · Databases

Choosing the Right MySQL OLTP Architecture: PXC, MHA, or Dual‑Master?

Before migrating enterprise OLTP workloads to open‑source MySQL, this guide evaluates three high‑availability architectures—Percona XtraDB Cluster, MHA, and native dual‑master—by outlining key pre‑migration questions, their advantages, drawbacks, and practical deployment considerations.

OLTP, Percona XtraDB Cluster, high availability
0 likes · 9 min read
Architects' Tech Alliance
Mar 7, 2016 · Cloud Computing

Overview of VMware High‑Availability, DRS, vMotion, and Related Virtualization Features

This article provides a comprehensive overview of VMware's high‑reliability PSA architecture, virtual machine conversion tools, HA, DRS, FT, vMotion, storage vMotion, DPM, vNetwork distributed switch, and automation utilities, explaining their functions, components, and operational benefits within virtualized environments.

DRS, VMware, cloud computing
0 likes · 9 min read
Architect
Feb 29, 2016 · Big Data

Design Principles of Real-Time Distributed Streaming Systems: A Comparison of Spark and Storm

This article examines the design considerations of real-time distributed streaming systems, outlines their background and characteristics, compares the architectures of Spark Streaming and Storm, discusses primitives, message passing, high availability, storage models, and integration with production environments, providing practical insights for architects.

Distributed Systems, Real-time Processing, Spark
0 likes · 20 min read
21CTO
Feb 27, 2016 · Operations

Mastering Load Balancing: Principles, Types, Algorithms, and Hardware Solutions

Load balancing distributes incoming traffic across multiple servers to enhance performance, ensure high availability, and enable horizontal scaling, with various implementations such as DNS, HTTP, IP, link‑layer, and hybrid methods, each employing algorithms like round‑robin, random, least connections, hash, and weighted distribution, plus hardware options.

Distributed Systems, Hardware, Scalability
0 likes · 14 min read
Architecture Digest
Feb 27, 2016 · Databases

Architecture and Design of TDSQL: A Distributed MySQL‑Based SQL System for High‑Consistency Billing

The article analyzes the architecture of TDSQL, a MySQL‑based distributed SQL system designed for Tencent’s billing platform, detailing its transition from an in‑memory NoSQL solution to a high‑consistency, auto‑scaling, fault‑tolerant database with sharding, scheduler, agent, and gateway components.

Replication, distributed database, high availability
0 likes · 17 min read
Qunar Tech Salon
Feb 25, 2016 · Databases

Scaling MySQL Master‑Slave Replication with Binlog Server: Architecture, Benefits, and Use Cases

This article explains how Booking.com solved MySQL master‑to‑hundreds‑of‑slaves bandwidth limits by introducing a Binlog Server component, detailing its design, advantages over traditional relay‑master setups, and multiple deployment scenarios for high‑availability and cross‑region replication.

Binlog Server, Database Architecture, Replication
0 likes · 9 min read
21CTO
Feb 9, 2016 · Databases

Understanding Database Clustering: Architectures, Benefits, and Challenges

This article explores the importance of database clustering in modern information systems, outlines the challenges of performance, availability, and scalability, and compares Share‑Disk and Share‑Nothing architectures along with their advantages, drawbacks, and real‑world implementations.

Scalability, database clustering, high availability
0 likes · 10 min read
21CTO
Feb 9, 2016 · Cloud Computing

How Tencent Cloud Powers WeChat’s Massive New Year Red Envelope Storm

During the Chinese New Year, Tencent Cloud handled billions of WeChat red‑envelope transactions by employing multi‑region load balancing, disaster‑recovery architectures, database sharding, and high‑throughput caching to ensure seamless, high‑availability service for millions of concurrent users.

Distributed Systems, high availability
0 likes · 5 min read
21CTO
Feb 4, 2016 · Backend Development

Key Principles for Building Scalable Distributed Web Systems

This article outlines essential design principles for large‑scale web architectures—including availability, performance, reliability, scalability, manageability and cost—and demonstrates their application through a detailed image‑hosting service example, covering services, redundancy, partitioning, caching, proxies, indexing, load balancing, and queuing to achieve efficient, scalable data access.

Data Partitioning, Distributed Systems, caching
0 likes · 37 min read
21CTO
Jan 28, 2016 · Operations

How to Build High‑Availability Systems: Lessons from a Transaction Platform Evolution

This article shares practical insights on achieving high availability by understanding goals, decomposing requirements, designing resilient architectures, ensuring operability, testing rigorously, and reducing release risk, illustrated through the multi‑stage evolution of a transaction system.

Microservices · Operations · Scalability
0 likes · 14 min read
High Availability Architecture
Jan 17, 2016 · Operations

High‑Availability Architecture and Scaling Practices of Meipai Short‑Video Platform

This article outlines Meipai’s evolution into a billion‑user short‑video platform, detailing its high‑availability, scalable architecture, service discovery via etcd, data storage challenges, CDN and cloud‑storage redundancy, fault‑tolerance mechanisms, and future directions such as H.265 adoption and P2P‑CDN hybrid delivery.

CDN · cloud storage · etcd
0 likes · 18 min read
ITPUB
Jan 8, 2016 · Databases

Mastering Oracle RAC: Best Practices, Common Pitfalls, and Real-World Cases

This technical session covers Oracle RAC high‑availability best practices, installation steps, daily operational commands, detailed case studies of auto‑start checks, version‑mix issues, addNode failures, network heartbeat problems, and client connection errors, plus a concise Q&A on uninstall, SCAN vs VIP, and split‑brain detection.

Installation · Oracle · RAC
0 likes · 21 min read
21CTO
Dec 30, 2015 · Operations

Designing a 100k-Server Monitoring System: Architecture and Key Lessons

This article shares the architecture, design principles, challenges, and performance optimizations behind a monitoring system for 100,000 servers, covering data collection agents, distributed pipelines, real‑time alerts, high throughput, multi‑platform support, and practical lessons learned.

Performance Optimization · Scalability · Server Monitoring
0 likes · 10 min read
High Availability Architecture
Dec 18, 2015 · Operations

Weibo's Multi-Data-Center (Active‑Active) Architecture: Experience, Challenges, and Best Practices

The article details Weibo's journey in building a multi‑data‑center active‑active architecture, covering its evolution, technical challenges such as latency and data synchronization, the adopted MCQ‑based messaging solution, operational best practices, and future directions for high‑availability deployments.

Deployment · Messaging · Multi-Data Center
0 likes · 16 min read
MaGe Linux Operations
Dec 16, 2015 · Databases

Mastering MySQL Replication: Concepts, Architectures, and Advanced Strategies

This article explains MySQL replication fundamentals, detailing master‑slave and multi‑master architectures, thread roles, configuration parameters, read‑write splitting, multi‑source setups, and advanced scenarios such as multi‑level and circular replication, while highlighting common pitfalls and performance considerations.

Database Architecture · Master‑Slave · Replication
0 likes · 7 min read
21CTO
Dec 5, 2015 · Cloud Computing

How Momo Scales to Billions: High‑Availability Communication Architecture for Mobile Social Apps

Facing 1.6 to 1.7 billion daily requests, Momo’s tech director explains how the company built a highly available, low‑latency communication protocol and a distributed infrastructure based on virtualization, OpenStack, and cloud servers to overcome weak mobile networks and serve its massive social‑gaming user base.

Communication Protocol · Mobile Networking · Virtualization
0 likes · 4 min read
ITPUB
Nov 30, 2015 · Backend Development

How 58.com Achieves Read/Write High Availability with Dual‑Master and Redis Caching

The article explains 58.com’s service‑oriented architecture that combines dual‑master MySQL replication, a standby database, and Redis caching to provide high‑availability read/write operations, outlines their scaling process, and details a four‑step cache‑consistency strategy.

Database Replication · Service Architecture · high availability
0 likes · 6 min read
Efficient Ops
Nov 24, 2015 · Operations

Achieve Web High Availability and Static/Dynamic Separation with HAProxy & Keepalived

This article walks through implementing web high availability and static‑dynamic content separation using HAProxy combined with Keepalived, covering load‑balancing concepts, VRRP basics, step‑by‑step configuration of time sync, hostnames, SSH trust, installing required packages, and testing failover scenarios.

HAProxy · Web · high availability
0 likes · 13 min read
21CTO
Nov 14, 2015 · Fundamentals

Inside Taobao’s High‑Performance Distributed File System (TFS): Architecture & Scaling

This article explains the design, storage mechanisms, high‑availability architecture, scaling strategies, multi‑data‑center disaster recovery, operational management, and future plans of Taobao’s distributed file system (TFS), a highly available and scalable storage solution for massive unstructured data.

Distributed File System · TFS · Taobao
0 likes · 14 min read
21CTO
Nov 13, 2015 · Operations

Inside Taobao’s High‑Performance Distributed File System (TFS): Architecture & Scaling

Taobao’s File System (TFS) is a highly available, high‑performance distributed storage solution built on Linux servers, featuring name‑server and data‑server clusters, block‑level replication, HA mechanisms, client caching, seamless scaling, multi‑data‑center disaster recovery, and open‑source support for C++, Java, and Nginx integration.

Distributed File System · Taobao · high availability
0 likes · 15 min read
Qunar Tech Salon
Nov 5, 2015 · Databases

Design and Architecture of TDSQL: A Distributed MySQL‑Based SQL System

The article describes how the limitations of an in‑memory NoSQL HOLD platform led to the creation of TDSQL, a distributed MySQL‑based SQL system featuring ZooKeeper‑coordinated Scheduler, Agent, and Gateway modules, automatic cross‑IDC disaster recovery, transparent horizontal scaling, strong synchronous replication, and future integration with container technologies.

Scalability · TDSQL · distributed database
0 likes · 19 min read
Qunar Tech Salon
Nov 4, 2015 · Backend Development

Evolution of 58.com Architecture: From Single‑Server All‑In‑One to Scalable Service‑Oriented System

The article chronicles how 58.com’s web architecture evolved from a tiny, single‑machine setup to a multi‑layer, Java‑based, highly available service‑oriented platform, detailing the technical decisions, scaling challenges, and automation practices adopted at each traffic milestone.

Scalability · architecture · high availability
0 likes · 14 min read
Efficient Ops
Sep 23, 2015 · Operations

How Tencent Powers Millions with SET‑Based NoSQL Clusters

Tencent’s operations team explains how its SET‑based NoSQL clusters deliver ultra‑low latency, high availability, and seamless disaster recovery for billions of users, detailing deployment models, synchronization mechanisms, cost‑saving techniques, and the Data‑as‑Service approach that underpins its massive social platforms.

Cost Optimization · Data as a Service · Distributed Systems
0 likes · 12 min read
21CTO
Sep 16, 2015 · Databases

How TDSQL Achieves Scalable, High‑Availability Distributed SQL on MySQL

This article explains how TDSQL transforms MySQL into a distributed, high‑availability SQL system by addressing NoSQL limitations, introducing a Scheduler‑Agent‑Gateway architecture, automatic scaling, sharding, robust disaster‑recovery mechanisms, and future integration with container technologies.

Auto Scaling · distributed database · high availability
0 likes · 19 min read
21CTO
Aug 31, 2015 · Backend Development

Scaling JD.com’s Product Detail Pages with Dynamic, High‑Performance Architecture

This article details the evolution and redesign of JD.com’s product detail page architecture, describing the transition from static HTML generation to a dynamic, high‑performance, multi‑datacenter system built on key‑value storage, Nginx + Lua, asynchronous processing, multi‑level caching, and robust scaling and reliability strategies.

Backend Architecture · Lua · caching
0 likes · 34 min read
High Availability Architecture
Aug 31, 2015 · Backend Development

High‑Availability Architecture for JD.com Product Detail Pages

This article describes how JD.com redesigned its product detail page system from a static, cache‑heavy architecture to a fully dynamic, multi‑level cached service using Nginx+Lua, JIMDB, and asynchronous workers, addressing scalability, performance, and high‑availability challenges for billions of daily page views.

Backend Architecture · Jimdb · Product Detail Page
0 likes · 30 min read
Architect
Aug 26, 2015 · Backend Development

Design Considerations for Master/Slave Distributed Cache with Proxy and CAS

The article analyzes the use of a master/slave architecture for distributed caching, explains why two clusters, CAS, and proxy are employed, discusses consistency and availability challenges, and evaluates possible mitigation strategies for cache failures.

CAS · Consistency · Master‑Slave
0 likes · 7 min read
Java High-Performance Architecture
Aug 24, 2015 · Operations

How DRBD Enables Real-Time Block-Level Replication for High Availability

DRBD (Distributed Replicated Block Device) is a software‑based, network‑driven block replication solution that mirrors disks, partitions, or logical volumes across servers in real time. It supports synchronous and asynchronous modes and transparent failover, and it sits as a layer between the filesystem and the physical storage.

Asynchronous Replication · DRBD · block replication
0 likes · 3 min read
21CTO
Aug 15, 2015 · Backend Development

Inside Weibo’s Third‑Generation Backend Architecture: Scalability and High‑Availability

An in‑depth look at Weibo’s evolution to its third‑generation backend system, detailing the orthogonal decomposition model, three‑tier horizontal layering, key middleware such as MCQ, Motan RPC, SSDCache, and the WatchMan tracing platform that together enable high‑availability, massive concurrency, and low‑latency services for billions of users.

Backend Architecture · Distributed Tracing · high availability
0 likes · 12 min read
High Availability Architecture
Aug 7, 2015 · Backend Development

Highlights from High‑Availability Architecture Discussions: Plugins, Kanban Tools, Scheduling Services, Massive Tables, and ID Generation

This article compiles recent high‑availability architecture discussions covering plugin‑based system design, simple kanban tools, independent scheduling services, challenges of managing 60‑billion‑row tables, and the trade‑offs between UUIDs and custom distributed ID generators, offering practical insights for backend engineers.

Backend · Plugins · Scheduling
0 likes · 7 min read