Tagged articles
21CTO
Apr 16, 2017 · Operations

Which Load‑Balancing Strategy Guarantees the Highest Reliability?

This article explains common load‑balancing strategies—round‑robin, random, minimum response time, minimum concurrency, and hash—detailing their principles, advantages, drawbacks, and mathematical reliability analysis, including probability formulas and visual illustrations to help choose the most fault‑tolerant approach for distributed systems.

Distributed Systems · Round Robin · load balancing
0 likes · 9 min read
Architecture Digest
Apr 16, 2017 · Operations

Common Load‑Balancing Strategies and Their Reliability Analysis in Distributed Systems

The article reviews hardware and software load‑balancing, explains classic strategies such as round‑robin, random, minimum‑response‑time, least‑connections and hash, and quantitatively evaluates their fault‑tolerance using probability formulas and example scenarios in distributed systems.

Distributed Systems · Least Connections · Round Robin
0 likes · 10 min read
ITFLY8 Architecture Home
Apr 11, 2017 · Fundamentals

Mastering Distributed Consistency: Paxos, NWR Model, and CAP Trade‑offs

This article explains how distributed systems balance consistency, availability, and performance by covering consensus algorithms like Paxos, replication models such as Master‑Slave and Master‑Master, and Amazon Dynamo’s N‑R‑W and Vector Clock approaches, while illustrating the CAP theorem trade‑offs.

Consensus · Distributed Systems · Dynamo
0 likes · 12 min read
21CTO
Apr 11, 2017 · Backend Development

Simulating 10 Billion Red‑Packet Requests on One Server: Achieving 60k QPS with Go

This article details how to design, implement, and benchmark a single‑machine backend capable of handling up to 1 million concurrent connections and 60 000 queries per second while simulating the shake‑and‑send red‑packet workflow of a large‑scale messaging app, including capacity calculations, architecture choices, Go‑based implementation, and multi‑stage performance testing.

Backend · Distributed Systems · Golang
0 likes · 18 min read
Architecture Digest
Apr 11, 2017 · Backend Development

Design and Practice of a High‑Throughput Spring Festival Red‑Packet System Supporting One Million Connections

This article describes how to design, implement, and evaluate a backend system that simulates the Spring Festival red‑packet service, achieving up to 6 × 10⁴ QPS on a single server while handling one million concurrent connections, and discusses the hardware, software, architecture, and performance results.

Backend · Distributed Systems · Golang
0 likes · 18 min read
ITFLY8 Architecture Home
Apr 11, 2017 · Databases

Why High Availability Triggers a Consistency‑Performance Trade‑off in Distributed Databases

The article explains how achieving high availability through data redundancy introduces consistency challenges that in turn affect performance, and it reviews partitioning, mirroring, consistency models, replication architectures, and two/three‑phase commit protocols in distributed systems.

Data Consistency · Distributed Systems · Replication
0 likes · 18 min read
ITFLY8 Architecture Home
Apr 9, 2017 · Backend Development

Mastering Distributed Locks and Idempotency for High‑Concurrency Systems

This article explores the challenges of mutual exclusion and idempotency in distributed environments, explains the underlying principles of locks in multi‑threaded and multi‑process contexts, and presents practical implementations using Zookeeper, Redis, Tair, and the Cerberus and GTIS frameworks to ensure reliable, scalable operations.

Distributed Systems · Idempotency · ZooKeeper
0 likes · 35 min read
Architecture Digest
Apr 6, 2017 · Fundamentals

Distributed Service System Consistency: Best Practices and Patterns

This article examines the challenges of achieving consistency in large‑scale distributed service systems, outlines common inconsistency scenarios such as split‑brain and lost updates, and presents practical patterns—including ACID/BASE trade‑offs, two‑phase and three‑phase commit, TCC, query, compensation, and reliable messaging—to guide engineers in designing robust, eventually consistent architectures.

ACID · BASE · Consistency
0 likes · 35 min read
Architecture Digest
Apr 1, 2017 · Backend Development

Distributed Consistency and Transactional Messaging Solutions

This article explains the challenges of achieving consistency in distributed systems and presents practical solutions such as two‑phase commit, asynchronous assurance, compensating transactions, message retry mechanisms, idempotent designs, and a custom Redis‑based delayed queue (DelayQ) with a transactional proxy (TMQProxy) to provide reliable transactional messaging.

Distributed Systems · Idempotency · Message Retry
0 likes · 19 min read
21CTO
Mar 31, 2017 · Backend Development

How to Build Highly Available and Scalable Distributed Systems

This article explains the key challenges of high availability and scalability in distributed architectures and provides practical solutions for each layer—entry, business, cache, and database—using techniques such as heartbeat IPs, stateless services, consistent hashing, and sharding.

Backend Architecture · Distributed Systems · Scalability
0 likes · 18 min read
Efficient Ops
Mar 26, 2017 · Operations

How Google Scales App Engine: Lessons in Cloud Scalability and SRE

The article shares Google SRE veteran Minghua Ye’s insights on App Engine’s evolution, emphasizing the critical role of automatic scalability, distributed locks, service discovery, load balancing, and open‑source tools like gRPC, Protobuf, gflags, glog, and Googletest in building reliable, high‑traffic cloud services.

Distributed Systems · Google App Engine · Protobuf
0 likes · 12 min read
ITPUB
Mar 22, 2017 · Backend Development

What Makes Taobao’s Massive Scale Demand Hundreds of Elite Engineers?

The article explains how a high‑traffic e‑commerce platform like Taobao relies on distributed storage, search engines, massive caching, load‑balancing, CDN, sophisticated advertising and analytics systems, all of which require large teams of top engineers to design, implement, and operate.

Backend · Distributed Systems · Scalability
0 likes · 12 min read
Alibaba Cloud Infrastructure
Mar 15, 2017 · Operations

Alibaba IDC and Network Monitoring System Architecture and Practices

The article details Alibaba's globally distributed IDC and network monitoring systems, describing their fully distributed data collection, centralized computation, storage strategies, alarm mechanisms, and frontend visualization that together enable real‑time infrastructure and network health management for large‑scale operations.

Distributed Systems · IDC · Infrastructure
0 likes · 13 min read
Architecture Digest
Mar 9, 2017 · Backend Development

Common Design Issues and Best Practices for Distributed System Interfaces

The article outlines key challenges in distributed API design—including date formatting, decimal precision, response structures, idempotency, security, and naming consistency—and provides practical recommendations to improve usability, scalability, and maintainability across backend services.

Backend · Distributed Systems · Idempotency
0 likes · 12 min read
21CTO
Feb 21, 2017 · Backend Development

How WeChat and Alibaba Handle Billions of Red Packets: High‑Concurrency Architecture Secrets

This article examines the high‑availability architectures behind massive online transaction systems such as Alibaba's Double 11 sales, Alipay and WeChat red packets, detailing the challenges of billions of requests and the engineering solutions that ensure performance, reliability, and security.

Distributed Systems · WeChat · high concurrency
0 likes · 20 min read
ITFLY8 Architecture Home
Feb 16, 2017 · Backend Development

How VIPshop Evolved from Monolithic LAMP to Distributed Service Architecture

This article examines VIPshop's transformation from a single‑application LAMP system to a vertically split and finally distributed service‑oriented architecture, detailing the business model, key design requirements, platform governance, and the technical services that enable a scalable e‑commerce operation.

Cloud Computing · Distributed Systems · Microservices
0 likes · 13 min read
ITPUB
Feb 10, 2017 · Backend Development

How to Generate Globally Unique IDs in Distributed Systems: Snowflake and Its Variants

This article explains the challenges of generating globally unique IDs across distributed shards, outlines the requirements for such IDs, and details Twitter's Snowflake algorithm—including its structure, generation process, and clock handling—before exploring three notable Snowflake variants and their trade‑offs.

Backend · Distributed Systems · ID generation
0 likes · 10 min read
Architects' Tech Alliance
Feb 10, 2017 · Industry Insights

Inside Scality Ring: How Its Scale‑Out Architecture Powers Object, File, and Block Storage

The article provides a detailed technical overview of Scality Ring 6.0, explaining its three‑layer software stack, x86‑based scale‑out hardware, diverse connectors, storage node design, management tools, routing protocol, data durability, multi‑site deployment models, and consistency guarantees.

Data Durability · Distributed Systems · Multi-site Deployment
0 likes · 13 min read
ITFLY8 Architecture Home
Feb 7, 2017 · Operations

Master System Architecture: CAP Theory, Shared‑Nothing, Load Balancing & HA

This article explores core system architecture concepts—including the CAP theorem and its BASE extension, the shared‑nothing design, various load‑balancing algorithms and deployment modes, and high‑availability patterns such as active‑standby, active‑active and clustering—providing practical guidance for building scalable, reliable distributed applications.

CAP theorem · Distributed Systems · high availability
0 likes · 22 min read
Alibaba Cloud Developer
Feb 4, 2017 · Cloud Computing

Inside Alibaba’s Feitian Middleware: Powering Massive E‑Commerce and Cloud Innovation

Alibaba’s transformation from e‑commerce leader to tech powerhouse is highlighted by its Feitian middleware platform, a cloud‑based, highly available solution that supports diverse industries, enables massive transaction volumes, and exemplifies the evolution of large‑scale distributed architectures pioneered since Alibaba’s early IOE days.

Alibaba Cloud · Distributed Systems · Microservices
0 likes · 7 min read
dbaplus Community
Jan 15, 2017 · Databases

How JD’s JIMDB Achieves Zero‑Downtime Scaling and Automatic Failover for Massive Caches

JIMDB is JD’s in‑house distributed cache platform that combines automatic fault detection, seamless online scaling, multi‑language support, and containerized deployment to replace traditional Memcached/Redis solutions, offering features such as one‑click cluster creation, elastic expansion, lossless scaling, and comprehensive monitoring for high‑traffic e‑commerce services.

Cache · Distributed Systems · elastic scaling
0 likes · 23 min read
Architecture Digest
Jan 12, 2017 · Backend Development

Evolution of Internet Technical Architecture: From Single‑Server to Distributed Microservices

This article traces the evolution of internet‑scale technical architecture across three eras—single‑machine, cluster, and distributed—detailing the motivations, core patterns, advantages, and drawbacks of monolithic, layered, data‑separated, cached, load‑balanced, CDN‑accelerated, redundant, service‑oriented, sharded, and microservice designs.

Backend · Distributed Systems · Microservices
0 likes · 12 min read
21CTO
Jan 8, 2017 · Backend Development

Unlocking High‑Availability: A Sneak Peek at the New Internet Architecture Series

The author announces a forthcoming series on Internet high‑availability architecture, outlining topics such as CAP theory, distributed caching, SOA, message queues, search systems, and real‑world case studies, and invites readers to suggest additional content while promising detailed, valuable guidance for developers and architects.

Distributed Systems · Software Architecture · System Design
0 likes · 3 min read
360 Zhihui Cloud Developer
Jan 6, 2017 · Operations

How Qcmd Revolutionizes Automated Operations for 7,000+ Servers

Qcmd, the command execution system behind 360’s private HULK cloud platform, replaces SaltStack with an asynchronous, Golang‑based architecture that ensures high‑availability, encrypted messaging, and reliable mass‑host command execution across thousands of servers, dramatically reducing task timeouts and operational overhead.

Command Execution · Distributed Systems · Golang
0 likes · 10 min read
21CTO
Jan 4, 2017 · Operations

How to Build Truly High‑Availability Systems: Principles and Practices

This article explains what high availability means for distributed systems, outlines common availability tiers, and describes how redundancy, load balancing, and automatic failover across a typical Internet architecture can achieve reliable, scalable services.

Distributed Systems · Operations · Reliability
0 likes · 6 min read
Architecture Digest
Dec 30, 2016 · Operations

Zero‑Point Battle: Evolution of Alibaba's Double 11 High‑Availability Architecture

The talk details how Alibaba tackled the massive technical challenges of Double 11 over eight years by evolving a highly available, scalable architecture through capacity planning, distributed middleware, hybrid‑cloud deployment, online stress testing, and fine‑grained traffic control to balance cost, performance, and user experience.

Alibaba · Distributed Systems · Double 11
0 likes · 22 min read
Qunar Tech Salon
Dec 30, 2016 · Operations

Mesos Architecture and Its Practical Use at Qunar: Framework Unification and Operational Insights

This article explains the Mesos distributed system kernel, its resource‑allocation workflow, and how Qunar engineers applied and evolved Mesos, Marathon, and custom frameworks to achieve fine‑grained scheduling, high availability, service discovery, and multi‑tenant management in a large‑scale production environment.

Cluster Management · Distributed Systems · Framework
0 likes · 14 min read
Architects' Tech Alliance
Dec 23, 2016 · Fundamentals

Advanced Distributed Systems Theory: Paxos, Raft, and Zab

This article provides an in‑depth exploration of distributed consensus protocols, detailing the basics of Paxos, extending to Multi‑Paxos, and comparing it with Raft and Zab while discussing leader election, quorum, lease mechanisms, and practical considerations for implementing these algorithms in real‑world systems.

Consensus · Distributed Systems · Paxos
0 likes · 21 min read
Architects' Tech Alliance
Dec 22, 2016 · Fundamentals

Fundamentals of Distributed Systems: Consensus, 2PC/3PC, CAP Theorem, and Logical Clocks

This article introduces core distributed‑system concepts—including the definition of consensus, the two‑phase and three‑phase commit protocols, the CAP theorem and its engineering implications, and logical‑clock mechanisms such as Lamport timestamps, vector clocks, and version vectors—explaining their models, challenges, and practical trade‑offs.

CAP theorem · Consensus · Distributed Systems
0 likes · 21 min read
ITFLY8 Architecture Home
Dec 20, 2016 · Backend Development

Designing Scalable Web Architectures: Key Principles and Practices

This article explains the essential design principles, trade‑offs, and core components—such as availability, performance, reliability, scalability, manageability, and cost—required to build large‑scale, high‑availability web systems and illustrates them with an image‑hosting example.

Distributed Systems · caching · web architecture
0 likes · 37 min read
ITFLY8 Architecture Home
Dec 13, 2016 · Backend Development

How to Build a High‑Performance Flash Sale System: Strategies & Pitfalls

This article outlines the key technical challenges of flash‑sale ("seckill") systems, namely high concurrency, cache usage, distributed locking, database pressure, and overselling, and presents practical front‑end and back‑end design patterns, including atomic counters, Memcached decrements, queueing, Redis off‑loading, and two‑phase commit solutions.

Distributed Systems · caching · flash sale
0 likes · 8 min read
Architecture Digest
Dec 9, 2016 · Cloud Native

Deep Dive into Etcd Architecture, Consistency, Storage, Watch Mechanisms, and Comparison with Zookeeper and Consul

This article analyzes Etcd's distributed architecture, Raft‑based consistency, storage implementation, watch and lease mechanisms, differences between v2 and v3, and compares it with Zookeeper and Consul, providing practical usage tips and surrounding tooling for developers of distributed systems.

Consul · Distributed Systems · etcd
0 likes · 18 min read
Alibaba Cloud Developer
Dec 7, 2016 · Big Data

How Alibaba Handled Real‑Time Billions of Events During Double 11

This article outlines Alibaba Cloud's big‑data platform challenges and solutions during the 2016 Double 11 event, covering sub‑second real‑time processing, multi‑million‑records‑per‑second throughput, full‑day high availability, and massive offline workloads exceeding hundreds of petabytes.

Alibaba · Distributed Systems · MaxCompute
0 likes · 3 min read
Art of Distributed System Architecture Design
Dec 2, 2016 · Backend Development

Mastering Distributed Transaction Consistency: From CAP to Message‑Based Compensation

This article examines the fundamental challenges of achieving consistency in distributed systems, explains the CAP theorem, compares two‑phase and three‑phase commit protocols, explores XA transactions, and presents practical compensation patterns such as local message tables, non‑transactional and transactional MQ designs, highlighting their trade‑offs and applicability.

CAP theorem · Distributed Systems · Message Queue
0 likes · 15 min read
Architecture Digest
Dec 2, 2016 · Fundamentals

Fundamentals of Distributed Version Control with Git

This article explains the core concepts of distributed version control, compares it with centralized systems, describes repository structures, outlines the advantages of Git, and provides step‑by‑step command examples for initializing, committing, branching, merging, cloning, pulling, and pushing changes in a collaborative development workflow.

Collaboration · Distributed Systems · software development
0 likes · 21 min read
dbaplus Community
Nov 29, 2016 · Fundamentals

Essential Distributed System Components: ZooKeeper, Queues, Docker & Logs

Distributed systems rely on coordinated services such as ZooKeeper for state management, message queues like ActiveMQ for inter‑process communication, robust transaction handling, automated deployment tools like Docker, and comprehensive logging solutions, each playing a critical role in achieving high availability, scalability, and operational visibility.

Distributed Systems · logging
0 likes · 16 min read
Weidian Tech Team
Nov 28, 2016 · Big Data

How We Built the Mars Big Data Platform to Boost Development Efficiency

The article explains why Weidian needed a new big data development platform, outlines the functional features of the Mars system, describes its architecture, scheduling mechanisms, task execution flow, and discusses remaining challenges and future enhancements.

Distributed Systems · Hadoop · platform architecture
0 likes · 11 min read
Architects' Tech Alliance
Nov 25, 2016 · Databases

Why NoSQL Matters: From ACID to CAP and Beyond

An in‑depth overview of NoSQL databases explains the limitations of traditional relational systems, details ACID properties, introduces the CAP theorem and BASE model, compares RDBMS with NoSQL, outlines advantages, disadvantages, history, classifications, and real‑world usage examples.

ACID · BASE · CAP theorem
0 likes · 12 min read
Architecture Digest
Nov 23, 2016 · Backend Development

Evolution of .NET Web Architecture: From Single Server to Distributed Cloud Services

The article outlines the step‑by‑step evolution of a .NET‑based web system, describing how a single‑server setup grows into a multi‑tier, load‑balanced, clustered, stateless, micro‑service architecture that leverages caching, NoSQL, search engines, cloud services, Docker and CDN to handle large‑scale traffic and data processing.

Backend · Distributed Systems · caching
0 likes · 10 min read
Ctrip Technology
Nov 22, 2016 · Backend Development

Evolution and Service Decomposition of Qunar's Payment System (1.0 → 2.0)

The article outlines the five‑year evolution of Qunar's payment platform from a tightly coupled monolith (1.0) to a highly available, service‑oriented distributed architecture (2.0), detailing component breakdown, challenges, and the resulting core transaction, payment, cashier, and API layers.

Backend Architecture · Distributed Systems · financial technology
0 likes · 9 min read
ITFLY8 Architecture Home
Nov 20, 2016 · Backend Development

How Meizu Scales Real‑Time Push to 6 M Messages/min: Architecture, Pitfalls & Solutions

The article details Meizu’s real‑time push system that supports 25 million online users and 6 million messages per minute, describing its four‑layer architecture, power‑saving strategies, network‑instability fixes, massive‑connection handling, monitoring practices, and gray‑release deployment techniques.

Distributed Systems · high concurrency · monitoring
0 likes · 12 min read
Architecture Digest
Nov 19, 2016 · Backend Development

Why Distributed Systems Are Essential for Scaling Internet Services and How to Build Them

The article explains that massive user traffic forces servers to adopt distributed systems to achieve high throughput, concurrency, low latency, and load balancing, and it details architectural patterns, concurrency models, caching, storage, coordination, messaging, transaction, deployment, and monitoring techniques for building and managing such systems.

Backend · Distributed Systems · Message Queue
0 likes · 39 min read
Meituan Technology Team
Nov 11, 2016 · Operations

Common Service Fault Tolerance Patterns

The article explains how Meituan‑Dianping applies classic fault‑tolerance patterns—timeout and retry, rate limiting/load shedding, circuit breaker, bulkhead isolation, and fallback—to design for failure, prevent cascading service outages, and enhance system stability and high‑availability in a service‑oriented architecture.

Distributed Systems · Fallback · Retry
0 likes · 14 min read
Architecture Digest
Nov 10, 2016 · Operations

Interview with Lu Pengcheng on Mogu Street’s Monitoring System Architecture and Evolution

In this interview, Lu Pengcheng, a platform architect at Mogu Street, discusses the company’s large‑scale e‑commerce architecture, the evolution of its monitoring platform, design choices for high‑availability distributed systems, and future open‑source plans, providing practical insights for engineers and technical managers.

C++ · Distributed Systems · Operations
0 likes · 9 min read
dbaplus Community
Nov 10, 2016 · Backend Development

Why Distributed Systems Are Essential for Scaling Modern Web Services

The article explains why distributed systems are fundamental for handling massive web traffic, detailing concepts such as high throughput, concurrency, low latency, load balancing, layered architectures, concurrency models, caching, NoSQL storage, fault tolerance, scaling, deployment, and monitoring, while highlighting practical techniques and challenges.

Distributed Systems · NoSQL · Scalability
0 likes · 23 min read
ITFLY8 Architecture Home
Nov 5, 2016 · Operations

Distributed vs Cluster: What’s the Real Difference and When to Use Each?

This article explains the core differences between distributed systems and clusters, detailing their architectures, efficiency goals, typical use cases such as Hadoop MapReduce and load‑balancing clusters, and outlines key concepts like scalability, high availability, load balancing, and error recovery.

Cluster Computing · Distributed Systems · HPC
0 likes · 10 min read
Architects' Tech Alliance
Nov 3, 2016 · Industry Insights

Scaling Billion‑Level Ads: Architecture Lessons from Sogou’s Senior Engineer

In this interview, Sogou architect Liu Jian shares how his team built a highly available, scalable commercial advertising platform, discusses the evolution of its infrastructure, offers practical advice for engineers aspiring to become architects, and reflects on emerging technologies and time‑management strategies.

Architecture · Big Data · Distributed Systems
0 likes · 10 min read
Meituan Technology Team
Oct 28, 2016 · Big Data

Design and Architecture of the CAT Real-Time Monitoring System

The CAT real‑time monitoring system, open‑sourced in 2014 for Java applications, combines a lightweight ThreadLocal‑based client SDK, Netty‑driven asynchronous transport, and a highly scalable backend that processes ~100 TB of logs daily across 70 machines, using custom binary serialization, in‑memory modeling, segmented storage with 48‑bit indexing, and hourly aggregation to provide near‑full‑volume fault detection, localization, and performance analysis.

Architecture · Distributed Systems · Real-Time
0 likes · 18 min read
ITFLY8 Architecture Home
Oct 27, 2016 · Big Data

Inside Taobao’s Massive Data Architecture: How 1.5 PB Daily Is Processed and Served

The article explains Taobao’s five‑layer data product architecture—covering data sources, compute, storage, query, and product layers—and describes how massive volumes of data are ingested, processed in batch and streaming, stored in MySQL and HBase clusters, and served efficiently through a unified middle‑layer and sophisticated caching mechanisms.

Big Data · Distributed Systems · HBase
0 likes · 15 min read
ITPUB
Oct 22, 2016 · Backend Development

Designing Scalable High‑Concurrency Architecture: Practical Strategies and Patterns

This guide explains how to design and test high‑concurrency systems by choosing appropriate server architectures, load‑balancing, database clustering, caching layers, message queues, first‑level caches, and static data strategies to ensure smooth operation under heavy user traffic.

Backend Architecture · Distributed Systems · Load Testing
0 likes · 12 min read
High Availability Architecture
Oct 20, 2016 · Big Data

Understanding HDFS EditLog Format and Quorum Journal Manager Recovery Process

This article explains the HDFS EditLog file structure, the design of the Quorum Journal Manager for high‑availability, the write‑path optimizations such as batch flushing and double‑buffering, and the detailed Multi‑Paxos based recovery algorithm including isolation, segment selection, prepare and accept phases, and handling journal node failures.

Distributed Systems · EditLog · HDFS
0 likes · 12 min read
ITFLY8 Architecture Home
Oct 16, 2016 · Big Data

Mastering Data Sync, Real-Time Analytics, and Scalable Storage for Modern Systems

This article explains how to design and implement heterogeneous data synchronization, leverage batch and stream processing frameworks like Hadoop and Storm for large‑scale analysis, and choose appropriate storage solutions—from in‑memory databases to distributed column‑family stores—while addressing performance, reliability, and monitoring in complex distributed environments.

Big Data · Distributed Systems · data synchronization
0 likes · 26 min read
ITFLY8 Architecture Home
Oct 15, 2016 · Operations

How E‑Commerce Platforms Achieve High Availability and Scalability: Architecture Practices

This article outlines comprehensive e‑commerce platform architecture practices—including caching strategies, indexing, parallel and distributed computing, load balancing, sharding, high availability, monitoring, resource optimization, and messaging—to improve system performance, scalability, and reliability under high concurrency.

Architecture · Distributed Systems · caching
0 likes · 28 min read
ITFLY8 Architecture Home
Oct 8, 2016 · Fundamentals

Mastering Distributed Systems: Overcoming Network Challenges and Consistency Trade‑offs

This article explores the core difficulties of distributed systems—including network latency, failures, the CAP theorem, consistency models, and common techniques such as consistent hashing, quorum, vector clocks, lease mechanisms, gossip protocols, and distributed transaction protocols—providing practical insights and references for building robust scalable architectures.

CAP theorem · Distributed Systems · NWR quorum
0 likes · 22 min read
WeChat Backend Team
Sep 21, 2016 · Fundamentals

Understanding Master Election with Paxos: Lease Algorithm Explained

This article explains the concept of a Master role in distributed systems, how Master election relies on strong consistency via Paxos, details a lease‑based election algorithm, and discusses correctness guarantees and renewal mechanisms using versioning.

Consensus · Distributed Systems · Master Election
0 likes · 7 min read
Nightwalker Tech
Sep 12, 2016 · Databases

Technical Discussion on Learning Strategies, MySQL Replication, High Availability, PHP Coroutines, and Nginx Proxy

A community discussion covers practical ways to improve personal skills, detailed MySQL master‑slave and high‑availability setups, PHP coroutine mechanisms, and the behavior of Nginx when proxying image resources, providing insights into learning, database management, concurrency, and proxy configurations.

Distributed Systems · MySQL · PHP
0 likes · 12 min read
Architecture Digest
Sep 9, 2016 · Backend Development

Designing a High‑Concurrency, Reliable Backend Architecture for JD E‑Commerce

The article examines how JD’s e‑commerce platform tackles massive traffic spikes and high concurrency by employing modular system decomposition, distributed architecture, API‑centric services, performance tuning, and virtual elasticity to ensure stability, scalability, and a smooth user experience during major promotional events.

API Service · Backend Architecture · Distributed Systems
0 likes · 7 min read
dbaplus Community
Sep 6, 2016 · Big Data

Choosing the Right Log Collection Framework for Massive Data Streams

This article reviews major open‑source log collection tools—Chukwa, Scribe, Flume, Logstash, Kafka, and TT—examining their architectures, strengths, and limitations to help engineers select the most suitable solution for high‑volume, low‑latency data pipelines.

Apache Flume · Distributed Systems · Kafka
0 likes · 13 min read
Efficient Ops
Sep 5, 2016 · Operations

Inside Google’s Data Centers: How SRE Manages Hardware, Borg, and Global Services

This article explains how Google’s Site Reliability Engineering team designs and operates uniform hardware in its data centers, uses the Borg cluster manager, implements storage layers, SDN networking, monitoring, and a sample Shakespeare search service to achieve high‑availability, scalable production services.

Borg · Distributed Systems · Google SRE
0 likes · 21 min read
ITPUB
Aug 30, 2016 · Databases

How to Build a Distributed KV Store with Full ACID Guarantees

This talk walks through the evolution of databases, explains the challenges of building a distributed key‑value system that provides full ACID properties, and details the architectural components, sharding, migration, snapshot isolation, two‑phase commit, timestamp handling, and distributed transaction processing techniques.

ACID · Distributed Systems · NewSQL
0 likes · 13 min read
Huawei Cloud Developer Alliance
Aug 23, 2016 · Fundamentals

Why Git Beats SVN: Real‑World Scenarios Every Developer Should Know

The article explains the three main types of version control systems, contrasts centralized tools like SVN with distributed ones like Git, and illustrates through three developer scenarios how Git’s branching and conflict‑resolution features simplify collaboration, speed up workflows, and avoid common pitfalls.

Distributed Systems · Git · Version Control
0 likes · 5 min read
Java High-Performance Architecture
Aug 21, 2016 · Fundamentals

How ZooKeeper Leader Election Ensures High Availability in Distributed Systems

ZooKeeper's leader election mechanism creates temporary sequential nodes under a shared /ELECTION path, allowing services to elect a primary node, monitor predecessor nodes, and automatically re-elect a new leader when the current one fails, ensuring continuous high availability across distributed applications.

Distributed Systems · ZooKeeper · leader election
0 likes · 6 min read
Efficient Ops
Aug 1, 2016 · Operations

How Tencent Shifted 70M Users During Tianjin Explosion – A Multi‑Active Ops Playbook

This article details how Tencent's operations team orchestrated a seamless, zero‑impact migration of over 70 million users across three data centers during the 2015 Tianjin explosion, highlighting the four key capabilities—distribution, scheduling, data synchronization, and automated operations—that enabled multi‑active disaster recovery at massive scale.

Distributed Systems · Operations · data synchronization
0 likes · 22 min read
Huawei Cloud Developer Alliance
Aug 1, 2016 · Databases

What Is NFV and Why Databases Are Its Brain?

This article explains Network Function Virtualization (NFV), its advantages over traditional telecom hardware, the critical role of distributed high‑performance databases in enabling elastic scaling, rapid service rollout, and telecom‑grade reliability, and outlines the challenges of unifying database solutions for NFV.

Distributed Systems · NFV · Network Function Virtualization
0 likes · 6 min read
Architect
Jul 14, 2016 · Backend Development

Using Multiple Streams and Groups in Apache Storm Topology

This article explains how to declare and emit multiple stream IDs in Apache Storm, demonstrates code examples for MultiStream and MultiGroup patterns, discusses common pitfalls, and shows how to abstract stream declarations and bolt configurations for more flexible and dynamic topologies.

Apache Storm · Distributed Systems · java
0 likes · 9 min read

Designing a Business‑Oriented High Availability Architecture for a Game Access System

The article presents a business‑centric high‑availability solution for a large‑scale game access platform, detailing measurable goals, a three‑dimensional architecture that includes client‑side retry, HTTP‑DNS, functional separation, multi‑region active‑active deployment, and automated, visual monitoring to achieve rapid problem detection, recovery, and minimal outage frequency.

Distributed Systems · business continuity · fault tolerance
0 likes · 23 min read
DevOps
Jul 8, 2016 · Fundamentals

Understanding Version Control Systems: From Manual Practices to Distributed VCS

This article explains the principles of version control systems, illustrating the evolution from manual file‑locking methods to modern distributed workflows with checkout, check‑in, merging, branching, and collaborative features exemplified by the custom Hit and HitHub platforms.

Collaboration · Distributed Systems · Git
0 likes · 11 min read
Baidu Intelligent Testing
Jul 7, 2016 · Operations

Challenges and Pain Points of Distributed System Integration Testing

This article examines the common pain points of integration testing in distributed systems, including weak cross‑coupling coverage, lack of clear standards, chaotic composite scenarios, and insufficient risk assessment, and discusses current mitigation approaches and open questions for future exploration.

Distributed Systems · Software Testing · integration testing
0 likes · 7 min read
Architecture Digest
Jul 6, 2016 · Backend Development

Designing a Message Queue: Key Considerations and Architecture

The article explains why and when to use message queues, then walks through designing one from scratch, covering decoupling, eventual consistency, broadcast, flow control, RPC protocols, high availability, storage choices, consumer relationships, reliable delivery, transactions, performance optimizations, and push versus pull models.

Asynchronous · Backend Architecture · Distributed Systems
0 likes · 35 min read
Art of Distributed System Architecture Design
Jul 4, 2016 · Backend Development

Understanding Microservice Architecture: Principles, Characteristics, and Deployment

The article explains microservice architecture as a lightweight, service‑oriented approach where each service performs a single, well‑defined function, runs in its own process with an independent database, can be deployed separately on various platforms, and communicates via HTTP or other lightweight protocols, highlighting its key characteristics and benefits.

Distributed Systems · Microservices · Software Architecture
0 likes · 13 min read
Art of Distributed System Architecture Design
Jun 30, 2016 · Backend Development

Evolution of 58.com Architecture: From a Single‑Server All‑In‑One to a Scalable Service‑Oriented System

The article chronicles how 58.com’s website architecture evolved from a tiny, single‑machine setup to a distributed, service‑oriented platform by progressively addressing scaling challenges with LAMP, read/write separation, vertical business splitting, caching, cloud‑native services, and automation, illustrating the practical steps needed as traffic grows from tens of thousands to billions of requests.

Architecture · Backend · Distributed Systems
0 likes · 15 min read
dbaplus Community
Jun 27, 2016 · Backend Development

Why Monoliths Struggle and How Service‑Oriented Architecture Can Help

The article examines the high development and operational costs of traditional monolithic applications, outlines service‑oriented solutions such as splitting, decoupling, and layering, details practical implementation techniques like subscription‑publish mechanisms, fault isolation, and governance, and discusses the limits and future evolution toward microservices.

DevOps · Distributed Systems · service governance
0 likes · 23 min read
Qunar Tech Salon
Jun 24, 2016 · Backend Development

Overview of Alibaba's Open Source Projects

This article provides a comprehensive overview of Alibaba's numerous open‑source projects, ranging from high‑performance service frameworks and databases to messaging middleware, frontend tools, testing platforms, and infrastructure utilities, highlighting their key features and typical use cases.

Alibaba · Backend · Big Data
0 likes · 22 min read
dbaplus Community
Jun 23, 2016 · Backend Development

Mastering Rate Limiting: Algorithms, Application, Distributed and Edge Strategies

This article provides a comprehensive guide to rate limiting in high‑concurrency systems, covering core concepts, token‑bucket and leaky‑bucket algorithms, application‑level techniques with Guava, distributed implementations using Redis+Lua and Nginx+Lua, and edge‑layer controls via Nginx modules, complete with configuration examples and test results.

Distributed Systems · Guava · Token Bucket
0 likes · 28 min read
WeChat Client Technology Team
Jun 22, 2016 · Fundamentals

Understanding Production‑Grade Paxos: How PhxPaxos Works and Its Engineering Secrets

This article explains the core principles and engineering details of the production‑grade Paxos library PhxPaxos, covering consistency concepts, the roles of proposer, acceptor and learner, instance management, state‑machine integration, performance optimizations, checkpointing, and correctness verification in distributed asynchronous environments.

Consensus · Distributed Systems · Paxos
0 likes · 29 min read
ITFLY8 Architecture Home
Jun 19, 2016 · Backend Development

Master Rate Limiting: Token & Leaky Buckets, Distributed Strategies

This article explains how caching, degradation, and especially rate limiting—using token bucket, leaky bucket, and counter‑based methods—protect high‑concurrency systems, covering algorithm basics, application‑level techniques, and distributed implementations with Redis+Lua and Nginx+Lua.

Distributed Systems · Token Bucket · leaky bucket
0 likes · 18 min read
ITFLY8 Architecture Home
Jun 19, 2016 · Backend Development

How to Tackle Common Cache Problems in Distributed Systems

This article explores typical cache challenges in distributed systems—including data consistency, high availability, cache avalanche, and cache penetration—explaining their causes, real‑world scenarios, and practical mitigation strategies to ensure reliable and efficient caching.

Cache · Data Consistency · Distributed Systems
0 likes · 9 min read
Tencent Music Tech Team
Jun 17, 2016 · Backend Development

Design Considerations for a High‑Scale Messaging System: Capacity Estimation, Consistency Guarantees, and Avalanche Prevention

Designing Quanmin K‑Song’s high‑scale messaging system requires careful capacity estimation of throughput, storage and network traffic, robust consistency via unique transaction IDs and operation logs, and avalanche prevention through selective retries, scaling and priority‑based throttling to maintain reliability under load.

Consistency · Distributed Systems · avalanche prevention
0 likes · 7 min read
ITFLY8 Architecture Home
Jun 16, 2016 · Backend Development

Optimizing Large‑Scale E‑Commerce Site Architecture: Business Splitting, Clustering, Caching & SSO

This article examines the analysis and optimization of a high‑traffic e‑commerce website, covering problems such as server waste and tight coupling, and presenting solutions including business splitting, application clustering, multi‑level caching, distributed sessions, and deployment diagrams to improve scalability and reliability.

Architecture · Backend · Distributed Systems
0 likes · 7 min read
ITFLY8 Architecture Home
Jun 12, 2016 · Backend Development

Designing a Scalable E‑Commerce Architecture: From Simple Setup to Distributed Systems

This article walks through the functional and non‑functional requirements of a B2C e‑commerce platform, illustrates a progression from a three‑server starter architecture to a clustered high‑availability design, and details capacity‑planning calculations for supporting millions of users and peak traffic spikes.

Architecture · Backend · Distributed Systems
0 likes · 9 min read
ITFLY8 Architecture Home
Jun 9, 2016 · Operations

Mastering Load Balancing: Principles, Types, and Practical Tips

This article explains how load balancing solves high‑traffic, high‑concurrency challenges by distributing requests across multiple servers, covering its core principles, vertical and horizontal scaling, various classification methods such as DNS, IP, link‑layer, and hybrid approaches, along with their advantages, drawbacks, and deployment recommendations.

Distributed Systems · high availability · horizontal scaling
0 likes · 10 min read
ITFLY8 Architecture Home
Jun 9, 2016 · Fundamentals

Distributed vs Cluster: Key Differences and When to Use Each

This article explains the core distinctions between distributed systems and clusters, covering their architectures, efficiency goals, typical use cases, and examples such as Hadoop MapReduce and load‑balancing clusters, while also detailing cluster types, high‑availability, load balancing, and high‑performance computing.

Cluster Computing · Distributed Systems · High‑performance computing
0 likes · 10 min read
21CTO
Jun 7, 2016 · Operations

Mastering Load Balancing: Lessons from Alibaba’s VIPServer Journey

This article explores the fundamentals and advanced techniques of load balancing, compares DNS round‑robin with dedicated load balancers, discusses scaling strategies, health‑check mechanisms, and introduces Alibaba’s VIPServer as a modern mid‑tier solution addressing real‑world operational challenges.

Distributed Systems · VIPServer · cloud operations
0 likes · 21 min read
Architecture Digest
Jun 5, 2016 · Backend Development

Web System Load Balancing, Caching, and Scaling Strategies for High Traffic

This article explains how to design and optimize web system architecture—including load‑balancing methods, multi‑level caching, MySQL scaling, distributed deployment, and disaster‑recovery techniques—to handle traffic growth from hundreds of thousands to hundreds of millions of daily visits.

Backend Architecture · Distributed Systems · MySQL
0 likes · 19 min read
Architecture Digest
May 24, 2016 · Game Development

Evolution of Game Server Architectures: From Early MUDs to Modern Action MMOs

This article traces the historical progression of game server architectures—from simple HTTP‑based servers for low‑interaction mobile games, through the first‑generation MUDOS engine, second‑generation database‑driven designs, third‑generation seamless world nodes with dynamic load balancing, to specialized Battle.net, casual, and modern action MMO server models—highlighting the technical motivations and challenges at each stage.

Distributed Systems · MMO · Networking
0 likes · 18 min read
Java High-Performance Architecture
May 23, 2016 · Cloud Native

What Uber’s Microservices Reveal About the Pros and Cons of Distributed Architecture

Uber’s adoption of microservices showcases the flexibility of using multiple languages and independent release cycles, but it also exposes challenges such as duplicated effort across teams, type‑unsafe JSON interfaces, and the need for rigorous failure testing, offering valuable lessons for large‑scale system design.

Architecture · Distributed Systems · Microservices
0 likes · 5 min read