Tagged articles
2122 articles
Java Captain
Sep 11, 2018 · Fundamentals

A Comprehensive Skill Tree and Learning Guide for Software Engineers

This article presents a detailed skill‑tree and practical learning roadmap for software engineers, covering computer fundamentals, data structures, Java core concepts, multithreading, JVM internals, databases, distributed systems, personal branding, and effective study methods such as videos, books, and paid resources.

Algorithms · Backend · Distributed Systems
17 min read
Architects' Tech Alliance
Sep 11, 2018 · Databases

Pessimistic vs Optimistic Locking: A Story-Based Explanation

This article uses a humorous story of two threads battling over inventory updates to illustrate the principles, advantages, and drawbacks of pessimistic and optimistic locking in distributed systems, helping readers understand when to apply each concurrency control method.

Concurrency Control · Distributed Systems · database
8 min read
Big Data and Microservices
Sep 10, 2018 · Backend Development

Understanding RPC: Architecture, Workflow, and Core Technologies

This article explains the fundamentals of Remote Procedure Call (RPC), covering its definition, architectural components, communication protocols, serialization process, complete call workflow, and the main technologies and frameworks used in modern backend systems.

Distributed Systems · RPC · Service Architecture
9 min read
Architecture Digest
Sep 10, 2018 · Backend Development

Low‑Latency and High‑Availability Design of RocketMQ for Double‑11 Peak Traffic

This article reviews the evolution of Alibaba's Aliware message engine, analyzes the latency and availability challenges faced during Double‑11, and describes the low‑latency optimizations, capacity‑guarantee strategies, and multi‑replica high‑availability architecture implemented in RocketMQ to sustain trillion‑level message flows.

Distributed Systems · Low latency · Message Queue
22 min read
21CTO
Sep 5, 2018 · Fundamentals

Why Exactly‑Once Delivery Is Impossible: Understanding Message Delivery Semantics

The article explains how messages travel in monolithic and distributed systems, why network communication is inherently unreliable, and how different delivery semantics—at‑most‑once, at‑least‑once, and exactly‑once—affect reliability, ordering, and protocol choices such as AMQP and MQTT.

At-Least-Once · Distributed Systems · Exactly-Once
14 min read
21CTO
Sep 4, 2018 · Fundamentals

Why Traditional Service Architecture Beats Decentralized Designs in Enterprise Integration

This article examines the shortcomings of decentralized and micro‑service‑only approaches for enterprise integration, argues for version‑less services, explains how a centralized ESB and multi‑center architecture can balance security, efficiency, and scalability, and outlines practical design patterns and component roles.

Distributed Systems · Microservices · SOA
15 min read
Architects' Tech Alliance
Aug 27, 2018 · Fundamentals

Design Principles and Architecture of Distributed File Systems

This article provides a comprehensive overview of distributed file systems, covering their historical evolution, essential requirements, architectural models (centralized and decentralized), persistence strategies, scalability, high availability, performance optimization, security mechanisms, and additional considerations such as space allocation, file deletion, small‑file handling, and fingerprint‑based deduplication.

Consistency · Distributed Systems · Scalability
19 min read
Java Captain
Aug 25, 2018 · Blockchain

Implementing a Simple Blockchain in Java with Web API Endpoints

This tutorial walks through building a simple blockchain in Java, covering core concepts, block and transaction structures, proof‑of‑work implementation, and exposing API endpoints via Servlets for creating transactions, mining blocks, viewing the chain, and achieving consensus across nodes.

Blockchain · Consensus · Distributed Systems
27 min read
21CTO
Aug 23, 2018 · Databases

Mastering Database Transactions: From ACID to Distributed Saga

This article explains the fundamentals of database transactions, the ACID properties, and how they are implemented, then delves into distributed transaction challenges and solutions such as two‑phase commit, three‑phase commit, XA, and Saga patterns, highlighting their trade‑offs and practical usage.

2PC · ACID · Distributed Systems
19 min read
Youzan Coder
Aug 17, 2018 · Databases

Designing ZanKV: A Scalable Distributed KV Store Built on RocksDB, Raft, and Redis Protocol

This article details the design, architecture, and implementation of ZanKV—a high‑performance, distributed key‑value store that combines RocksDB storage, etcd‑Raft consensus, and a Redis‑compatible protocol, covering data partitioning, namespace isolation, expiration strategies, cross‑datacenter deployment, and performance tuning.

Distributed Systems · KV Store · Performance Tuning
23 min read
MaGe Linux Operations
Aug 16, 2018 · Operations

Unlock Zabbix Monitoring: Complete Setup, Custom Alerts & Distributed Management

Zabbix offers a web‑based, enterprise‑grade solution for distributed system and network monitoring; this guide walks Linux ops engineers through why monitoring matters, key availability metrics, what to monitor, step‑by‑step installation, web UI configuration, custom checks, alerting, visualization, template sharing, full‑network scaling, auto‑discovery, proxy deployment, and SNMP integration.

Alerting · Distributed Systems · Linux
23 min read
Architects' Tech Alliance
Aug 11, 2018 · Industry Insights

Mastering Cache Architecture: From CDN to Distributed Systems

This article provides a comprehensive overview of caching in large distributed systems, covering CDN, reverse‑proxy, local and distributed caches, popular implementations such as Ehcache, Guava, Memcached and Redis, common pitfalls like consistency and avalanche, and practical solutions with real‑world case studies.

Distributed Systems · Ehcache · Guava
11 min read
dbaplus Community
Aug 11, 2018 · Databases

Achieving Multi‑Active Disaster Recovery with Distributed Databases in Finance

Amid rising cloud outages and strict financial regulations, this article examines traditional multi‑active database solutions such as Oracle RAC and IBM GDPS, contrasts them with modern distributed database designs, and details SequoiaDB’s multi‑active architecture and concrete disaster‑recovery procedures for single‑node, site‑wide, and network failures.

Distributed Systems · Financial Services · SequoiaDB
13 min read
Qunar Tech Salon
Aug 10, 2018 · Cloud Native

Deep Dive into Ant Financial’s Service Mesh Data Plane SOFAMosn

This article provides a comprehensive technical overview of Ant Financial’s Service Mesh data‑plane SOFAMosn, covering its architectural background, layered design, core modules, performance optimizations, supported protocols, and future roadmap within a cloud‑native microservices ecosystem.

Cloud Native · Distributed Systems · Golang
13 min read
ITFLY8 Architecture Home
Aug 9, 2018 · Backend Development

Essential Backend Architecture Knowledge Map for High‑Concurrency Systems

This article presents a curated knowledge map covering backend architecture, Java best practices, microservice design, distributed consistency, high‑traffic handling, Alibaba frameworks, and design patterns, giving architects of high‑concurrency internet systems a comprehensive reference of essential concepts and skills.

Architecture · Backend · Design Patterns
2 min read
AntTech
Aug 7, 2018 · Databases

A Timeline Review of Optimistic Concurrency Control (OCC) from Theory to Production Systems

This article presents a chronological overview of Optimistic Concurrency Control (OCC), covering its early theoretical foundations, key research papers, prototype implementations such as MVCC+OCC+2PC and Hekaton, and its adoption in modern distributed NewSQL databases like Megastore, F1, and MaaT, highlighting both advantages and challenges.

Concurrency Control · Database Transactions · Distributed Systems
33 min read
ITFLY8 Architecture Home
Aug 4, 2018 · Backend Development

From 0 to 25M Users: Lessons in Scaling Backend Services Over Six Years

The author chronicles the six‑year evolution of a backend system, from its initial three‑day launch through successive capacity expansions, distributed refactoring, and a micro‑service architecture, highlighting the challenges of scaling to millions of users, handling concurrency, and the continuous pursuit of simplicity and resilience.

Distributed Systems · Microservices · Performance Optimization
6 min read
System Architect Go
Jul 29, 2018 · Databases

What Is Elasticsearch? Core Concepts and Fundamentals

Elasticsearch is an open‑source, scalable, high‑availability distributed full‑text search engine that operates in near real‑time, using clusters of nodes, indexes, documents, shards and replicas to efficiently store and retrieve large volumes of data.

Cluster · Distributed Systems · Elasticsearch
4 min read
Java Backend Technology
Jul 22, 2018 · Backend Development

Why RPC Is the Hidden Backbone of Modern Enterprise Services

This article explains how Remote Procedure Call (RPC) underpins micro‑service architectures, connects load balancers like Nginx, powers big‑data frameworks such as Hadoop, and fuels AI platforms like TensorFlow, and it argues that mastering RPC is essential for senior backend engineers.

Distributed Systems · RPC · service communication
10 min read
Big Data and Microservices
Jul 21, 2018 · Industry Insights

How System Architecture Evolves: From Single-Server LAMP to Distributed Services

This article traces the step‑by‑step evolution of a web system—from a single‑server LAMP setup through service and data separation, caching, server clusters, read/write splitting, CDN acceleration, distributed databases, NoSQL integration, business splitting, and finally distributed services—highlighting the motivations, characteristics, and practical outcomes of each stage.

Backend · Distributed Systems · Scalability
8 min read
Youzan Coder
Jul 20, 2018 · Big Data

How Youzan Built a Scalable Big Data Development Platform (DP)

This article details the design, architecture, and operational experience of Youzan's Data Platform (DP), covering its scheduling, data‑sync, service, and monitoring modules, the custom Airflow‑based task scheduler, current production metrics, supported task types, and future improvement plans.

Airflow · Big Data · Data Platform
12 min read
UCloud Tech
Jul 20, 2018 · Game Development

Global Server Architecture for Multiplayer Games: Microservices, UDP & Deployment

This article explores the advantages, architectural layout, network planning, and practical technologies for global server games, emphasizing microservice-based modular design, automated deployment, frame‑synchronization over UDP, centralized versus distributed deployment models, and considerations of latency, data consistency, and blockchain integration.

Distributed Systems · Game Architecture · UDP
10 min read
360 Tech Engineering
Jul 19, 2018 · Backend Development

Understanding Idempotency in Software Systems and How to Ensure It

The article explains the concept of idempotency, why it is critical for data consistency in backend services, and presents practical techniques such as unique business identifiers, locking, optimistic concurrency, deduplication tables, distributed locks, token mechanisms, and payment buffering to achieve reliable idempotent operations.

Distributed Systems · Idempotency
5 min read
21CTO
Jul 17, 2018 · Fundamentals

Mastering Distributed Architecture: Principles, Applications, and Evolution

This article explains the core concepts of distributed architecture, including cohesion and transparency, explores common applications such as distributed file systems, caches, databases, and middleware like MyCat, and outlines the evolutionary stages and challenges of building scalable distributed systems.

Distributed Systems · Mycat · System Architecture
11 min read
ITPUB
Jul 15, 2018 · Databases

How Meituan Dianping Evolved MySQL HA: From MMM to MHA+Zebra and Beyond

This article traces Meituan Dianping's MySQL high‑availability journey, detailing the legacy MMM system, its migration to MHA, integration with Zebra and Proxy middleware, and future architectural ideas such as distributed agents, semi‑sync replication, and MySQL Group Replication.

Distributed Systems · MHA · MySQL
12 min read
Java Captain
Jul 11, 2018 · Backend Development

Essential Knowledge Areas: Source Code Analysis, Distributed Architecture, Microservices, Performance Optimization, Java Engineering, and Concurrency Programming

This article outlines the critical knowledge points for backend developers, covering source code analysis, distributed systems, microservice architecture, performance tuning, Java engineering practices, and comprehensive concurrency programming techniques.

Distributed Systems · Performance Optimization · backend-development
6 min read
System Architect Go
Jul 8, 2018 · Backend Development

Overview of NSQ: Architecture, Components, and Usage

NSQ is a lightweight, high‑performance open‑source message queue written in Go, and this article explains its core components (nsqd, nsqlookupd, nsqadmin, utilities), message flow architecture, configuration details, deployment recommendations, and additional considerations such as persistence, delivery guarantees, and scalability.

Distributed Systems · Message Queue · NSQ
5 min read
JD Tech
Jul 4, 2018 · Big Data

ClickHouse Overview: Features, Performance, Engines, and Comparison with Hadoop

This article introduces ClickHouse as a high‑performance, column‑oriented database designed for real‑time big‑data analytics, outlines its key features, performance characteristics, supported interfaces, differences from Hadoop, and explains its main storage engines—MergeTree and Distributed—while also noting its current limitations.

ClickHouse · Columnar Database · Distributed Systems
11 min read
Java Backend Technology
Jul 4, 2018 · Backend Development

Designing High‑Availability Distributed Systems: SOA, Microservices & Service Mesh

This article explores the evolution and core concepts of modern distributed architectures—including SOA, microservices, and service mesh—explains key theories such as CAP and BASE, and provides practical guidelines for achieving high availability, scalability, and efficient content delivery through techniques like load balancing, CDN, and gray‑release strategies.

CAP theorem · Distributed Systems · Microservices
18 min read
Java Architect Essentials
Jul 3, 2018 · Backend Development

Detailed Explanation of the Dubbo RPC Call Process from Consumer to Provider

This article provides a step‑by‑step walkthrough of how a Dubbo consumer initiates a remote call, how the request is processed through clustering, load balancing, and invocation layers, and how the provider receives, executes, and returns the result, including the underlying code structures and network interactions.

Distributed Systems · Dubbo · RPC
7 min read
AntTech
Jul 3, 2018 · Backend Development

Evolution of Financial‑Grade Message Queues at Ant Financial

The article reviews the ten‑year evolution of Ant Financial's message queue, detailing its core reliability, consistency, availability and performance requirements, the architectural mechanisms built to meet them, the shift to pull‑mode and API‑mode designs, and the recent integration of compute capabilities to create a smart data transmission platform.

Big Data · Distributed Systems · Message Queue
13 min read
ITFLY8 Architecture Home
Jul 1, 2018 · Fundamentals

Why Componentization and Modularity Are Key to Microservice Success

This article explains core microservice concepts—including componentization, modularization, centralized vs. distributed systems, and the architectural principles behind microservices—while comparing them to SOA and highlighting practical distinctions that help developers design loosely‑coupled, reusable services.

Componentization · Distributed Systems · Microservices
8 min read
Alibaba Cloud Developer
Jun 19, 2018 · Backend Development

Mastering Dubbo: From Java RMI Basics to Building a Full RPC Application

This article explains the fundamentals of remote method invocation, compares traditional RMI with modern RPC frameworks, introduces Dubbo's architecture and roles, and provides a step‑by‑step guide to quickly develop, build, and run a complete Dubbo application using API‑centric configuration and multicast registration.

Distributed Systems · Dubbo · Microservices
14 min read
Architecture Digest
Jun 17, 2018 · Backend Development

From Moore's Law to Distributed Architecture: Load Balancing, Session Management, and Database Sharding

The article explains how the slowdown of Moore's Law drives the shift from monolithic applications to distributed systems, covering load balancing, session handling strategies, read‑write separation, and database sharding to achieve scalable, high‑availability backend architectures.

Backend Architecture · Distributed Systems · Session Management
8 min read
Architecture Digest
Jun 9, 2018 · Backend Development

Evolution of Architecture for Large-Scale Websites

The article outlines the key characteristics of large-scale websites and traces their architectural evolution from single‑server setups to multi‑tier, cache‑enhanced, clustered, and distributed systems, highlighting strategies such as load balancing, database read/write separation, CDN usage, NoSQL adoption, and service‑oriented decomposition.

Distributed Systems · Scalability · backend-development
5 min read
Java Backend Technology
Jun 9, 2018 · R&D Management

How to Transition from Senior Developer to Software Architect: Key Steps and Skills

This article explains the common misconceptions about software architects, outlines the essential conditions and skills required for senior developers to upgrade, describes practical learning paths, and details the daily responsibilities and interpersonal abilities needed to succeed as an architect.

Career Development · Distributed Systems · Software Architecture
12 min read
Architecture Digest
Jun 7, 2018 · Backend Development

Technical Summary of Large-Scale Distributed E‑Commerce Website Architecture

This article provides a comprehensive technical overview of large distributed website architecture, covering performance, high availability, scalability, security, and agility, and illustrates the evolution, design patterns, and practical optimization techniques for modern e‑commerce platforms.

Architecture · Distributed Systems · Scalability
32 min read
21CTO
Jun 6, 2018 · Operations

From Single Machines to Distributed Architecture: Tracing the Evolution of IT Systems

This article outlines the four major stages of IT architecture evolution—from single‑machine setups, through dual‑machine hot‑standby, multi‑node active clusters, to fully distributed systems—explaining the motivations, challenges, and technologies that drive each transition.

Architecture · Distributed Systems · IT infrastructure
8 min read
Architecture Digest
Jun 6, 2018 · Operations

Evolution of System Architecture: From Single‑Machine to Distributed Solutions

The article outlines the four major stages of enterprise IT architecture—single‑machine, dual‑machine hot‑standby, multi‑node active‑active, and distributed architectures—explaining their motivations, advantages, limitations, and how businesses should choose the appropriate model based on performance, availability, and scalability requirements.

Architecture · Distributed Systems · Scalability
8 min read
ITPUB
Jun 5, 2018 · Operations

How Meituan Achieved Near‑Zero Downtime for Its Account Service

This article details Meituan's practical approaches to boosting account service reliability, covering MTBF/MTTR metrics, business‑level monitoring, flexible availability with circuit‑breaker patterns, cross‑region active‑active deployment, data synchronization techniques, and the measurable performance gains achieved.

Active-Active · Distributed Systems · circuit breaker
13 min read
dbaplus Community
Jun 4, 2018 · Backend Development

How Weibo Scales to Billions: Inside Its Multi‑Layer Cache Architecture

This article explains how Weibo handles massive daily traffic of over a hundred billion requests by employing a five‑layer feed system and a six‑layer cache architecture that evolved from simple KV storage to sophisticated counter and existence‑check services, highlighting design choices, performance optimizations, and future directions.

Cache · Distributed Systems · Weibo
16 min read
Architecture Digest
Jun 4, 2018 · Backend Development

Evolution and Key Concepts of Distributed Architecture for Large-Scale Systems

This article traces the historical development of distributed architecture from early computers to modern large‑scale web systems, explains core concepts such as clustering, replication, and middleware, and outlines eight evolutionary stages that address scalability, reliability, and performance challenges.

Architecture · Backend · Distributed Systems
10 min read
Tencent Cloud Developer
Jun 1, 2018 · Backend Development

Building Tencent Xinge: Architecture and Practices for Massive Mobile Push Service

The talk details Tencent Xinge’s architecture and cloud‑native practices that enable hundred‑billion‑level mobile push, combining terminal integration, real‑time backend filtering, distributed bitmap selection, precise‑push AI models, and DevOps pipelines to deliver fast, scalable, data‑driven notifications with effect tracking.

Backend Architecture · Big Data · Distributed Systems
18 min read
Tencent Cloud Developer
May 31, 2018 · Backend Development

Tencent Billing System (Mi Master): Architecture, Reliability, Security, and Global Capabilities

The Mi Master billing platform, a SaaS service from Tencent that processes over 735 billion RMB quarterly, provides a unified, modular architecture with distributed multi‑master databases, high‑availability cross‑region deployment, multi‑stage fraud detection, and global support for 80+ payment channels across 180+ countries, delivering seamless APIs, automated reconciliation, and extensive revenue‑sharing tools for products such as Honor of Kings, PUBG, WeChat Pay, and QQ Wallet.

Distributed Systems · Reliability · billing architecture
19 min read
Efficient Ops
May 30, 2018 · Databases

How SF Express Transformed Its Database Operations: From Legacy to Open‑Source, Distributed, and Intelligent Ops

This talk details SF Express’s journey from heterogeneous legacy databases to standardized open‑source, distributed architectures and intelligent operations, covering standardization, migration to open‑source, scaling with Mycat, automated resource pooling, and the ThinkDB platform that drives proactive, automated DBA workflows.

Distributed Systems · Mycat · Operations
18 min read
Efficient Ops
May 21, 2018 · Databases

Why Do Database Failures Happen and How to Prevent Them?

This article examines common hardware and network failures in data centers, analyzes real‑world outage cases, classifies fault domains, and presents comprehensive strategies for database fault handling—including logging, checkpointing, backup, replication, and high‑availability architectures—to improve reliability and reduce downtime.

Backup · Distributed Systems · database
22 min read
Java Backend Technology
May 20, 2018 · Backend Development

Which Cache Update Strategy Guarantees Consistency? A Deep Dive into DB‑Cache Synchronization

This article examines three common cache‑update approaches—updating the cache after the database, deleting the cache before updating the database, and updating the database then deleting the cache—analyzes their drawbacks, and presents practical solutions such as delayed double‑delete and retry mechanisms to ensure data consistency.

Backend · Cache · Consistency
10 min read
Alibaba Cloud Developer
May 16, 2018 · Cloud Computing

From Mall to Cloud: How Alibaba’s Tech Evolution Shaped Modern Cloud Computing

Senior Alibaba engineer Xiao Xie recounts his decade‑long journey from the early Taobao Mall project to leading the Cloud Computing “Flying‑Sky Eight” team, detailing pivotal initiatives like the Five‑Color Stone integration, full‑link stress testing for Double‑11, and the evolution toward self‑developed, distributed cloud technologies.

Alibaba · Cloud Computing · Distributed Systems
12 min read
ITFLY8 Architecture Home
May 12, 2018 · Backend Development

What Drives the Architecture of Billion‑User Platforms? Lessons from Weibo

This article explores the essence of system architecture for massive web services, illustrating strategic and tactical considerations through examples like Uber and Weibo, and discusses key capabilities such as abstraction, classification, performance, service decomposition, multi‑level caching, distributed tracing, and continuous learning for scalable backend design.

Distributed Systems · Performance Optimization · System Architecture
21 min read
Qunar Tech Salon
May 11, 2018 · Databases

Minsheng Bank’s Distributed Transformation and NewSQL Practice with SequoiaDB

The article details Minsheng Bank’s shift to distributed architecture, outlining regulatory drivers, business requirements, the adoption of sharding, cross‑center high‑availability, and new‑type distributed databases, and showcases performance results of SequoiaDB 3.0 across multiple high‑throughput banking scenarios.

Banking · Distributed Systems · NewSQL
9 min read
21CTO
May 9, 2018 · Operations

How Alipay Built Seamless High Availability and Disaster Recovery for Millions of Transactions

This article examines Alipay's evolution from a simple single‑datacenter setup to a multi‑active‑active, unit‑based architecture, detailing the technical challenges of high availability, disaster recovery, failover design, blue‑green deployment, and how these solutions enable continuous service during massive traffic spikes like Double 11.

Alipay · Blue‑Green deployment · Distributed Systems
17 min read
Architecture Digest
May 9, 2018 · Operations

High Availability and Disaster Recovery Architecture: The Evolution of Alipay’s System Design

This article examines the importance of high‑availability and disaster‑recovery architectures, tracing Alipay’s evolution from a simple load‑balanced setup through multi‑datacenter, failover, and unit‑based designs that address scalability, data consistency, and continuous service delivery challenges.

Distributed Systems · Scalability · disaster recovery
16 min read
21CTO
May 5, 2018 · Backend Development

From Single Server to Scalable Architecture: Key Lessons from Large‑Scale Site Design

This comprehensive note distills the evolution of large‑website architecture—from single‑server setups to layered, distributed, and highly available systems—covering caching, clustering, read/write separation, CDN, NoSQL, business splitting, scalability, extensibility, and automation strategies.

Distributed Systems · high availability · large-scale architecture
0 likes · 20 min read
Architecture Digest
May 5, 2018 · Backend Development

Evolution and Core Principles of Large‑Scale Website Architecture

This article summarizes the evolution stages, architectural patterns, and key concerns such as performance, scalability, extensibility, high availability, and distributed design that large‑scale websites must address, providing practical insights and visual diagrams for each concept.

Distributed Systems · Scalability · caching
0 likes · 21 min read
Mike Chen's Internet Architecture
May 3, 2018 · Backend Development

Fundamentals and Evolution of Large-Scale Website Architecture Design

This article explains the essence of software architecture as a process of reducing system entropy through splitting and merging, outlines the capabilities required of architects, and details the step‑by‑step evolution of large‑scale website infrastructures including caching, CDN, database sharding, and messaging systems.

Distributed Systems · Scalability · database sharding
0 likes · 10 min read
ITFLY8 Architecture Home
May 2, 2018 · Backend Development

How WeChat’s SeqSvr Generates Trillions of Sequence Numbers Daily

This article explains the design and evolution of WeChat's high‑availability seqsvr service, which provides per‑user 64‑bit sequence numbers for data synchronization, handling trillions of requests with millisecond latency through pre‑allocation, section sharing, and layered storage architecture.

Distributed Systems · Scalability · sequence generation
0 likes · 10 min read
Architecture Digest
Apr 29, 2018 · Backend Development

Designing High‑Concurrency Architecture for Large‑Scale E‑Commerce Applications

This article outlines practical strategies for building high‑concurrency back‑end systems—including server architecture, load balancing, database clustering, caching, message queues, asynchronous processing, and service‑oriented design—to ensure smooth operation of traffic‑intensive e‑commerce services.

Backend Architecture · Distributed Systems · caching
0 likes · 19 min read
Java Captain
Apr 26, 2018 · Backend Development

Dubbo Overview, Architecture, and a Step‑by‑Step Demo with Zookeeper and Spring

This article introduces Dubbo’s background, explains the evolution of e‑commerce architectures to RPC‑based distributed systems, details Dubbo’s components, advantages, and drawbacks, and provides a complete Maven‑based demo, including ZooKeeper installation, Spring configuration, and Java code, for building and consuming a Dubbo service.

Distributed Systems · Dubbo · RPC
0 likes · 19 min read
Meituan Technology Team
Apr 19, 2018 · Backend Development

How Meituan Waimai Supports Ten Million Daily Orders: Evolution of Its Backend Architecture

Meituan Waimai handles ten‑million daily orders by evolving from a tiny monolithic prototype to a distributed, micro‑service‑based platform that uses sharded databases, caches, set‑based traffic partitioning, automated AIOps, dynamic container scaling, prioritized degradation switches, and AI‑driven features to sustain massive, growing traffic.

Distributed Systems · Meituan · Scalability
0 likes · 19 min read
Architecture Digest
Apr 18, 2018 · Databases

Understanding Distributed Architecture and Its Applications in MySQL and Large‑Scale Systems

The article explains the concept of distributed architecture, its key characteristics such as cohesion and transparency, showcases how MySQL and middleware like Mycat are used in e‑commerce platforms, and outlines the evolution, practical implementations, and challenges of building scalable distributed database systems.

Big Data · Database Architecture · Distributed Systems
0 likes · 15 min read
ITFLY8 Architecture Home
Apr 11, 2018 · Fundamentals

Mastering Distributed System Design: Core Principles Every Engineer Should Know

This article outlines essential distributed system concepts—including system decomposition, concurrency, caching strategies, online vs. offline processing, push/pull communication, load limiting, service degradation, CAP theorem, and eventual consistency—to help engineers design scalable, reliable architectures for high‑traffic applications.

CAP theorem · Distributed Systems · Microservices
0 likes · 13 min read
Architecture Digest
Apr 10, 2018 · Fundamentals

Reliability, Scalability, and Maintainability in Distributed System Design

This article examines core distributed system design principles—reliability, scalability, and maintainability—explaining how techniques such as replication, partitioning, consensus algorithms, and transactions address hardware, software, and human failures, and discusses vertical and horizontal scaling strategies to achieve robust, extensible, and maintainable architectures.

Consensus · Distributed Systems · Replication
0 likes · 8 min read
dbaplus Community
Apr 8, 2018 · Databases

Mastering Multi‑Tenant Load Balancing in Alibaba Cloud Table Store

This article explains the architecture, data model, and multi‑tenant load‑balancing strategies of Alibaba Cloud Table Store, detailing the challenges of distributed NoSQL systems and presenting practical solutions for resource quantification, fairness, trigger timing, and SLA‑driven automation.

Alibaba Cloud · Distributed Systems · NoSQL
0 likes · 20 min read
21CTO
Apr 6, 2018 · Cloud Native

Why Service Mesh Is the Next Evolution of Microservices

This article examines the limitations of traditional microservice frameworks, introduces service mesh as a solution with sidecar architecture, outlines its definition, evolution stages, and timeline, and concludes with resources for further learning and practical implementation.

Cloud Native · Distributed Systems · Microservices
0 likes · 9 min read
Architecture Digest
Apr 6, 2018 · Cloud Native

An Overview of Service Mesh: Addressing the Limitations of Traditional Microservices

This article reviews the challenges of early microservice frameworks—high technical barriers, limited multi‑language support, and intrusive code—and explains how service mesh architectures with sidecar proxies, exemplified by Linkerd, Envoy, and Istio, provide a dedicated, language‑agnostic infrastructure layer that simplifies service governance and operations.

Cloud Native · Distributed Systems · Istio
0 likes · 9 min read
Efficient Ops
Apr 1, 2018 · Backend Development

Ele.me’s Secret to Seamless Multi-Region Active-Active Architecture

This article details how Ele.me engineered a cross‑region active‑active system that scales elastically, tolerates whole‑data‑center failures, and maintains real‑time food‑delivery performance through geographic sharding, intelligent routing, and robust data‑replication middleware.

Distributed Systems · data replication · geographic sharding
0 likes · 18 min read
AntTech
Mar 29, 2018 · Artificial Intelligence

Ant Group CTO Cheng Li’s Money 20/20 Asia Presentation on FinTech Innovation: AI, Blockchain, Cloud and Mobile Payments

In his Money 20/20 Asia keynote, Ant Group CTO Cheng Li outlines the company’s fintech roadmap, highlighting AI‑driven risk engines, blockchain‑based trust mechanisms, cloud‑native infrastructure, and innovative mobile payment solutions that aim to make financial services more inclusive and efficient.

Artificial Intelligence · Blockchain · Cloud Computing
0 likes · 16 min read
Architecture Digest
Mar 28, 2018 · Operations

Implementing High-Concurrency Performance Testing and Practical Solutions Based on Server Architecture

This article explains the concept of high concurrency, outlines a server architecture that supports it—including load balancing, distributed databases, NoSQL caches and CDN—and presents practical testing methods and implementation patterns such as caching strategies and message‑queue designs to handle massive simultaneous requests.

Distributed Systems · Server Architecture · caching
0 likes · 7 min read
Architecture Digest
Mar 26, 2018 · Operations

Alipay’s Double 11 Architecture: Logical Data Centers, Distributed Transactions, and High‑Availability Strategies

The article details Alipay’s comprehensive architecture for the Double 11 shopping festival, covering its three‑layer IaaS/PaaS/SaaS model, logical data‑center design, multi‑active disaster recovery, blue‑green deployment, distributed data sharding, transaction processing, and the Ant Credit Pay service’s performance and risk‑control mechanisms.

Alipay · Architecture · Big Data
0 likes · 16 min read
Architecture Digest
Mar 20, 2018 · Backend Development

Source Code Analysis, Distributed Architecture, Microservices, Performance Optimization, and Java Engineering Overview

This article discusses the importance of source code analysis, outlines key concepts in distributed systems, explains microservice architecture, highlights performance optimization techniques for Java applications, and presents practical engineering advice for modern backend development.

Distributed Systems · Java engineering · Microservices
0 likes · 8 min read
Java Backend Technology
Mar 19, 2018 · Fundamentals

Why Distributed Consistency Matters: From CAP to BASE Explained

This article explores the importance of data consistency in distributed systems, illustrating real‑world scenarios, explaining consistency models such as strong, weak and eventual, and detailing the challenges and theories like CAP and BASE that guide system designers in balancing consistency, availability, and partition tolerance.

BASE theory · CAP theorem · Consistency
0 likes · 18 min read
Efficient Ops
Mar 15, 2018 · Operations

How Baidu’s CCS System Scales Command Execution Across Millions of Servers

This article examines Baidu’s Cluster Control System (CCS), detailing its two‑level data model, four‑tier scheduling architecture, and three‑layer execution agents, and explains how control and execution information, redundancy, and fault‑tolerant designs enable reliable large‑scale command execution across thousands of servers.

Command Execution · Distributed Systems · Operations
0 likes · 12 min read
Efficient Ops
Mar 15, 2018 · Operations

Mastering Large-Scale Command Execution: From Basics to Baidu’s Cluster Control System

This article explores the fundamentals of command execution, examines the challenges of scaling command delivery across hundreds of thousands of servers, and details Baidu’s Cluster Control System architecture that enables efficient, flexible, and extensible distributed command management for operations teams.

Command Execution · Deployment · Distributed Systems
0 likes · 10 min read
Java Backend Technology
Mar 13, 2018 · Fundamentals

Why Consistent Hashing Is the Key to Scalable Redis Clusters

This article explains the limitations of simple modulo hashing for Redis clusters, introduces consistent hashing with a virtual‑node ring to achieve fault tolerance and seamless scaling, and demonstrates how the algorithm reduces data skew and improves cache performance in distributed systems.

Distributed Systems · caching · consistent hashing
0 likes · 11 min read
360 Zhihui Cloud Developer
Mar 1, 2018 · Operations

AI-Driven Strategies for Optimizing Resource Management in Distributed Systems

This article reviews cloud gaming resource management, introduces search‑engine instance distribution techniques, explores AI‑based disk‑failure prediction and load forecasting, and presents replica and DDoS‑detection strategies to improve efficiency and reliability of large‑scale distributed systems.

AI · Distributed Systems · failure prediction
0 likes · 12 min read
ITFLY8 Architecture Home
Feb 28, 2018 · Backend Development

Inside Alibaba’s Live Streaming Architecture: Lessons from a Senior Engineer

In this extensive interview, senior Alibaba engineer Chen Kangxian shares his experiences designing large‑scale distributed systems, live‑streaming platforms, and high‑concurrency architectures, offering practical insights on technology choices, failure handling, and career growth for software architects.

Distributed Systems · Microservices · Software Architecture
0 likes · 34 min read
Hulu Beijing
Feb 28, 2018 · Big Data

How Hulu’s Nesto Engine Delivers Near‑Real‑Time OLAP on TB‑Scale Data

This article introduces Hulu's in‑house OLAP engine Nesto, detailing its near‑real‑time data ingestion, nested data model, TB‑level storage using HBase and Parquet, MPP query execution, custom predicate library, and the overall architecture that enables sub‑second ad‑hoc queries for user analytics.

Big Data · Columnar Storage · Distributed Systems
0 likes · 22 min read
Java Backend Technology
Feb 27, 2018 · Backend Development

Mastering Large-Scale Website Architecture: 10 Essential Patterns Explained

This article outlines ten fundamental architecture patterns for high‑traffic websites—including layering, partitioning, distribution, clustering, caching, asynchronous processing, redundancy, automation, and security—explaining their goals, benefits, challenges, and best‑practice constraints to help engineers build scalable, reliable, and maintainable systems.

Distributed Systems · Scalability · automation
0 likes · 11 min read
Java Backend Technology
Feb 22, 2018 · Backend Development

From Single Server to Global Scale: Evolution of Large Website Architecture

This article explores the defining traits of large‑scale websites and walks through the step‑by‑step evolution of their architecture—from single‑server setups to distributed systems with caching, load balancing, database sharding, and micro‑services—while highlighting common design pitfalls and best‑practice recommendations.

Backend Architecture · Distributed Systems · Scalability
0 likes · 8 min read
AI Cyberspace
Jan 29, 2018 · Backend Development

Mastering Celery: Periodic Tasks, Sync Calls, Result Storage, and Monitoring

Explore how to configure Celery’s periodic (Beat) tasks, perform synchronous task calls, persist results using Redis, monitor workers with Flower, and debug remotely via telnet, with practical code examples and step‑by‑step instructions for robust backend task management.

Distributed Systems · Python · Task Queue
0 likes · 7 min read
ITFLY8 Architecture Home
Jan 28, 2018 · Backend Development

Designing Scalable E‑Commerce Architecture: From Business to Technical Layers

This article explores how to design a high‑performance, highly available, and scalable e‑commerce platform by separating business and technical architectures, detailing subsystem decomposition, scaling strategies, and the evolution from simple single‑server setups to distributed, clustered solutions.

Architecture · Backend · Distributed Systems
0 likes · 13 min read
Meituan Technology Team
Jan 26, 2018 · Big Data

Design and Implementation of a Real-Time Data Processing System at Meituan

Meituan designed a Storm‑based real‑time data processing platform that guarantees at‑least‑once delivery and high availability, employs a custom spout, regression‑driven traffic smoothing, and a low‑latency KV store with atomic operations, persisting results in Kafka, MySQL and Cellar to power merchant dashboards and heat‑tag analytics, while planning broader real‑time analytics expansion.

Big Data · Distributed Systems · Storm
0 likes · 10 min read
Java Backend Technology
Jan 21, 2018 · Backend Development

Is Microservices Doomed? Uncovering the Hidden Complexities Behind the Hype

The article critically examines microservices, outlining their promised benefits such as independent development, deployment, and scaling, while exposing the hidden operational, DevOps, state‑management, communication, versioning, and distributed‑transaction challenges that can turn them into a fragile, overly complex system.

Distributed Systems · Microservices · Software Architecture
0 likes · 15 min read
Architect's Tech Stack
Jan 20, 2018 · Backend Development

What Is Microservices? Concepts, Design Guidelines, Integration Patterns, and Trade‑offs

This article explains the microservices architecture style, compares it with monolithic and SOA approaches, outlines design principles, communication mechanisms, data decentralization, integration patterns, and discusses the advantages and disadvantages of adopting microservices in modern software systems.

Distributed Systems · Service Integration · design principles
0 likes · 16 min read
Vipshop Quality Engineering
Jan 17, 2018 · Backend Development

Why Zookeeper Connections Fail After 1 MB and How to Fix Them

A new scheduled task in a staging environment kept failing because ZooKeeper dropped connections whenever a packet exceeded the default 1 MB jute.maxbuffer limit. The article explains the root cause and the heartbeat timing, and shows how raising the limit with -Djute.maxbuffer or upgrading ZooKeeper resolves the issue.

Backend · Distributed Systems · ZooKeeper
0 likes · 4 min read