Tagged articles

2122 articles

Page 8 of 22

Jul 26, 2022 · Operations

Mastering Service Governance: From Distributed Systems to Reliable RPC Frameworks

This article explains the fundamentals of service governance in distributed systems, covering RPC communication, common anomalies, replica types, design principles, performance metrics, and the architecture and workflow of RPC frameworks, while also detailing key practices such as registration, discovery, load balancing, traffic shaping, version compatibility, circuit breaking, degradation, and rate limiting.

Distributed SystemsMicroservicesRPC

0 likes · 15 min read

Mastering Service Governance: From Distributed Systems to Reliable RPC Frameworks

Selected Java Interview Questions

Jul 25, 2022 · Backend Development

Four Approaches to Solving Session Management in Distributed Environments

This article examines four strategies for handling session data in distributed backend systems—session replication, client-side storage, hash-based load balancing, and centralized storage with Redis—detailing their principles, advantages, disadvantages, and practical recommendations for different deployment scales.

BackendDistributed SystemsSession

0 likes · 9 min read

Four Approaches to Solving Session Management in Distributed Environments

Top Architect

Jul 25, 2022 · Backend Development

Monolithic vs Microservice Architecture: Pros, Cons, Service Gateway, Registration, and Configuration Center

This article explains the differences between monolithic and microservice architectures, outlines their advantages and disadvantages, and then dives into essential components such as service gateways, service registration and discovery, and distributed configuration centers, providing practical guidance for building scalable backend systems.

Backend ArchitectureConfiguration CenterDistributed Systems

0 likes · 26 min read

Monolithic vs Microservice Architecture: Pros, Cons, Service Gateway, Registration, and Configuration Center

FunTester

Jul 24, 2022 · Operations

Boost Service Reliability with Chaos Engineering: Practical Steps & Evaluation

Chaos engineering, a discipline for experimenting on distributed systems, helps teams identify hidden weaknesses, improve high‑availability, and build confidence in production by defining stable states, injecting realistic failures, and measuring impact through observability metrics, with practical steps, tool choices, maturity stages, and evaluation methods.

Distributed SystemsFault InjectionObservability

0 likes · 11 min read

Boost Service Reliability with Chaos Engineering: Practical Steps & Evaluation

Top Architect

Jul 23, 2022 · Cloud Native

Comprehensive Guide to etcd: Overview, Architecture, Deployment, and Usage

This article provides a detailed introduction to etcd, covering its purpose as a highly available distributed key‑value store, core concepts like Raft consensus, key features, common use cases such as service discovery and configuration management, step‑by‑step installation for single‑node and cluster deployments, and essential etcdctl commands for managing data and cluster members.

Cloud NativeConfiguration ManagementDistributed Systems

0 likes · 24 min read

Comprehensive Guide to etcd: Overview, Architecture, Deployment, and Usage

IT Architects Alliance

Jul 21, 2022 · Big Data

Explanation of Kafka Components and Architecture

This article provides a comprehensive overview of Kafka’s core components—including brokers, topics, partitions, producers, and consumers—explaining their roles, log structures, replication mechanisms, and the system’s architecture, supplemented with practical visual illustrative diagrams.

Distributed SystemsMessagingPartitioning

0 likes · 8 min read

Explanation of Kafka Components and Architecture

Meituan Technology Team

Jul 21, 2022 · Backend Development

Visualized Full‑Chain Log Tracing for Complex Business Systems

The article analyzes the shortcomings of traditional ELK and distributed tracing for complex business systems, proposes a visualized full‑chain log tracing solution that organizes and dynamically links logs by business chain, and demonstrates its implementation and performance gains at Meituan’s content platform.

BackendDSLDistributed Systems

0 likes · 26 min read

Visualized Full‑Chain Log Tracing for Complex Business Systems

Aikesheng Open Source Community

Jul 21, 2022 · Databases

Migrating from MyCat to DBLE: Parameter Settings, Sharding Functions, and SQL Compatibility

This article details a low‑impact migration from MyCat to DBLE, covering required parameter adjustments, sharding function replacement, data node handling, and business‑SQL compatibility issues, while providing concrete configuration examples and practical recommendations for a smooth transition.

DBLEDistributed SystemsMycat

0 likes · 7 min read

Migrating from MyCat to DBLE: Parameter Settings, Sharding Functions, and SQL Compatibility

FunTester

Jul 21, 2022 · Databases

Scaling Databases with Distributed Sharding and Peak‑Shaving Strategies

The article explains why simply adding hardware cannot keep up with growing database workloads and presents a three‑step approach—business isolation, horizontal sharding, and advanced reporting using PolarDB‑X—to achieve high concurrency, elastic capacity, and efficient caching for modern high‑traffic applications.

Distributed SystemsPeak ShavingPolarDB-X

0 likes · 7 min read

Scaling Databases with Distributed Sharding and Peak‑Shaving Strategies

Architecture Digest

Jul 21, 2022 · Backend Development

Implementing a Reliable Redis‑Based Delay Queue in Go

This article explains how to design and implement a precise, persistent, and retry‑capable delay queue using Redis ordered sets and Lua scripts, provides a complete Go library with usage examples, and discusses the underlying data structures, atomic operations, and garbage‑collection mechanisms.

Distributed SystemsGoMessage Queue

0 likes · 11 min read

Implementing a Reliable Redis‑Based Delay Queue in Go

Java High-Performance Architecture

Jul 20, 2022 · Backend Development

Mastering Load Balancing: From Round Robin to Consistent Hashing in Java

This article explains common load‑balancing strategies—including round‑robin, random, weighted, smooth weighted round‑robin, consistent hashing, least‑active, and fastest‑response algorithms—provides Java implementations with code samples, discusses their advantages, disadvantages, and suitable scenarios, and offers practical guidance for choosing the right method in distributed systems.

Distributed SystemsRound Robinconsistent hashing

0 likes · 35 min read

Mastering Load Balancing: From Round Robin to Consistent Hashing in Java

Laravel Tech Community

Jul 19, 2022 · Backend Development

The Evolution and Architecture of China’s 12306 Railway Ticketing System

This article examines the historical development, distributed architecture, and high‑concurrency challenges of China’s 12306 railway ticketing platform, tracing its origins from early Unix‑based systems to modern multi‑layered backend solutions that support hundreds of millions of users during peak travel periods.

Backend ArchitectureDistributed SystemsRailway

0 likes · 8 min read

The Evolution and Architecture of China’s 12306 Railway Ticketing System

Baidu Geek Talk

Jul 19, 2022 · Cloud Native

How OpenTelemetry and Jaeger Power Cloud‑Native Tracing

This article explains cloud‑native observability, defines its three pillars—metrics, tracing, and logging—details the OpenTelemetry tracing data model and Span structure, reviews industry implementations such as Jaeger and Alibaba Eagle Eye, and shares practical challenges and solutions from real‑world production use.

Alibaba Eagle EyeCloud NativeDistributed Systems

0 likes · 11 min read

How OpenTelemetry and Jaeger Power Cloud‑Native Tracing

Architect's Guide

Jul 19, 2022 · Backend Development

Design and Implementation of a Custom Java RPC Framework

This article presents a comprehensive overview of a hand‑crafted RPC framework built with Spring Boot, Netty, and Zookeeper, covering its core concepts, architecture, custom message protocol, load‑balancing strategies, serialization choices, starter design, deployment steps, and testing procedures.

Distributed SystemsNettyRPC

0 likes · 15 min read

Design and Implementation of a Custom Java RPC Framework

IT Architects Alliance

Jul 18, 2022 · Backend Development

Cache Design and Optimization Strategies in High‑Concurrency Distributed Systems

The article explains the benefits, costs, and various update, penetration, no‑hole, avalanche, and hot‑key optimization techniques for caching in high‑concurrency distributed systems, providing practical guidance on choosing appropriate strategies based on consistency requirements and system load.

Cache EvictionDistributed Systemscache-penetration

0 likes · 16 min read

Cache Design and Optimization Strategies in High‑Concurrency Distributed Systems

Top Architect

Jul 18, 2022 · Backend Development

Common Load Balancing Algorithms and Their Java Implementations

This article explains the principles behind popular load‑balancing strategies such as round‑robin, weighted round‑robin, smooth weighted round‑robin, consistent hashing, least‑active, and best‑response, and provides clear Java code examples for each method.

AlgorithmsBackendDistributed Systems

0 likes · 34 min read

Common Load Balancing Algorithms and Their Java Implementations

IT Architects Alliance

Jul 17, 2022 · Industry Insights

How Meituan Scaled Instant Delivery with Distributed Architecture and AI

This article examines Meituan's five‑year evolution of instant logistics, detailing the distributed, high‑concurrency architecture, AI‑driven optimization, scalability techniques, fault‑tolerance mechanisms, and future challenges faced by its real‑time delivery platform.

AIDistributed SystemsMicroservices

0 likes · 11 min read

How Meituan Scaled Instant Delivery with Distributed Architecture and AI

Java High-Performance Architecture

Jul 15, 2022 · Backend Development

How Meituan Built a High‑Performance Distributed Architecture for Instant Logistics

This article explains how Meituan's instant logistics platform evolved from a monolithic system to a highly available, scalable, and AI‑enhanced distributed architecture, detailing the technical challenges, architectural upgrades, fault‑tolerance strategies, and future scalability concerns.

AI integrationBackend ArchitectureDistributed Systems

0 likes · 10 min read

How Meituan Built a High‑Performance Distributed Architecture for Instant Logistics

Top Architect

Jul 15, 2022 · Backend Development

Design and Evolution of Meituan's Real-Time Logistics Distributed System

This article details Meituan's instant logistics platform architecture, covering its background, distributed system design, high‑availability deployment, AI‑driven optimization, and future challenges, while sharing practical solutions for scalability, fault tolerance, and operational efficiency in a high‑concurrency environment.

Distributed SystemsMeituanMicroservices

0 likes · 9 min read

Design and Evolution of Meituan's Real-Time Logistics Distributed System

IT Architects Alliance

Jul 14, 2022 · Operations

Common Load Balancing Algorithms and Their Java Implementations

This article reviews common load balancing strategies—including round-robin, random, weighted, smooth weighted round-robin, consistent hashing, least-active and optimal-response algorithms—explains their advantages and drawbacks, and provides complete Java code examples for each method.

AlgorithmsDistributed SystemsWeighted Round Robin

0 likes · 33 min read

Architecture & Thinking

Jul 14, 2022 · Fundamentals

Understanding Message Queues: Execution Principles and Choosing the Right MQ

This article explains the fundamentals of message middleware, covering its architecture, components, communication models, advantages, common protocols, major open‑source solutions, detailed feature comparisons, and practical guidance for selecting the most suitable message queue based on system scale, business needs, performance, high availability, and operational considerations.

Distributed SystemsKafkaMQ

0 likes · 21 min read

Understanding Message Queues: Execution Principles and Choosing the Right MQ

Architect

Jul 12, 2022 · Backend Development

Load Balancing Algorithms: Round Robin, Random, Weighted, Smooth Weighted, Consistent Hashing, Least Active and Optimal Response

This article explains common load‑balancing strategies—including basic round‑robin, random, weighted and smooth weighted algorithms, as well as consistent hashing, least‑active and optimal‑response methods—provides Java implementations for each, discusses their advantages, disadvantages, and suitable scenarios, and concludes with practical recommendations.

AlgorithmsDistributed Systemsconsistent hashing

0 likes · 34 min read

Load Balancing Algorithms: Round Robin, Random, Weighted, Smooth Weighted, Consistent Hashing, Least Active and Optimal Response

Top Architect

Jul 12, 2022 · Backend Development

Understanding Interface Idempotency and Distributed Rate Limiting with Token Bucket, Leaky Bucket, Guava RateLimiter, Nginx, and Redis+Lua

This article explains the concept of interface idempotency, demonstrates how to achieve idempotent update operations using version control and token mechanisms, and provides a comprehensive guide to distributed rate limiting—including time‑window and resource‑based dimensions, token‑bucket and leaky‑bucket algorithms, and practical implementations with Guava RateLimiter, Nginx, and Redis‑Lua scripts.

BackendDistributed SystemsIdempotency

0 likes · 21 min read

Understanding Interface Idempotency and Distributed Rate Limiting with Token Bucket, Leaky Bucket, Guava RateLimiter, Nginx, and Redis+Lua

vivo Internet Technology

Jul 6, 2022 · Backend Development

RocketMQ Consumer Startup Process and Message Consumption Mechanisms

RocketMQ consumers initialize by validating configuration, creating subscription data, establishing a client instance, loading offsets, and starting services before registering with the client factory, after which they can receive messages via push (default low‑latency) or pull modes, support ordered processing, and handle acknowledgments with configurable retries and dead‑letter handling.

ConsumerDistributed SystemsRocketMQ

0 likes · 24 min read

RocketMQ Consumer Startup Process and Message Consumption Mechanisms

Top Architect

Jul 4, 2022 · Backend Development

High‑Concurrency Architecture Design and Practical Solutions

This article explains how to design and implement a high‑concurrency system for e‑commerce scenarios, covering server architecture, load balancing, database clustering, caching strategies, concurrency testing tools, message‑queue based async processing, first‑level cache, static data handling, layering, distribution, redundancy and automation, with concrete examples such as user sign‑in, order queries and red‑packet distribution.

Backend ArchitectureDistributed SystemsPerformance Testing

0 likes · 20 min read

High‑Concurrency Architecture Design and Practical Solutions

Cognitive Technology Team

Jul 3, 2022 · Backend Development

Network Timeouts Do Not Imply Server Failure: Effective Retry, Backoff, and Idempotency Strategies

Network timeouts do not necessarily indicate server‑side failure; handling them with appropriate retry strategies, exponential backoff, and idempotent APIs—combined with mechanisms such as distributed locks and atomic transactions—helps maintain system stability while avoiding duplicate operations and resource exhaustion.

BackendBackoffDistributed Systems

0 likes · 7 min read

Network Timeouts Do Not Imply Server Failure: Effective Retry, Backoff, and Idempotency Strategies

Bilibili Tech

Jun 24, 2022 · Cloud Native

Evolution and Design of Bilibili's Load‑Testing Platform (Platform 2.0)

Bilibili’s load‑testing platform evolved from ad‑hoc JMeter scripts to a fully automated, self‑service system (Platform 2.0) that uses a custom load client, adaptive scheduling, and flexible scenario modes—including traffic replay and data‑isolated testing—to efficiently stress‑test over a hundred microservices for large‑scale events, with further integration and circuit‑breaker enhancements planned.

Distributed SystemsLoad TestingMicroservices

0 likes · 27 min read

Evolution and Design of Bilibili's Load‑Testing Platform (Platform 2.0)

High Availability Architecture

Jun 24, 2022 · Backend Development

Improving Cache Invalidation and Consistency at Scale

Meta engineers describe the challenges of cache invalidation and consistency in large‑scale distributed systems, explain why stale caches are problematic, present their Polaris observability service and consistency‑tracking techniques, and detail how they raised TAO’s cache consistency from six‑nines to ten‑nines.

ConsistencyDistributed SystemsObservability

0 likes · 17 min read

Improving Cache Invalidation and Consistency at Scale

macrozheng

Jun 24, 2022 · Backend Development

Choosing the Right Distributed ID Generator: UUID, Snowflake, NanoID

This article examines the trade‑offs of common distributed ID generators—UUID, timestamp‑based schemes, Snowflake, and NanoID—highlighting their size, ordering, performance, and security implications to help developers select the most suitable approach for their systems.

Distributed SystemsID generationnanoid

0 likes · 6 min read

Choosing the Right Distributed ID Generator: UUID, Snowflake, NanoID

DataFunTalk

Jun 23, 2022 · Big Data

Real‑Time Low‑Latency Log Monitoring and Storage at Ctrip: Architecture, Clog System, CAT Tracing, and TSDB

This article details Ctrip's large‑scale, real‑time log monitoring solution, covering the overall monitoring architecture, the Clog log system, the CAT tracing platform, and the TSDB metric store, and explains design choices such as write‑heavy indexing, segment‑based storage, and migration to ClickHouse for high‑cardinality data.

Distributed SystemsLog MonitoringReal-time Processing

0 likes · 11 min read

Real‑Time Low‑Latency Log Monitoring and Storage at Ctrip: Architecture, Clog System, CAT Tracing, and TSDB

NetEase Smart Enterprise Tech+

Jun 23, 2022 · Backend Development

Mastering Rate Limiting: Choosing the Right Algorithm for High‑Availability Systems

This article explores the importance of rate limiting in distributed micro‑service architectures, explains four core algorithms—fixed window, sliding window, leaky bucket, and token bucket—and details a practical, Redis‑backed multi‑layer throttling solution for a voice‑bot platform, including trade‑offs and implementation tips.

Distributed SystemsMicroservicesSliding Window

0 likes · 15 min read

Mastering Rate Limiting: Choosing the Right Algorithm for High‑Availability Systems

Wukong Talks Architecture

Jun 20, 2022 · Backend Development

Understanding the Eureka Service Heartbeat Mechanism in Microservices

This article explains how Eureka’s heartbeat mechanism works in microservice architectures, covering who sends the heartbeat, the scheduling interval, the request implementation, server-side handling, and the effect on service registration, using code examples and analogies to historical tribute systems.

Distributed SystemsHeartbeateureka

0 likes · 6 min read

Understanding the Eureka Service Heartbeat Mechanism in Microservices

转转QA

Jun 20, 2022 · Operations

Issue 4: Technical Articles on Full‑Chain Stress Testing, Distributed Web UI Automation, WWDC22 Sessions, Efficient Automated Testing, and AutoDiff Traffic Replay

This issue curates five technical articles covering Gaode's full‑chain stress testing platform, a distributed web UI automation system, a comprehensive WWDC22 session review, ShouQianBa's efficient automated testing practices, and Autohome's AutoDiff traffic replay for integrated testing.

Distributed SystemsMobile DevelopmentPerformance

0 likes · 4 min read

Issue 4: Technical Articles on Full‑Chain Stress Testing, Distributed Web UI Automation, WWDC22 Sessions, Efficient Automated Testing, and AutoDiff Traffic Replay

Su San Talks Tech

Jun 20, 2022 · Backend Development

Why RPC Beats HTTP for Enterprise Services: OSI Layers, Architecture & Top Frameworks

This article explains the OSI seven‑layer model, compares RPC and HTTP services, details RPC architecture, synchronous vs asynchronous calls, and reviews popular RPC frameworks, helping developers choose the right approach for enterprise applications.

Backend ArchitectureDistributed SystemsHTTP

0 likes · 9 min read

Why RPC Beats HTTP for Enterprise Services: OSI Layers, Architecture & Top Frameworks

ITPUB

Jun 19, 2022 · Backend Development

Mastering Cache Design: Solving the 7 Classic Pitfalls in High‑Traffic Systems

This article explores common cache challenges such as centralized expiration, cache penetration, avalanche, hot keys, large keys, data consistency, and concurrent pre‑warming, offering practical design patterns and mitigation strategies to build robust, high‑performance backend systems.

BackendCache DesignDistributed Systems

0 likes · 10 min read

Mastering Cache Design: Solving the 7 Classic Pitfalls in High‑Traffic Systems

Qunar Tech Salon

Jun 16, 2022 · Operations

Practical Chaos Engineering Practices at Qunar Travel: Architecture, Scenarios, and Automation

This article details Qunar Travel's mature chaos engineering platform built on chaosblade, covering value analysis, system architecture, shutdown and dependency drills, automated closed‑loop testing, attack‑defense exercises, and the measurable reliability improvements achieved across thousands of services.

Distributed SystemsFault InjectionOperations

0 likes · 18 min read

Practical Chaos Engineering Practices at Qunar Travel: Architecture, Scenarios, and Automation

Java High-Performance Architecture

Jun 11, 2022 · Backend Development

Essential Microservice Architecture Patterns Every Backend Engineer Should Know

This article explores the core goals of microservice architecture, outlines key design principles such as scalability and resilience, and presents a comprehensive catalog of decomposition, integration, cross‑cutting concern, and observability patterns—including API gateway, Strangler, Bulkhead, Saga, and CQRS—to guide developers in building robust, maintainable backend systems.

ArchitectureBackendDistributed Systems

0 likes · 24 min read

Essential Microservice Architecture Patterns Every Backend Engineer Should Know

MaGe Linux Operations

Jun 10, 2022 · Fundamentals

Demystifying Paxos: How Distributed Systems Achieve Consensus

This article explains the Paxos consensus algorithm—its origins, core concepts, roles of proposers, acceptors and learners, safety and liveness constraints, the two-phase protocol, proposal generation, and practical variations—showing why Paxos remains a foundational solution for fault‑tolerant distributed systems.

Consensus AlgorithmDistributed SystemsPaxos

0 likes · 16 min read

Demystifying Paxos: How Distributed Systems Achieve Consensus

ITPUB

Jun 10, 2022 · Operations

How Zookeeper Prevents Split‑Brain Failures in Distributed Clusters

This article explains the split‑brain phenomenon in distributed systems, illustrates how it can occur in Zookeeper clusters, and details Zookeeper's quorum‑based solutions—including majority voting, odd‑node deployment, and additional safeguards—to avoid split‑brain and ensure reliable leader election.

Distributed SystemsSplit-Brainleader election

0 likes · 11 min read

How Zookeeper Prevents Split‑Brain Failures in Distributed Clusters

Laiye Technology Team

Jun 10, 2022 · Backend Development

Understanding System Failures and Principles for Resilient Architecture

The article analyzes why modern software systems repeatedly collapse—due to growing business complexity, unpredictable changes, and architectural decay—and proposes principles such as decentralization, integration, and diversity, along with practical strategies like service mesh and eBPF, to design more sustainable, observable, and self‑evolving architectures.

ArchitectureDistributed SystemsMicroservices

0 likes · 12 min read

Understanding System Failures and Principles for Resilient Architecture

Sohu Tech Products

Jun 8, 2022 · Databases

Cache Working Mechanisms and Consistency Solutions Using Redis

This article explores Redis cache mechanisms, various caching patterns such as Cache‑Aside, Read‑Through, Write‑Through and Write‑Behind, analyzes consistency challenges in write operations, and presents solutions like delayed double deletion, retry mechanisms, and binlog‑based asynchronous cache invalidation to achieve eventual consistency.

Cache ConsistencyDistributed Systemscache-aside

0 likes · 18 min read

Cache Working Mechanisms and Consistency Solutions Using Redis

Efficient Ops

Jun 7, 2022 · Big Data

Visualizing Kafka: Core Concepts Explained with Diagrams

This article visually breaks down Kafka’s fundamental concepts—including topics, partitions, producers, consumers, consumer groups, and cluster architecture—so readers can grasp how messages flow, are stored, and achieve load balancing and ordering within a distributed streaming platform.

Distributed SystemsKafkaMessage Queue

0 likes · 6 min read

Visualizing Kafka: Core Concepts Explained with Diagrams

Snowball Engineer Team

Jun 6, 2022 · Databases

Deep Dive into ClickHouse Join Implementation and Optimization Techniques

This article examines ClickHouse's join mechanisms, detailing the limitations of standard joins, the advantages of Global joins, and optimization strategies such as hash and merge joins, subquery filtering, and memory considerations, illustrated with SQL examples and source‑code analysis.

ClickHouseDistributed SystemsHash Join

0 likes · 16 min read

Deep Dive into ClickHouse Join Implementation and Optimization Techniques

DataFunSummit

Jun 3, 2022 · Big Data

Building and Optimizing JD Retail OLAP Platform: Architecture, Management, and Performance Techniques

This article details JD Retail's OLAP platform construction, covering control plane design, architecture, business and operation management, real‑time data updates, materialized view usage, join optimizations, high‑concurrency and high‑throughput scenarios, and promotional preparation strategies, illustrated with diagrams and performance metrics.

Big DataClickHouseDistributed Systems

0 likes · 20 min read

Building and Optimizing JD Retail OLAP Platform: Architecture, Management, and Performance Techniques

Volcano Engine Developer Services

May 30, 2022 · Databases

How ByteDance Scaled to 10 EB: Evolution of Its Cloud‑Native Database Architecture

This article chronicles ByteDance's journey from early MySQL‑based databases to a sophisticated, cloud‑native, distributed database platform that now supports over 10 EB of storage, detailing the challenges, architectural milestones, and future directions of its database infrastructure.

Big DataDistributed SystemsScalability

0 likes · 17 min read

How ByteDance Scaled to 10 EB: Evolution of Its Cloud‑Native Database Architecture

Code Ape Tech Column

May 27, 2022 · Backend Development

Comparison of Four Popular Open-Source Distributed Message Queues: Kafka, ActiveMQ, RabbitMQ, and RocketMQ

This article introduces and compares four widely used open-source distributed message-queue systems—Kafka, ActiveMQ, RabbitMQ, and RocketMQ—detailing their architectures, deployment requirements, features, high-availability solutions, advantages, disadvantages, and provides guidance for selecting the appropriate middleware in large-scale applications.

ActiveMQDistributed SystemsKafka

0 likes · 38 min read

Comparison of Four Popular Open-Source Distributed Message Queues: Kafka, ActiveMQ, RabbitMQ, and RocketMQ

macrozheng

May 26, 2022 · Backend Development

How RocketMQ Uses Queues, Page Cache, and mmap to Achieve High Performance and Scalability

This article explains how RocketMQ tackles registration latency, synchronous coupling, and traffic spikes by introducing an intermediate queue, designing a persistent high‑availability broker, leveraging Linux page cache and memory‑mapped files, and employing topics, tags, and sharding to enable efficient asynchronous processing and scalable consumption.

Distributed SystemsMessage QueueRocketMQ

0 likes · 28 min read

How RocketMQ Uses Queues, Page Cache, and mmap to Achieve High Performance and Scalability

Sohu Tech Products

May 25, 2022 · Backend Development

Design Principles of RocketMQ: Broker Architecture, Persistence, High Performance and High Availability

The article explains how RocketMQ tackles growing business traffic by introducing an asynchronous broker layer, using commitlog and consumeQueue files, page‑cache, mmap, topic/tag routing, and a nameserver to achieve high‑throughput, low‑latency, and highly available message delivery.

Distributed SystemsMessage QueueRocketMQ

0 likes · 30 min read

Design Principles of RocketMQ: Broker Architecture, Persistence, High Performance and High Availability

Zhuanzhuan Tech

May 25, 2022 · Backend Development

Comparison of Transaction Message Implementations in ZuanZuan's Custom RocketMQ Version and the Open‑Source Community Version

This article analyzes why ZuanZuan built its own RocketMQ transaction‑message solution, compares the custom implementation with the community edition’s two‑phase commit approach, explains the underlying principles, sending flow, failure handling, and unknown‑state processing, and summarizes the trade‑offs of each design.

Distributed SystemsMessage QueueRocketMQ

0 likes · 10 min read

Comparison of Transaction Message Implementations in ZuanZuan's Custom RocketMQ Version and the Open‑Source Community Version

Top Architect

May 24, 2022 · Backend Development

Comprehensive Guide to Cache Optimization and Design Strategies

This article presents a comprehensive overview of cache optimization techniques, covering benefits and costs, update policies, granularity control, penetration and avalanche mitigation, hot‑key reconstruction, and distributed batch‑operation strategies, with practical examples using Redis and MySQL.

BackendDistributed SystemsPerformance

0 likes · 14 min read

Comprehensive Guide to Cache Optimization and Design Strategies

ByteFE

May 23, 2022 · Backend Development

Design and Implementation of ByteDance Wallet Asset Platform for the 2022 Spring Festival Activity Across Eight Apps

The article details the architecture, challenges, and solutions of ByteDance's wallet asset platform that unified reward issuance, display, and usage across eight applications during the 2022 Spring Festival, covering high‑traffic order processing, token‑based asynchronous settlement, budget control, stability mechanisms, and future evolution.

BackendDistributed SystemsMicroservices

0 likes · 29 min read

Design and Implementation of ByteDance Wallet Asset Platform for the 2022 Spring Festival Activity Across Eight Apps

IT Architects Alliance

May 21, 2022 · Backend Development

Cache Usage Techniques and Design Strategies

This article explains how caching can accelerate read/write performance and reduce backend load, analyzes its benefits and costs, and details practical design patterns such as update policies, granularity control, penetration, bottom‑hole, avalanche, and hot‑key optimizations for reliable high‑performance systems.

ConsistencyDistributed SystemsPerformance

0 likes · 13 min read

Cache Usage Techniques and Design Strategies

Bilibili Tech

May 20, 2022 · Backend Development

Design and Implementation of Bilibili Object Storage Service (BOSS): Architecture, Topology, Metadata, Erasure Coding, and Scaling

The article chronicles Bilibili’s 13‑day development of BOSS, a custom object storage service, detailing how it replaces MySQL‑based routing and ID generation with replicated etcd or Raft KV stores, models metadata via protobuf, adopts erasure coding and a Bitcask‑style engine, and implements safe delete, replica repair, and horizontal scaling for a resilient large‑scale system.

Distributed Systemserasure codingmetadata design

0 likes · 28 min read

Design and Implementation of Bilibili Object Storage Service (BOSS): Architecture, Topology, Metadata, Erasure Coding, and Scaling

Qunar Tech Salon

May 19, 2022 · Operations

Design and Optimization of a Large‑Scale Monitoring System at Qunar.com

This article describes the architecture, challenges, and performance optimizations of Qunar.com's Watcher monitoring platform, covering massive metric collection, master‑worker redesign, Graphite/Whisper storage enhancements, and future migration to Go‑based cloud‑native solutions.

CI/CDCloud NativeDistributed Systems

0 likes · 13 min read

Design and Optimization of a Large‑Scale Monitoring System at Qunar.com

vivo Internet Technology

May 18, 2022 · Backend Development

Kafka Cluster Fault Analysis: Root Cause and Cascading Failure Mechanism

A Kafka cluster at vivo suffered a total traffic drop across a resource group when a broker’s disk failed, because the default producer partitioner still hashed keys to the failed partition, exhausting client buffers and blocking all healthy partitions, prompting recommendations to avoid keys or use custom partitioners.

Distributed SystemsKafkaPerformance Optimization

0 likes · 9 min read

Kafka Cluster Fault Analysis: Root Cause and Cascading Failure Mechanism

DevOps

May 18, 2022 · Operations

Understanding and Preventing Cascading Failures in Distributed Systems

The article explains how cascading failures arise from positive feedback loops in distributed systems, illustrates real‑world incidents such as the 2015 DynamoDB outage, outlines anti‑patterns like unlimited retries and unchecked load, and presents practical mitigation techniques including load‑shedding, circuit breakers, exponential back‑off, and controlled replication to improve system resilience.

Distributed SystemsResilienceSRE

0 likes · 19 min read

Understanding and Preventing Cascading Failures in Distributed Systems

DataFunSummit

May 15, 2022 · Databases

Design and Evolution of a Custom Storage Engine for IoT Device Metadata

This article presents a detailed case study of an IoT device metadata management platform, describing the business scenario, the evolution from a single‑node MySQL solution through sharded MySQL, HBase and Elasticsearch, to a self‑developed distributed storage engine that separates compute and storage, supports LSM, multi‑dimensional indexing, routing keys, and parallel scans to meet massive write‑read throughput and complex query requirements.

Big DataDistributed SystemsIoT

0 likes · 14 min read

Design and Evolution of a Custom Storage Engine for IoT Device Metadata

Top Architect

May 14, 2022 · Backend Development

Designing Microservices with Domain‑Driven Design, Bounded Contexts, and Event Storming

This article explains how microservice architecture leverages Domain‑Driven Design concepts such as bounded contexts and aggregates, uses context mapping and Event Storming to define service boundaries, and discusses communication patterns, eventual consistency, and Backend‑for‑Frontend design to build resilient, loosely‑coupled distributed systems.

Bounded ContextDistributed SystemsDomain-Driven Design

0 likes · 18 min read

Designing Microservices with Domain‑Driven Design, Bounded Contexts, and Event Storming

Top Architect

May 13, 2022 · Backend Development

Design and Implementation of Delayed Message Queues in Distributed Systems

This article surveys common delayed message implementations—including external storage, RocksDB, Redis, and open‑source MQs like RocketMQ, Pulsar, and QMQ—analyzing their architectures, advantages, drawbacks, and practical considerations for building reliable distributed asynchronous messaging systems.

Distributed SystemsPulsarQMQ

0 likes · 13 min read

Design and Implementation of Delayed Message Queues in Distributed Systems

Zuoyebang Tech Team

May 13, 2022 · Operations

Build a Scalable, Cost‑Effective Log Retrieval System Without Elasticsearch

This article explains how to design a high‑performance, low‑cost log retrieval architecture that avoids Elasticsearch by partitioning logs into time‑based chunks, indexing only metadata, using multi‑tier storage (local, remote, archive), and orchestrating queries through GD‑Search, Local‑Search, Remote‑Search and Log‑Manager components.

Distributed SystemsStorage Optimizationcost efficiency

0 likes · 14 min read

Build a Scalable, Cost‑Effective Log Retrieval System Without Elasticsearch

dbaplus Community

May 12, 2022 · Big Data

How Bilibili Scaled Presto on Hadoop: Architecture, Optimizations, and Performance Gains

This article details Bilibili's end‑to‑end Presto on Hadoop architecture, covering the multi‑engine SQL stack, dispatcher routing, cluster scale, stability enhancements like coordinator HA and real‑time punish, query limits, Hive UDF compatibility, insert‑overwrite support, Alluxio caching, multi‑datacenter routing, query result caching, Raptorx local cache, JDK upgrades, dynamic filtering, and future roadmap, illustrating how these innovations boosted query throughput and reduced latency.

Big DataCluster ManagementDistributed Systems

0 likes · 32 min read

How Bilibili Scaled Presto on Hadoop: Architecture, Optimizations, and Performance Gains

Architect's Alchemy Furnace

May 10, 2022 · Operations

How to Build Truly High‑Availability Systems: Redundancy, Failover, and Layered Architecture

High availability (HA) is essential for distributed systems, requiring redundancy and automatic failover across each architectural layer—from client to proxy, gateway, business logic, cache, and storage—to minimize downtime, achieve desired “nines” of uptime, and prevent cascading failures such as service snowballing.

Distributed SystemsSystem Architecturefailover

0 likes · 14 min read

How to Build Truly High‑Availability Systems: Redundancy, Failover, and Layered Architecture

Sanyou's Java Diary

May 8, 2022 · Backend Development

How Nacos’s Distro Protocol Ensures High Availability with AP Consistency

This article explains Nacos’s Distro consistency protocol, detailing its design principles, asynchronous replication, periodic synchronization, new‑node data loading, and local read mechanisms, and shows how these mechanisms together provide high‑availability AP consistency for service registration in a distributed cluster.

AP ConsistencyDistributed SystemsDistro Protocol

0 likes · 14 min read

How Nacos’s Distro Protocol Ensures High Availability with AP Consistency

Wukong Talks Architecture

May 5, 2022 · Backend Development

Deep Dive into Nacos Distro Consistency Protocol: Design, Mechanisms, and Source Code Analysis

This article explains the design principles, six core mechanisms, asynchronous replication, periodic synchronization, new‑node sync, and local read strategy of Nacos's Distro consistency protocol, illustrating how AP/CP choices and the JRaft protocol ensure high availability in a distributed service registry.

APConsistencyDistributed Systems

0 likes · 13 min read

Deep Dive into Nacos Distro Consistency Protocol: Design, Mechanisms, and Source Code Analysis

IT Architects Alliance

Apr 29, 2022 · R&D Management

30 Essential Architecture Principles Every Software Architect Should Follow

This article presents thirty practical software‑architecture principles—from keeping designs simple and avoiding unnecessary features to mastering concurrency, distributed systems, and user experience—providing a comprehensive guide that helps architects make informed, ROI‑driven decisions while fostering scalable, maintainable solutions.

Distributed SystemsMVPR&D management

0 likes · 11 min read

30 Essential Architecture Principles Every Software Architect Should Follow

Tencent Cloud Developer

Apr 28, 2022 · Operations

Mastering ZooKeeper: Installation, Znode Model, Watchers, and Leader Election Explained

This comprehensive guide walks you through ZooKeeper's role in distributed systems, detailed installation steps, Znode data structures, watcher mechanisms, ZAB protocol operations, and the FastLeaderElection algorithm, providing practical commands and configuration examples for building reliable coordination services.

Coordination ServiceDistributed SystemsInstallation

0 likes · 32 min read

Mastering ZooKeeper: Installation, Znode Model, Watchers, and Leader Election Explained

Cloud Native Technology Community

Apr 26, 2022 · Backend Development

Rapid Event Notification System (RENO) at Netflix: Design, Architecture, and Lessons Learned

Netflix built the Rapid Event Notification (RENO) system to deliver real‑time, scalable, and prioritized notifications across millions of devices, handling diverse use cases such as watch activity, personalization updates, plan changes, and diagnostics while managing high request‑per‑second loads through a hybrid push‑pull model and targeted delivery.

Backend ArchitectureDistributed SystemsNetflix

0 likes · 13 min read

Rapid Event Notification System (RENO) at Netflix: Design, Architecture, and Lessons Learned

Volcano Engine Developer Services

Apr 26, 2022 · Operations

How Volcano Engine’s TLS Transforms Log Management for Kubernetes at Scale

This article explains the challenges of traditional open‑source log collection in cloud‑native environments, describes Volcano Engine’s unified TLS architecture, its centralized configuration, CRD‑based deployment, and showcases real‑world case studies that demonstrate improved availability, efficiency, and scalability.

Cloud NativeDistributed SystemsKubernetes

0 likes · 15 min read

How Volcano Engine’s TLS Transforms Log Management for Kubernetes at Scale

macrozheng

Apr 26, 2022 · Backend Development

Service Registry Showdown: Zookeeper, Eureka, Nacos, Consul & ETCD

This article examines five popular service registries—Zookeeper, Eureka, Nacos, Consul, and ETCD—explaining their core concepts, architecture, CAP trade‑offs, health‑check mechanisms, multi‑data‑center support, and provides guidance on selecting the most suitable registry for different technology stacks and availability requirements.

CAP theoremDistributed SystemsMicroservices

0 likes · 25 min read

Service Registry Showdown: Zookeeper, Eureka, Nacos, Consul & ETCD

Alibaba Terminal Technology

Apr 25, 2022 · Backend Development

How RTMS Revolutionizes Real‑Time Messaging for Massive Interactive Apps

RTMS (Real Time Message Service) is a frame‑synchronization‑based real‑time communication solution that offloads logic from the server, uses in‑memory processing, and employs decentralized, non‑persistent, publish‑subscribe routing to dramatically improve latency, scalability, and interactivity for high‑frequency multi‑user scenarios.

Backend ArchitectureDecentralizationDistributed Systems

0 likes · 10 min read

How RTMS Revolutionizes Real‑Time Messaging for Massive Interactive Apps

IT Architects Alliance

Apr 24, 2022 · Backend Development

Distributed Locks and Idempotency: Principles, Implementations, and the Cerberus Solution

This article explains the challenges of mutual exclusion and idempotency in distributed systems, reviews Java concurrency primitives, compares common distributed lock implementations such as Zookeeper, Redis, and Tair, and introduces Cerberus and GTIS as robust solutions for high‑availability and repeatable operations.

CerberusDistributed SystemsGTIS

0 likes · 31 min read

Distributed Locks and Idempotency: Principles, Implementations, and the Cerberus Solution

Top Architect

Apr 23, 2022 · Backend Development

Common Implementation Schemes for Delayed Messages in Distributed Systems

This article examines various approaches to implementing delayed (scheduled) messages in distributed message queue systems, comparing external storage, RocksDB, Redis, and open‑source MQ solutions, and discusses their advantages, drawbacks, and practical considerations.

ArchitectureBackendDistributed Systems

0 likes · 13 min read

Common Implementation Schemes for Delayed Messages in Distributed Systems

Code Ape Tech Column

Apr 23, 2022 · Big Data

Kafka Fundamentals: Architecture, Replication, Partitioning, and Performance

This article provides a comprehensive overview of Kafka, covering its role as a message middleware, core concepts, architecture, replication management, partition handling, producer sending modes, partition assignment strategies, load balancing, reliability mechanisms, consumer models, controller election, and factors that affect its high throughput and potential message loss scenarios.

Distributed SystemsKafkaMessage Queue

0 likes · 29 min read

Kafka Fundamentals: Architecture, Replication, Partitioning, and Performance

21CTO

Apr 22, 2022 · Backend Development

From Single Server to Cloud‑Native: How Taobao Scaled to Millions of Users

This article traces Taobao’s architectural evolution—from a single‑machine setup to distributed clusters, caching layers, load‑balancing proxies, database sharding, microservices, ESB, containerization, and finally cloud‑native deployment—highlighting the technologies and design principles that enable scaling from hundreds to tens of millions of concurrent users.

ArchitectureDistributed SystemsMicroservices

0 likes · 22 min read

From Single Server to Cloud‑Native: How Taobao Scaled to Millions of Users

Architect

Apr 20, 2022 · Operations

Load Balancing in High‑Concurrency Scenarios: Alibaba Double 11, 12306 Railway, WeChat Red Packets, and Douyin Spring Festival Gala

This article examines real‑world load‑balancing implementations for ultra‑high traffic cases such as Alibaba's Double 11 shopping festival, China's 12306 railway ticketing system, WeChat's red‑packet service, and Douyin's Spring Festival gala, highlighting architectural principles, DPDK usage, ECMP routing, session synchronization, SET‑based sharding, and service‑mesh techniques.

DPDKDistributed SystemsMicroservices

0 likes · 12 min read

Load Balancing in High‑Concurrency Scenarios: Alibaba Double 11, 12306 Railway, WeChat Red Packets, and Douyin Spring Festival Gala

IT Xianyu

Apr 20, 2022 · Fundamentals

Designing Scalable Systems for Billions of Users: From a Single Server to Distributed Architecture

This article explains how to evolve a simple single‑server web application into a highly available, horizontally and vertically scalable system for billions of users by covering DNS, vertical and horizontal scaling, load balancing, database replication, sharding, denormalization, SQL/NoSQL choices, stateless design, caching, CDN and global deployment.

Database ReplicationDistributed SystemsScalability

0 likes · 18 min read

Designing Scalable Systems for Billions of Users: From a Single Server to Distributed Architecture

JD Tech

Apr 14, 2022 · Backend Development

JD.com Flash Sale Product Pool Expansion: Architecture Upgrade and Performance Optimization

This article details how JD.com tackled rapid growth of its flash‑sale product pool by analyzing JVM GC bottlenecks, redesigning the system architecture, introducing dual‑cache bucket updates, local LRU caching, Bloom filters, and service separation, ultimately achieving significant performance and stability improvements for large‑scale promotions.

Distributed SystemsGC optimizationJVM

0 likes · 12 min read

Architect

Apr 14, 2022 · Backend Development

Understanding Idempotent Design: Concepts, Scenarios, and Eight Practical Implementation Strategies

This article explains the mathematical and computer‑science definition of idempotency, why it is essential for reliable distributed services, how to handle timeout scenarios, and presents eight concrete design patterns—including unique IDs, database constraints, token mechanisms, optimistic/pessimistic locks, and distributed locks—along with code examples and HTTP method analysis.

Distributed SystemsHTTPbackend-development

0 likes · 18 min read

Understanding Idempotent Design: Concepts, Scenarios, and Eight Practical Implementation Strategies

macrozheng

Apr 14, 2022 · Operations

Mastering High Availability: 4 Essential Design Techniques for Scalable Systems

This article outlines the core high‑availability techniques—system splitting, decoupling, asynchronous processing, retry, compensation, backup, multi‑active strategies, isolation, rate limiting, circuit breaking, and degradation—providing practical guidance for designing resilient, scalable backend architectures in large‑scale internet applications.

Distributed SystemsMicroservicesSystem Design

0 likes · 13 min read

Mastering High Availability: 4 Essential Design Techniques for Scalable Systems

Architect

Apr 13, 2022 · Backend Development

Implementation Strategies and Trade‑offs for Delayed Messages in Distributed Message Queues

The article surveys common delayed‑message solutions in distributed MQ systems—including external storage, RocksDB, Redis, thread‑based scanning, and the designs of RocketMQ, Pulsar, and QMQ—detailing their architectures, advantages, and drawbacks for backend developers.

Distributed SystemsRocketMQRocksDB

0 likes · 13 min read

Implementation Strategies and Trade‑offs for Delayed Messages in Distributed Message Queues

Wukong Talks Architecture

Apr 13, 2022 · Backend Development

Understanding the Underlying Mechanism of Nacos Service Registration

This article explains how Nacos registers services by detailing the client‑side request assembly, random node selection in a cluster, routing forwarding, and server‑side handling, while providing code snippets, diagrams, and practical tips for debugging the registration flow.

ClusterDistributed SystemsNacos

0 likes · 16 min read

Understanding the Underlying Mechanism of Nacos Service Registration

Architect's Journey

Apr 13, 2022 · Fundamentals

Is Classifying Distributed Systems as CP or AP a False Dichotomy?

The article revisits the CAP theorem, explains linearizable consistency and strict availability, demonstrates with concrete data‑center examples why the CP/AP split is often misleading, and argues that latency concerns and broader failure modes make the binary classification of distributed systems impractical.

AvailabilityCAP theoremCP vs AP

0 likes · 10 min read

Is Classifying Distributed Systems as CP or AP a False Dichotomy?

DataFunTalk

Apr 12, 2022 · Big Data

Kuaishou Big Data Task Scheduling System: Architecture, Challenges, and Key Technologies

This article presents Kuaishou's large‑scale big‑data task scheduling system, describing its evolution from Airflow to the self‑developed Kwaiflow, the performance and reliability challenges of handling hundreds of thousands of tasks, and the design decisions that achieve low latency, high availability, and strong open capabilities.

Distributed SystemsKuaishouKwaiflow

0 likes · 22 min read

Kuaishou Big Data Task Scheduling System: Architecture, Challenges, and Key Technologies

Code Ape Tech Column

Apr 10, 2022 · Backend Development

Implementing Distributed WebSocket Clusters with Spring Cloud, Consistent Hashing, and Gateway

This article explores practical approaches for building a distributed WebSocket cluster in a Spring Cloud environment, covering session handling, Netty versus Spring WebSocket implementations, session broadcasting, consistent‑hash routing, gateway configuration, Ribbon limitations, and provides code examples for each solution.

BackendDistributed SystemsSpring Cloud

0 likes · 17 min read

Implementing Distributed WebSocket Clusters with Spring Cloud, Consistent Hashing, and Gateway

IT Architects Alliance

Apr 10, 2022 · Backend Development

Understanding Elasticsearch: Core Concepts, Architecture, and Performance Tips

This article provides a comprehensive overview of Elasticsearch, covering data types, Lucene fundamentals, cluster discovery, node roles, shard and replica management, mapping, installation, health monitoring, indexing mechanics, storage strategies, refresh and translog processes, segment merging, and practical performance optimizations for production deployments.

Distributed SystemsElasticsearchindexing

0 likes · 39 min read

Understanding Elasticsearch: Core Concepts, Architecture, and Performance Tips

JD Tech

Apr 8, 2022 · Big Data

Designing a High‑Throughput Log Collection System with ClickHouse, UDP and Compression

The article analyses the massive cost and performance challenges of traditional log‑storage pipelines at JD.com, then proposes a streamlined architecture that eliminates disk and MQ stages, uses in‑memory buffering, UDP transport, Snappy/ZSTD compression, and ClickHouse storage to achieve multi‑gigabyte per‑second throughput with over 70% hardware cost reduction.

ClickHouseDistributed SystemsHigh Throughput

0 likes · 15 min read

Designing a High‑Throughput Log Collection System with ClickHouse, UDP and Compression

IT Architects Alliance

Apr 7, 2022 · Fundamentals

30 Architectural Principles for Software Engineers

This article presents thirty concise architectural principles covering simplicity, YAGNI, iterative development, testing, ROI, user focus, modular design, configuration, server concurrency, distributed system challenges, and user experience, offering practical guidance for engineers to build scalable, maintainable software systems.

Distributed SystemsEngineeringSoftware Architecture

0 likes · 12 min read

30 Architectural Principles for Software Engineers

Wukong Talks Architecture

Apr 7, 2022 · Fundamentals

Understanding Kafka Topic Partitions and Consumer Groups with a Plants vs Zombies Analogy

This article uses the Plants vs Zombies game to illustrate why Kafka topics need partitions, how consumer groups work, and the benefits of partitioning, replication, and load balancing in a distributed messaging system.

Distributed SystemsKafkaMessage Queue

0 likes · 9 min read

Understanding Kafka Topic Partitions and Consumer Groups with a Plants vs Zombies Analogy

Architects' Tech Alliance

Apr 2, 2022 · Industry Insights

How Financial Institutions Secure Database Continuity: Disaster Recovery Strategies & Market Trends

This article examines the critical role of databases in finance, defines disaster recovery and backup concepts, outlines industry requirements and regulations, analyzes market growth, and compares distributed database disaster‑recovery architectures such as single‑center, city‑level mutual backup, active‑active, and two‑site three‑center solutions.

BackupDistributed SystemsFinancial Services

0 likes · 15 min read

How Financial Institutions Secure Database Continuity: Disaster Recovery Strategies & Market Trends

Top Architect

Mar 31, 2022 · Backend Development

Meituan Instant Logistics: Distributed System Architecture Evolution and Challenges

The article details Meituan's five‑year journey building a high‑concurrency, AI‑enhanced instant logistics platform, describing its distributed architecture, scalability and reliability practices, fault‑tolerance mechanisms, and future challenges in microservice and unit‑based operations.

AIArchitectureDistributed Systems

0 likes · 10 min read

Meituan Instant Logistics: Distributed System Architecture Evolution and Challenges

Java Interview Crash Guide

Mar 31, 2022 · Backend Development

How We Achieved 20k TPS High‑Availability for a Billion‑User Membership System

This article details the design and implementation of a highly available, high‑performance membership system serving billions of users, covering Elasticsearch dual‑center clusters, traffic‑isolated architectures, deep ES optimizations, Redis caching with distributed locks, dual‑center MySQL partitioning, migration strategies, abnormal account handling, and future fine‑grained flow‑control and degradation policies.

Distributed SystemsElasticsearchMySQL

0 likes · 20 min read

How We Achieved 20k TPS High‑Availability for a Billion‑User Membership System

Top Architect

Mar 30, 2022 · Backend Development

Interface Idempotency and Distributed Rate Limiting: Concepts, Algorithms, and Practical Implementations

The article discusses how to achieve interface idempotency using unique business identifiers or token mechanisms, then explores distributed rate limiting by outlining its dimensions, common algorithms such as token bucket and leaky bucket, and provides concrete implementation examples with Guava RateLimiter, Nginx, and Redis‑Lua scripts.

BackendDistributed SystemsIdempotency

0 likes · 20 min read

Interface Idempotency and Distributed Rate Limiting: Concepts, Algorithms, and Practical Implementations

High Availability Architecture

Mar 29, 2022 · Fundamentals

Abstract Paxos: Unifying Paxos and Raft through Formal Derivation of Distributed Consensus

This article presents abstract‑paxos, a unified framework that derives Paxos and Raft from first principles, defines information certainty, quorum, and a total order on states using commit_index, and details a two‑phase protocol, member‑change handling, and how the model maps to classic Paxos and Raft implementations.

ConsensusDistributed SystemsPaxos

0 likes · 25 min read

Abstract Paxos: Unifying Paxos and Raft through Formal Derivation of Distributed Consensus

IT Architects Alliance

Mar 28, 2022 · Fundamentals

Evolution and Fundamentals of Storage Systems: From Single-Node to Cloud‑Native

This article outlines the historical development of storage systems—from single-node and centralized architectures to distributed, cloud, and cloud‑native solutions—detailing storage types (block, file, object), data models, system components, and key design principles for modern storage infrastructures.

Data ModelsDistributed Systemsblock storage

0 likes · 7 min read

Evolution and Fundamentals of Storage Systems: From Single-Node to Cloud‑Native

IT Architects Alliance

Mar 27, 2022 · Backend Development

Simulating a 10‑Billion Red‑Envelope System with Go: From 3K to 6K QPS

This article details a step‑by‑step engineering experiment that reproduces a high‑throughput "red‑envelope" service, outlining the required hardware, software stack, load‑generation logic, monitoring setup, and performance results for handling up to 6 000 QPS on a 100‑million‑user scale.

BackendDistributed SystemsGo

0 likes · 21 min read

Simulating a 10‑Billion Red‑Envelope System with Go: From 3K to 6K QPS

IT Architects Alliance

Mar 27, 2022 · Backend Development

From a Simple MVP Monolith to a Complex Distributed Architecture: Taobao Case Study

This article walks through the step‑by‑step evolution of a basic single‑server e‑commerce system into a large‑scale, distributed architecture, explaining core concepts, each architectural upgrade, the technologies involved, and key design principles for building scalable, high‑availability services.

ArchitectureCloud ComputingDistributed Systems

0 likes · 20 min read

From a Simple MVP Monolith to a Complex Distributed Architecture: Taobao Case Study

Architects' Tech Alliance

Mar 26, 2022 · Fundamentals

A Comprehensive Overview of Storage Development History, Types, and Models

This article provides a detailed overview of the evolution of storage systems—from single‑node to cloud‑native—covering fundamental concepts, data classifications, storage types such as block, file, and object, and examples like Rook and MinIO, aimed at beginners seeking a macro understanding of storage architecture.

Data ModelsDistributed Systemsblock storage

0 likes · 8 min read

A Comprehensive Overview of Storage Development History, Types, and Models

Java Backend Technology

Mar 26, 2022 · Backend Development

XXL-JOB Mastery: A Step-by-Step Spring Boot Guide to Distributed Scheduling

This tutorial walks you through installing and configuring the open‑source XXL‑JOB scheduler, comparing it with ElasticJob, setting up the admin console, creating a Spring Boot project, adding the necessary dependencies, configuring database and logging, and implementing scheduled tasks using Bean, Method, and GLUE modes, complete with code examples and screenshots.

Distributed SystemsXXL-JOBtask scheduling

0 likes · 15 min read

XXL-JOB Mastery: A Step-by-Step Spring Boot Guide to Distributed Scheduling

IT Services Circle

Mar 25, 2022 · Fundamentals

Consistent Hashing Algorithm: Principles, Advantages, and Applications

Consistent hashing maps both servers and keys onto a virtual hash ring, allowing keys to be assigned to the nearest clockwise server, which minimizes data movement during node addition or removal, improves load distribution with virtual nodes, and is widely used in distributed caching and load balancing systems.

CacheDistributed Systemsconsistent hashing

0 likes · 11 min read

Consistent Hashing Algorithm: Principles, Advantages, and Applications

Top Architect

Mar 25, 2022 · Backend Development

Design and Architecture of Baidu Transaction Middleware Order System

This article explains the business model, capabilities, service types, order lifecycle, and detailed architectural evolution of Baidu's transaction middle‑platform order system, covering instruction orchestration, performance optimizations such as caching, async execution, ThreadLocal handling, and database sharding for high‑throughput scenarios.

Backend ArchitectureDistributed Systemsdatabase sharding

0 likes · 15 min read

Design and Architecture of Baidu Transaction Middleware Order System