Tagged articles
2122 articles
Architect
Sep 9, 2024 · Backend Development

Understanding Redisson DelayedQueue: Internal Mechanisms and Practical Usage

This article explains how Redisson's DelayedQueue works internally, covering its data structures, the basic workflow for sending and receiving delayed messages, the initialization process, and the scheduling logic that moves expired items from the delay queue to the target queue.

DelayedQueue · Distributed Systems · Message Queue
0 likes · 16 min read
Baidu Intelligent Cloud Tech Hub
Sep 5, 2024 · Databases

How Vector Databases Power AI and RAG: Insights from Baidu’s DTCC 2024

This article reviews the 70‑year evolution of databases, explains how vector databases and Retrieval‑Augmented Generation (RAG) are reshaping AI applications, and details Baidu Intelligent Cloud's VectorDB architecture, performance advantages, real‑world use cases, and future trends in data engineering.

AI · Database Architecture · Distributed Systems
0 likes · 16 min read
Tencent Cloud Developer
Sep 3, 2024 · Backend Development

Rate Limiting: Purpose, Algorithms, Implementation Methods, Strategies, and Considerations

Rate limiting safeguards system stability by capping request rates. It employs algorithms such as fixed‑window, sliding‑window, leaky‑bucket, and token‑bucket, and can be applied at the application, proxy, or hardware layer, using strategies such as threshold setting, request classification, and feedback while ensuring fairness, flexibility, and transparency.

Backend · Distributed Systems · Golang
0 likes · 28 min read
JD Retail Technology
Aug 30, 2024 · Artificial Intelligence

GPU Optimization Practices for Training and Inference in JD Advertising Recommendation Systems

The article details JD Advertising's technical challenges and solutions for large‑scale sparse recommendation models, describing GPU‑focused storage, compute and I/O optimizations for both training and low‑latency inference, including distributed pipelines, heterogeneous deployment, batch aggregation, multi‑stream execution, and compiler extensions.

Distributed Systems · GPU Optimization · Inference
0 likes · 13 min read
Xiaolei Talks DB
Aug 28, 2024 · Databases

What 15 Years of China’s DTCC Conferences Reveal About Database Evolution

The author reflects on a decade‑plus journey through China’s DTCC database conferences, describing personal growth from novice to speaker and organizer, sharing insights on Redis Cluster, distributed database selection, openGauss, time‑series databases, and the evolving themes that chart the industry's progress.

Career Development · Distributed Systems · Time Series Database
0 likes · 6 min read
Cognitive Technology Team
Aug 25, 2024 · Operations

Fault Isolation Techniques for High Availability in Distributed Systems

The article explains fault isolation as a key technique for improving distributed system availability, detailing multiple isolation levels—from data‑center to user‑level—and complementary strategies such as circuit breakers, timeouts, fast‑fail, load balancing, caching, and degradation switches.

Distributed Systems · Resource Isolation · circuit breaker
0 likes · 10 min read
Top Architect
Aug 25, 2024 · Backend Development

Integrating Spring Boot with XXL‑Job for Distributed Task Scheduling

This article provides a comprehensive guide on integrating Spring Boot with the open‑source distributed scheduler XXL‑Job, covering installation, configuration of the admin console and executor, code examples, parameter details, comparison with other schedulers, and best practices for reliable task management.

Distributed Systems · XXL-JOB · java
0 likes · 17 min read
Architect
Aug 23, 2024 · Backend Development

Understanding RocketMQ: Basic Concepts, Ordered Messages, and Transactional Messages

This article explains RocketMQ's core components, including domain models, message transmission models, reliability mechanisms, the challenges of ordered messaging, practical usage scenarios, and the design and implementation of transactional messages in both open‑source and proprietary versions.

Backend · Distributed Systems · Message Queue
0 likes · 21 min read
DaTaobao Tech
Aug 14, 2024 · Backend Development

Deep Dive into RocketMQ Architecture and Core Components

The article thoroughly explains RocketMQ’s architecture—including stateless NameServers for routing, master‑slave Brokers with sequential CommitLog storage, stateless Producers and pull‑or‑push Consumers, message flow, indexing, and synchronous double‑write reliability—showing how these components deliver high‑throughput, scalable, fault‑tolerant messaging for distributed systems.

Broker · Consumer · Distributed Systems
0 likes · 20 min read
Huawei Cloud Developer Alliance
Aug 14, 2024 · Cloud Computing

How Huawei’s YuanRong Is Redefining General-Purpose Serverless Computing

Huawei Cloud unveiled YuanRong, a general‑purpose serverless platform presented at ACM SIGCOMM 2024, which tackles core challenges of function coordination, cold‑start latency, external state handling, and backend integration, and demonstrates its impact through automotive IoV and ERP case studies while outlining future AI‑driven innovations.

Distributed Systems · Function Cold Start · Huawei Cloud
0 likes · 12 min read
JD Cloud Developers
Aug 12, 2024 · Backend Development

Mastering High‑Concurrency: From Thread Pitfalls to Distributed Inventory Architecture

This article explores the three‑high requirements of modern internet systems, compares multithreading with asynchronous methods, and presents a practical, partition‑based inventory‑deduction architecture that leverages Redis caching, MySQL sharding, and asynchronous updates to achieve scalable high‑concurrency performance.

Distributed Systems · MySQL · asynchronous programming
0 likes · 19 min read
Su San Talks Tech
Aug 11, 2024 · Backend Development

Mastering Idempotency: Design Patterns and Code Examples for Reliable APIs

This article explains the concept of idempotency, outlines scenarios where it is essential, analyzes common causes of idempotency issues, and presents multiple practical solutions—including unique constraints, optimistic and pessimistic locks, distributed locks, token mechanisms, state machines, deduplication tables, and global request IDs—accompanied by concrete code examples.

Distributed Systems · Idempotency · Token
0 likes · 14 min read
DaTaobao Tech
Aug 9, 2024 · Cloud Native

Deep Dive into RocketMQ: Storage Architecture and Consumer Processing Flow

RocketMQ stores messages in a sequential CommitLog while using a ConsumerQueue index for fast random access, and employs two looping threads—RebalanceService for load‑balancing queue assignments and PullMessageService for continuously pulling messages and dispatching them to user callbacks via a thread‑pooled consumer pipeline.

Consumer · Distributed Systems · Message Queue
0 likes · 12 min read
IT Services Circle
Aug 9, 2024 · Backend Development

Interview on xxl-job Task Scheduling Framework and Handling Overlapping Tasks

The interview discusses the routing and blocking strategies of the xxl-job distributed task scheduling framework, explains how it addresses task overlap and idempotency issues, and offers practical solutions such as single‑machine execution, locking mechanisms, and using a business date to avoid date‑related problems.

Backend · Distributed Systems · XXL-JOB
0 likes · 10 min read
Lobster Programming
Aug 9, 2024 · Backend Development

Why Zookeeper Is Losing Favor: Consistency vs. Availability Trade‑offs

Zookeeper, the open‑source coordination service modeled after Google’s Chubby, offers strong consistency for distributed applications, but its CP design, synchronous master‑slave replication, log‑heavy state changes, and limited health checks lead many large tech firms to favor more available alternatives, though it still fits certain use cases.

Consistency · Distributed Systems · cp
0 likes · 5 min read
Tencent Cloud Developer
Aug 6, 2024 · Backend Development

Nine Common Techniques for Service Performance Optimization

The article outlines nine broadly applicable techniques—caching, parallel and batch processing, data compression, lock‑free design, sharding, eliminating unnecessary requests, and resource pooling—that together can dramatically cut service latency and improve throughput, as demonstrated by an 80% latency reduction in a real‑world project.

Distributed Systems · Golang · Performance Optimization
0 likes · 28 min read
Rare Earth Juejin Tech Community
Aug 6, 2024 · Operations

ZooKeeper Core Concepts: Data Model, Node Types, Sessions, Cluster, Election, ZAB, Watch, ACL, and Distributed Lock Patterns

This article explains ZooKeeper's hierarchical data model, node types, session mechanism, cluster roles and election process, ZAB protocol, watch mechanism, ACL permissions, and common distributed lock implementations, providing a comprehensive overview of its core concepts and practical usage.

ACL · Cluster Management · Coordination Service
0 likes · 17 min read
Su San Talks Tech
Aug 5, 2024 · Fundamentals

Mastering Distributed Systems: Common Pitfalls and How to Avoid Them

This article explains the core concepts of distributed systems—including the CAP theorem, BASE theory, message‑queue challenges, Redis sentinel issues, sharding strategies, unique ID generation, and distributed transaction patterns—while offering practical guidance to prevent common pitfalls and improve reliability.

CAP theorem · Distributed Systems · ID generation
0 likes · 27 min read
JD Cloud Developers
Aug 2, 2024 · Operations

How JD’s Advertising Platform Optimizes Load Balancing for Heterogeneous Clusters

Exploring the evolution of JD’s advertising online model system, this article examines the challenges of heterogeneous hardware load balancing, outlines static and dynamic strategies—including DNS, Nginx, LVS, Ribbon, and Dubbo—and presents a multi‑objective framework that improves service availability and resource utilization, achieving up to 20%+ efficiency gains.

Distributed Systems · heterogeneous hardware · load balancing
0 likes · 17 min read
JD Tech
Aug 1, 2024 · Backend Development

Overview of RPC and How to Build a Custom RPC Framework Using Netty

This article explains the fundamentals and key characteristics of Remote Procedure Call (RPC), outlines common use cases and popular frameworks, and then walks through the design and implementation of a simple custom RPC system—including a bespoke binary protocol, client‑side proxy generation, serialization, Netty‑based encoding/decoding, server‑side request handling, and result delivery—using Java and Netty.

Distributed Systems · Netty · RPC
0 likes · 21 min read
Architect
Jul 26, 2024 · Backend Development

Designing a High‑Performance Go‑Job Scheduler: Architecture, SDK & Task Flow

This article presents a comprehensive technical deep‑dive into Go‑Job, a Go‑native distributed task scheduling framework, covering its background, three‑layer architecture, service and task design, SDK modules, code examples, practical integration steps, and future enhancements for robust backend operations.

Architecture · Distributed Systems · Go
0 likes · 24 min read
DeWu Technology
Jul 24, 2024 · Backend Development

Mastering Unique Identifiers and Distributed Locks: From UUIDs to CAS

This article explores how various unique identifier schemes—random UUIDs, sequential numbers, hierarchical paths, and distributed ID generators—are applied across programming languages, file systems, databases, and networks, and explains when and how to enforce uniqueness using pre‑validation, database constraints, or distributed locks such as Redis and Zookeeper, while also linking these mechanisms to CAS‑based concurrency control.

CAS · Distributed Systems · Idempotency
0 likes · 23 min read
JD Cloud Developers
Jul 24, 2024 · Operations

How JD.com’s Buffalo Scheduler Achieves High‑Performance, Scalable DAG Orchestration

Buffalo, JD.com’s in‑house distributed DAG scheduler, tackles massive task volumes and complex dependencies through a dual‑layer entity model, instance‑based execution, tiered scheduling, a high‑availability architecture, event‑driven processing, and in‑memory and cold‑hot data separation, delivering scalable, low‑latency ETL orchestration.

DAG scheduling · Distributed Systems · ETL orchestration
0 likes · 12 min read
JD Tech Talk
Jul 15, 2024 · Backend Development

Performance Optimizations in RaftKeeper v2.1.0: Benchmark Results and Engineering Details

The article presents a detailed engineering analysis of RaftKeeper v2.1.0, highlighting benchmark‑driven performance improvements such as 11% write throughput gains, up to 198% faster read‑write mixed workloads, and multiple optimizations—including response serialization, list‑request handling, system‑call reduction, thread‑pool redesign, and asynchronous snapshot processing—validated on large ClickHouse clusters.

Benchmark · Distributed Systems · Performance
0 likes · 11 min read
NetEase Cloud Music Tech Team
Jul 11, 2024 · Operations

Cloud Music Guizhou Data Center Migration: A Large-Scale Infrastructure Migration Case Study

In 2023 NetEase Cloud Music executed its largest ever data‑center migration, moving over 20,000 applications and more than one million queries per second to a new Guizhou facility while meeting zero‑downtime, strict latency and bandwidth limits through a batch‑wise, cross‑team strategy that incorporated automated upgrade platforms, standardized operations, and extensive risk‑mitigation measures.

Data Center Migration · Distributed Systems · large-scale infrastructure
0 likes · 27 min read
DeWu Technology
Jul 8, 2024 · Game Development

StarWish Forest: Design and Implementation of a Growth-Based Interactive Game System

StarWish Forest, DeWu’s first growth‑based interactive game, lets users nurture virtual trees by collecting water droplets through daily check‑ins, team planting, PK battles and e‑commerce tasks, automatically advancing through fifteen levels and earning rewards, while its Redis‑driven architecture and modular design enable scalable social engagement and commercial conversion.

Distributed Systems · Game Development · competitive gameplay
0 likes · 16 min read
IT Services Circle
Jul 6, 2024 · Backend Development

Design Techniques for High Availability in Large‑Scale Internet Architecture

This article explains the essential high‑availability design techniques for large‑scale internet systems, covering system splitting, decoupling, asynchronous processing, retry mechanisms, compensation, backup, multi‑active strategies, isolation, rate limiting, circuit breaking, and graceful degradation to ensure robust, scalable backend services.

Backend · Distributed Systems · System Design
0 likes · 13 min read
Su San Talks Tech
Jul 6, 2024 · Backend Development

Mastering High Availability: 10 Essential Design Techniques for Scalable Systems

This article explains ten core techniques—system splitting, decoupling, asynchrony, retry, compensation, backup, multi‑active strategies, isolation, rate limiting, circuit breaking, and degradation—that together enable robust, high‑availability architectures for modern backend services.

Distributed Systems · System Design · fault tolerance
0 likes · 12 min read
Java Tech Enthusiast
Jul 2, 2024 · Databases

Practical Redis Use Cases and Code Examples

This guide walks backend developers through twenty real‑world Redis use cases—from basic caching, lotteries, and like/collect features to ranking, PV/UV counting, Bloom filters, sign‑in, geo‑search, rate limiting, ID generation, distributed locks, messaging, session sharing, and more—providing Spring Boot code samples, limitations, and best‑practice advice.

Data Structures · Distributed Systems · Message Queue
0 likes · 55 min read
Open Source Tech Hub
Jul 2, 2024 · Backend Development

How to Build RPC Services with ThinkPHP Swoole: Step-by-Step Guide

This guide explains how to set up remote procedure calls in a ThinkPHP application using the Swoole extension, covering extension installation, Composer dependencies, server and client configuration, interface definition, hot‑reload handling, and common pitfalls such as Xdebug conflicts.

Backend · Distributed Systems · PHP
0 likes · 6 min read
dbaplus Community
Jun 23, 2024 · Databases

How Vitess Scales MySQL for YouTube: Architecture and Lessons

This article explains how Vitess was created to overcome MySQL leader‑follower replication limits at YouTube, detailing its sidecar VTTablet, stateless VTGate router, topology key‑value store, and scaling strategies that enable billions of users to be served reliably.

Database Architecture · Distributed Systems · MySQL scaling
0 likes · 7 min read
Efficient Ops
Jun 20, 2024 · Operations

How Intelligent Ops Platforms Transform Distributed Banking Systems

This article explains how Chinese commercial banks are adopting intelligent operation platforms to collect, analyze, and visualize distributed system data in real time, enabling rapid root‑cause detection, full‑link tracing, and automated solution recommendations for complex financial services.

Banking · Distributed Systems · Root Cause Analysis
0 likes · 8 min read
Ctrip Technology
Jun 20, 2024 · Backend Development

Design and Architecture of Ctrip Service Registration Center

The article explains Ctrip's service registration center architecture, including its two‑layer Data and Session design, multi‑sharding, fault‑tolerance mechanisms, Redis‑based cluster discovery, design trade‑offs such as proxy versus Smart SDK, hashing strategy, and operational considerations for burst traffic and future scaling.

Distributed Systems · Redis discovery · fault tolerance
0 likes · 16 min read
Wukong Talks Architecture
Jun 20, 2024 · Backend Development

Seven Classic Use Cases of Message Queues

This article shares seven practical scenarios—such as asynchronous processing, traffic smoothing, message bus, delayed tasks, broadcast consumption, distributed transactions, and data hub integration—where message queues like RocketMQ, Kafka, ActiveMQ, and RabbitMQ help solve high‑concurrency challenges in modern backend systems.

Broadcast Consumption · Delay Messages · Distributed Systems
0 likes · 11 min read
Aikesheng Open Source Community
Jun 11, 2024 · Databases

ClickHouse Data Recovery Procedure for a Failed Disk in a 4‑Shard 3‑Replica Cluster

This article details a step‑by‑step recovery of a ClickHouse 4‑shard, 3‑replica cluster after a node’s disks failed, covering verification of residual data, Zookeeper metadata cleanup, table reconstruction, distributed table restoration, and validation of synchronization across replicas.

Data Recovery · Database operations · Distributed Systems
0 likes · 8 min read
Mike Chen's Internet Architecture
Jun 6, 2024 · Backend Development

Comprehensive Introduction to RocketMQ: Architecture, Principles, and Use Cases

This article provides a detailed overview of RocketMQ, covering its core concepts, four primary usage scenarios, architectural components such as NameServer, Broker, Producer, and Consumer, as well as message storage, ordering guarantees, high availability, fault tolerance, and transactional messaging.

Distributed Systems · Message Queue · RocketMQ
0 likes · 6 min read
Sohu Tech Products
Jun 5, 2024 · Big Data

Why Kafka Is the Backbone of Modern Data Pipelines: Core Architecture and Use Cases

This article explains Kafka's role as a high‑throughput distributed message queue, detailing its core components, topic‑partition model, consumer groups, storage mechanisms, fault‑tolerance features, delivery guarantees, ZooKeeper coordination, and scalability strategies for building reliable real‑time data pipelines.

Big Data · Distributed Systems · Kafka
0 likes · 14 min read
Efficient Ops
Jun 2, 2024 · Operations

Why Observability Is the Key to Reliable Distributed Systems

Observability, defined as measuring system state through logs, metrics, and tracing, enhances stability of distributed architectures by enabling rapid fault detection, deeper insight, and proactive issue resolution, distinguishing it from traditional monitoring and supporting DevOps, SRE, and business objectives.

Distributed Systems · monitoring
0 likes · 17 min read
Architect
May 31, 2024 · Backend Development

7 Common Cache Pitfalls and How to Avoid Them

This article outlines seven typical cache problems—including cache penetration, breakdown, avalanche, large keys, hot keys, hit‑rate issues, and data inconsistency—and provides practical solutions such as parameter validation, Bloom filters, null‑value caching, locking, auto‑renewal, random expiration, high‑availability setups, compression, and cache warming to improve system reliability and performance.

Distributed Systems · Performance · cache pitfalls
0 likes · 22 min read
Architect
May 30, 2024 · Backend Development

Inside Pulsar’s Bookie: A Deep Dive into Storage Architecture and Write/Read Paths

This article dissects Pulsar’s storage layer by examining the Bookie component, detailing its architecture, the sequential write‑ahead log, ledger management, journal handling, caching mechanisms, and the complete read/write call chains with concrete code examples and performance considerations.

Distributed Systems · Ledger · Performance
0 likes · 33 min read
Architect
May 28, 2024 · Backend Development

Mastering Microservice Architecture: Key Components and Best Practices

This article provides a comprehensive overview of a typical microservice architecture, detailing the roles of Nginx as the traffic entry, gateways, business services, service registries, caching, databases, messaging, logging, scheduling, and distributed storage, and offering practical guidance for high‑availability deployment.

Backend Architecture · Distributed Systems · Microservices
0 likes · 10 min read
Ctrip Technology
May 17, 2024 · Backend Development

Comprehensive Log Governance and Mining Solution for Distributed Systems

This article presents a comprehensive log governance and mining solution for distributed systems, covering background challenges, usage scenarios, and detailed strategies such as distributed log integration, front‑back end traceability, standardized log management, large‑payload handling, efficient cleaning, and future plans for componentization and sampling.

Distributed Systems · Log Management · Performance Optimization
0 likes · 16 min read
Cognitive Technology Team
May 16, 2024 · Operations

Guide to Building Stability in Distributed Systems

This guide presents comprehensive principles, best practices, and techniques for designing, deploying, and maintaining stable distributed systems, covering fault tolerance, monitoring, capacity planning, incident response, and operational reliability to help engineers achieve high availability.

Distributed Systems · Operations · reliability engineering
0 likes · 1 min read
Sohu Tech Products
May 15, 2024 · Backend Development

Design and Implementation of a Simplified RocketMQ SDK

To simplify RocketMQ usage, I designed an internal SDK that abstracts complex client configurations, offering single‑call ordered message sending, per‑message consumption, unified result handling, and enforced consistent subscription settings, thereby reducing developer effort and preventing common integration errors.

Distributed Systems · Messaging · RocketMQ
0 likes · 6 min read
Top Architect
May 15, 2024 · Backend Development

Implementing a Simple Java RPC Framework: Architecture, Service Registration, Serialization, and Proxy Generation

This article explains the principles and implementation of a Java RPC framework, covering service registration with Zookeeper, serialization, compression, network communication via Netty, dynamic proxy generation using reflection and Javassist, and performance considerations, providing extensive code examples for each component.

Distributed Systems · Javassist · Netty
0 likes · 25 min read
Cognitive Technology Team
May 15, 2024 · Fundamentals

The Fallacies of Distributed Systems: Understanding Common Network Assumptions

This article revisits the classic “Fallacies of Distributed Systems” introduced by Peter Deutsch, explaining why assumptions such as reliable networks, zero latency, infinite bandwidth, secure and homogeneous communication are false, and offering practical strategies like retries, caching, batching, and security‑first design to build robust distributed applications.

Distributed Systems · Fallacies · Latency
0 likes · 4 min read
Mike Chen's Internet Architecture
May 11, 2024 · Big Data

Comprehensive Introduction to Apache Kafka: Architecture, Features, and Use Cases

This article provides a detailed overview of Apache Kafka, covering its core characteristics, distributed architecture, key components such as topics, partitions, brokers, producers, consumers, ZooKeeper, and common application scenarios like log collection, event‑driven architecture, real‑time analytics, and monitoring.

Architecture · Big Data · Distributed Systems
0 likes · 7 min read
Architect
May 10, 2024 · Backend Development

Rate Limiting Algorithms: Fixed Window, Sliding Window, Leaky Bucket, Token Bucket, and Distributed Rate Limiting

This article explains the importance of rate limiting in microservice architectures, introduces four basic algorithms—fixed window, sliding window, leaky bucket, and token bucket—compares their advantages and disadvantages, and presents both single-machine and distributed implementations with Go code examples.

Distributed Systems · Golang · Microservices
0 likes · 29 min read
Architect
May 1, 2024 · Fundamentals

Why ULID Beats UUID: A Deep Dive into Features, Specs, and Python Usage

This article compares UUID and ULID, explains the limitations of UUID versions, details ULID's timestamp‑based, lexicographically sortable design, presents its binary layout and encoding, and shows how to generate and manipulate ULIDs in Python with concrete code examples.

Distributed Systems · Python · Sorting
0 likes · 8 min read
Su San Talks Tech
May 1, 2024 · Backend Development

7 Common Cache Pitfalls and How to Avoid Them in Backend Systems

This article outlines seven typical cache issues—including cache penetration, breakdown, avalanche, data inconsistency, large keys, hot keys, and low hit rates—explains their causes, and provides practical solutions such as parameter validation, Bloom filters, locking, expiration randomization, key sharding, and cache warming to improve system reliability and performance.

Backend · Cache · Distributed Systems
0 likes · 20 min read
FunTester
Apr 28, 2024 · Backend Development

Tackling Data Consistency: Master‑Slave, Master‑Master & Leaderless Architectures

The article examines why distributed systems inevitably face data‑consistency challenges and breaks down three common service‑architecture patterns—master‑slave, master‑master, and leaderless—detailing their replication mechanisms, advantages, drawbacks, and practical solutions such as synchronous, semi‑synchronous, asynchronous replication, quorum handling, node‑failure recovery, and conflict resolution strategies.

Data Consistency · Distributed Systems · Master‑Slave
0 likes · 14 min read
AntTech
Apr 23, 2024 · Databases

The Cloud Era of Databases: Insights from OceanBase Chief Scientist Yang Zhenkun

In his OceanBase developer conference keynote, chief scientist Yang Zhenkun explains how cloud resource pooling enables distributed databases to achieve elastic compute and storage, discusses the evolution of databases, the challenges of transaction processing, and envisions fully shared, on‑demand cloud database services.

Cloud Computing · Distributed Systems · OceanBase
0 likes · 7 min read
ITPUB
Apr 22, 2024 · Backend Development

How Meta Achieves Near‑Perfect Cache Consistency: Lessons from Polaris

This article explains Meta's approach to cache invalidation and consistency, detailing why ultra‑high consistency matters, how their Polaris monitoring system detects and resolves inconsistencies, and provides a simplified Python example that illustrates the underlying mechanisms and challenges.

Backend · Consistency · Distributed Systems
0 likes · 12 min read
21CTO
Apr 20, 2024 · Backend Development

Why Backend Development Is Harder Today Than 20 Years Ago

Over the past two decades, backend development has shifted from simple server‑side coding to a complex, multidisciplinary field, driven by soaring user expectations, massive system scale, an overwhelming array of technologies, and the need for cloud, container, and cross‑team collaboration skills.

Backend · Distributed Systems · Software Architecture
0 likes · 9 min read
Architects' Tech Alliance
Apr 17, 2024 · Cloud Native

What Is Microservices? Core Concepts, Architecture, and Implementation Explained

Microservices, also known as microservice architecture, is a cloud‑native approach that breaks a single application into loosely coupled, independently deployable services, each with its own tech stack, communicating via REST APIs, event streams, or message brokers, and organized around business capabilities and bounded contexts.

Architecture · Cloud Native · Distributed Systems
0 likes · 3 min read
Tongcheng Travel Technology Center
Apr 17, 2024 · Backend Development

In-Depth Analysis of Apache RocketMQ Architecture, Operation Principles, and High‑Throughput Mechanisms

This article provides a comprehensive overview of Apache RocketMQ, detailing its core components, producer and consumer workflows, storage strategies, master‑slave synchronization, Raft‑based half‑write and leader election mechanisms, and best‑practice recommendations for high‑throughput, fault‑tolerant messaging systems.

Distributed Systems · High Throughput · Message Queue
0 likes · 22 min read
Architect
Apr 9, 2024 · Backend Development

Microservice Architecture Design Patterns: Overview, Benefits, Drawbacks, and Usage Guidelines

This article provides a comprehensive overview of microservice architecture, detailing its definition, key characteristics, advantages and disadvantages, and presenting ten essential design patterns—including database per service, event sourcing, CQRS, Saga, BFF, API gateway, Strangler, circuit breaker, externalized configuration, and consumer‑driven contract testing—along with their pros, cons, and appropriate scenarios.

Backend Architecture · Design Patterns · Distributed Systems
0 likes · 29 min read
FunTester
Apr 7, 2024 · Backend Development

Generating Globally Unique Identifiers (GUID) for Performance Testing: UUID, Distributed Services, Snowflake Algorithm, and Thread‑Local Techniques

This article explains why globally unique identifiers are needed in performance testing, compares several common solutions such as Java UUID, Redis/Zookeeper distributed ID generators, the Snowflake algorithm, and thread‑local or shared counters, and provides complete Java code examples for each approach.

Distributed Systems · GUID · Performance Testing
0 likes · 13 min read
Architects' Tech Alliance
Apr 6, 2024 · Artificial Intelligence

How ByteDance Scaled LLM Training to Over 10,000 GPUs: Inside the MegaScale System

The article analyzes ByteDance and Peking University's MegaScale system that enables efficient, stable training of large language models on clusters exceeding ten thousand GPUs, detailing algorithmic tweaks, 3D parallel communication overlap, operator optimizations, data‑pipeline improvements, network tuning, and fault‑tolerance mechanisms that together achieve a 55.2% MFU on a 175B model.

Distributed Systems · GPU clusters · LLM training
0 likes · 15 min read
Architect
Apr 4, 2024 · Backend Development

Mastering High Availability: 9 Essential Design Techniques for Scalable Systems

The article walks through nine practical techniques—system splitting, decoupling, asynchronous processing, retry, compensation, backup, multi‑active deployment, rate limiting, circuit breaking, and degradation—explaining why each is needed, how they are implemented in real‑world microservice architectures, and what trade‑offs to consider.

Distributed Systems · Microservices · System Design
0 likes · 13 min read
Practical DevOps Architecture
Apr 4, 2024 · Databases

ClickHouse Training Course Overview and Curriculum

This article introduces a comprehensive ClickHouse training program that covers fundamental concepts, architecture, installation, distributed cluster design, data import, performance tuning, and includes a detailed list of 33 video modules and additional recommended reading resources for large‑scale data analytics.

Big Data · ClickHouse · Columnar Database
0 likes · 4 min read
Su San Talks Tech
Mar 30, 2024 · Backend Development

7 Real-World Message Queue Patterns Every Backend Engineer Should Know

From asynchronous decoupling to distributed transactions, this article explores seven classic message‑queue use cases—such as peak‑shaving, event buses, delayed tasks, broadcast consumption, and data hub integration—illustrated with real‑world examples from e‑commerce, ride‑hailing, and lottery systems, plus RocketMQ code snippets.

Distributed Systems · Kafka · Message Queue
0 likes · 13 min read
Mike Chen's Internet Architecture
Mar 26, 2024 · Fundamentals

Mastering Message Queues: Core Concepts, Designs, and Popular MQs

Message Queues (MQ) are essential for scalable, decoupled systems, offering asynchronous communication, reliability, buffering, and high availability; this guide explains MQ fundamentals, characteristics, architectures like point-to-point and publish/subscribe, core components, design considerations, and reviews popular solutions such as RabbitMQ, Kafka, ActiveMQ, RocketMQ, and Pulsar.

Architecture · Distributed Systems · Message Queue
0 likes · 9 min read
Tencent Cloud Developer
Mar 26, 2024 · Databases

Data Consistency in Distributed Systems: Master‑Slave, Master‑Master, and Leaderless Architectures

The article compares master‑slave, master‑master, and leaderless distributed architectures, explaining how synchronous, semi‑synchronous, and asynchronous replication affect consistency, latency and scalability, and showing that each pattern trades write throughput, conflict‑resolution complexity, and availability against strong data correctness.

Data Consistency · Distributed Systems · Master‑Slave
0 likes · 15 min read
Java Backend Technology
Mar 24, 2024 · Databases

Beyond Caching: How Redis Powers Real‑World Storage and Services

Redis is widely known as a high‑performance cache, but its rich data structures, persistence options, and distributed features enable it to serve as a primary storage engine for many internet services, from user profiles and leaderboards to rate limiting, messaging, and geo‑location, while presenting scalability and reliability challenges.

Data Structures · Distributed Systems · In-Memory Database
0 likes · 19 min read
dbaplus Community
Mar 21, 2024 · Backend Development

Inside DeWu’s Self‑Built Redis: Architecture, Automation & High‑Availability

This article details DeWu's self‑built high‑performance distributed Redis cache system, covering its Proxy‑based architecture, core components like ConfigServer, Redis‑Proxy and Redis‑Server, the automated operations platform for deployment and scaling, as well as monitoring, alerting, stability measures and future roadmap.

Distributed Systems · redis
0 likes · 22 min read
FunTester
Mar 21, 2024 · Operations

How Dynamic Task‑Grabbing Cuts Distributed Batch Jobs from Hours to Minutes

This article presents a detailed case study of optimizing a distributed batch processing system by replacing static shard‑key concurrency with a dynamic task‑grabbing mechanism, dramatically reducing execution time from several hours to under fifteen minutes while maintaining stable resource usage.

Batch Processing · Distributed Systems · Performance Optimization
0 likes · 8 min read
Cognitive Technology Team
Mar 17, 2024 · Backend Development

Understanding Idempotence: Definition, Importance, Common Causes, and Practical Backend Solutions

Idempotence ensures that repeating the same operation yields the same result without side effects, a critical property in distributed and transactional systems; the article defines the concept, explains why it matters, lists common causes of non‑idempotent behavior, and presents several practical backend solutions such as unique indexes, anti‑duplicate tables, optimistic and pessimistic locks, tokens, distributed locks, and state machines.

Backend · Distributed Systems · Idempotence
0 likes · 8 min read
Rare Earth Juejin Tech Community
Mar 17, 2024 · Fundamentals

Implementing Log Snapshotting in Raft: A Step‑by‑Step Guide

This article provides a comprehensive tutorial on adding log snapshotting to a Raft‑based distributed key‑value store, explaining the motivation, the snapshot mechanism, and detailed Go code for generating, transferring, applying, and persisting snapshots to reduce log size and improve performance.

Consensus · Distributed Systems · Go
0 likes · 15 min read
Architecture & Thinking
Mar 15, 2024 · Backend Development

Why RPC Is Essential for Building Scalable IM Clusters: A Beginner’s Guide

This article explains the fundamentals of Remote Procedure Call (RPC), why it is crucial for developing production‑grade instant‑messaging clusters, how it differs from plain HTTP, the typical RPC call flow, common frameworks such as gRPC, Thrift and Dubbo, and practical considerations for high‑throughput distributed systems.

Distributed Systems · IM · Microservices
0 likes · 12 min read
dbaplus Community
Mar 12, 2024 · Databases

How Didi Scaled Log Search by Replacing Elasticsearch with ClickHouse

Facing PB‑scale daily logs and costly Elasticsearch bottlenecks, Didi redesigned its log‑search architecture by migrating to ClickHouse, detailing the challenges, storage redesign, cluster upgrades, performance optimizations, stability fixes, and the resulting cost reduction and query speed gains.

ClickHouse · Distributed Systems · elasticsearch migration
0 likes · 15 min read
Linux Code Review Hub
Mar 11, 2024 · Databases

How Didi Built a Next‑Gen Log Storage System with ClickHouse

Didi migrated its massive PB‑scale log data from Elasticsearch to ClickHouse, redesigning storage with separate Log and Trace clusters, optimizing partition and sorting keys, introducing native TCP connectors, and revamping HDFS cold‑hot separation, achieving up to four‑fold query speed gains and 30% lower hardware costs.

ClickHouse · Distributed Systems · Flink
0 likes · 15 min read
DataFunSummit
Mar 8, 2024 · Databases

Ant TuGraph Computing Engine Architecture and Applications

An upcoming introduction to Ant TuGraph’s open‑source graph computing engine, led by Fang Zhihong, will cover its development history, architectural design, technical principles, integrated stream‑batch‑graph processing capabilities, real‑world large‑scale graph use cases, and future roadmap, offering insights into the engine's design, implementation, and value.

Big Data · Distributed Systems · TuGraph
0 likes · 2 min read
Tencent Cloud Developer
Feb 28, 2024 · Backend Development

Comprehensive Guide to Rate Limiting Algorithms and Distributed Rate Limiting Solutions

This guide explains why rate limiting is essential for micro‑service stability, outlines six design principles, details four classic algorithms—fixed window, sliding window, leaky bucket, and token bucket—and compares centralized Redis, load‑balancer cache, and coordination‑service distributed solutions.

Distributed Systems · Golang · Microservices
0 likes · 30 min read
Aikesheng Open Source Community
Feb 27, 2024 · Cloud Native

Detailed Overview of LiteIO Architecture, Components, and Volume Lifecycle

This article provides a comprehensive technical overview of LiteIO, describing its core and CSI components, the complete volume lifecycle within Kubernetes, Disk‑Agent responsibilities, common implementation pitfalls, storage‑pool construction methods, and the design of the node‑disk‑controller, scheduler, and CSI modules.

CSI · Cloud Native · Distributed Systems
0 likes · 13 min read
Mike Chen's Internet Architecture
Feb 27, 2024 · Cloud Native

Mastering Spring Cloud: A Deep Dive into Microservices Architecture

This article explains the fundamentals of microservices, why monolithic applications become limiting as they grow, and how Spring Cloud and its components—such as Eureka, Ribbon, Feign, Hystrix, Zuul, Config, and Sleuth—provide a complete framework for building scalable, resilient distributed systems.

Distributed Systems · Spring Cloud · circuit breaker
0 likes · 7 min read
Java Architect Essentials
Feb 25, 2024 · Backend Development

Designing a High‑Concurrency Ticket‑Seckill System: Architecture, Load Balancing, and Go Implementation

The article analyzes the extreme‑traffic challenges of the 12306 ticket‑seckill service, presents a layered load‑balancing architecture, compares inventory‑deduction strategies, and demonstrates a complete Go‑based prototype with Nginx weighted round‑robin, Redis stock management, and ApacheBench performance testing.

Distributed Systems · Go · NGINX
0 likes · 19 min read
DataFunSummit
Feb 20, 2024 · Big Data

BitSail Open‑Source Data Integration Engine: Architecture, New Features, CDC Solutions and Future Outlook

This article introduces ByteDance's open‑source data integration engine BitSail, covering its background, layered architecture, recent feature enhancements, automated testing framework, CDC‑based full‑library synchronization solutions, and future development plans for connectors and real‑time data consistency.

Big Data · CDC · Data Integration
0 likes · 12 min read
Mike Chen's Internet Architecture
Feb 20, 2024 · Backend Development

Mastering Dubbo: Deep Dive into Architecture, Components, and Best Practices

This article provides a comprehensive overview of Dubbo, a Java RPC framework, covering its necessity, core components, implementation principles, architecture layers, supported protocols, registry options, load‑balancing strategies, and typical application scenarios for building scalable micro‑services.

Distributed Systems · Dubbo · Java RPC
0 likes · 9 min read
Mike Chen's Internet Architecture
Feb 8, 2024 · Backend Development

Core Principles of Message Queues: 12 Key Concepts Explained

This article provides a comprehensive overview of message queue fundamentals, covering producers, consumers, brokers, point-to-point and publish/subscribe models, ordering, ACK mechanisms, eventual consistency, transactions, persistence, high availability, and selection criteria for various MQ technologies.

Backend · Distributed Systems · MQ
0 likes · 10 min read
Mike Chen's Internet Architecture
Feb 5, 2024 · Backend Development

Comprehensive Overview of RocketMQ: Architecture, Features, and Application Scenarios

This article provides an in‑depth introduction to RocketMQ, covering its evolution, core components, message domain model, key characteristics such as ordered delivery and deduplication, and typical use cases like traffic shaping, asynchronous decoupling, and distributed transaction messaging.

Apache · Distributed Systems · Message Queue
0 likes · 8 min read
Sanyou's Java Diary
Feb 1, 2024 · Backend Development

Message Queues Unveiled: From Decoupling to Platformization and Core Architectures

This article traces the two‑decade evolution of message queues—from early decoupling solutions like ActiveMQ, through high‑throughput designs such as Kafka, to modern platformized systems like RocketMQ and Pulsar—while explaining fundamental concepts, partitioning, and storage architectures that underpin today’s distributed messaging platforms.

Distributed Systems · Kafka · Message Queue
0 likes · 17 min read