Tagged articles
2122 articles
Page 5 of 22
Architect
Architect
Jan 31, 2024 · Backend Development

How to Build Distributed Multi‑Rule Rate Limiting with Redis and Spring AOP

This article explains how to implement multi‑rule rate limiting in a distributed Java application using Redis, covering String‑based counters, Zset timestamp storage, Lua scripts for atomic checks, custom @RateLimiter annotations, key generation logic, and an AOP interceptor that enforces the limits.

Distributed SystemsLuaaop
0 likes · 12 min read
How to Build Distributed Multi‑Rule Rate Limiting with Redis and Spring AOP
Baidu Geek Talk
Baidu Geek Talk
Jan 22, 2024 · Backend Development

Design and Implementation of Baidu's Unified Long‑Connection Service

Baidu’s Go‑based unified long‑connection service delivers secure, high‑concurrency, low‑latency connections for multiple Baidu apps through a four‑layer architecture (SDK, control, access, routing), employing goroutine pooling, two‑layer connection models and binary routing to support tens of millions of concurrent users and million‑level QPS, while simplifying integration and reducing maintenance costs.

Backend ArchitectureDistributed Systemshigh concurrency
0 likes · 21 min read
Design and Implementation of Baidu's Unified Long‑Connection Service
Mike Chen's Internet Architecture
Mike Chen's Internet Architecture
Jan 17, 2024 · Backend Development

Mastering Message Middleware: From Basics to Choosing the Right Solution

This article explains what message middleware is, outlines its key use cases such as asynchronous communication and decoupling, details core principles and models like point‑to‑point and publish/subscribe, reviews popular solutions (Kafka, RabbitMQ, RocketMQ, Pulsar, etc.), and offers selection guidance.

Distributed SystemsKafkaMessage Middleware
0 likes · 8 min read
Mastering Message Middleware: From Basics to Choosing the Right Solution
360 Smart Cloud
360 Smart Cloud
Jan 15, 2024 · Big Data

Design and Optimization of the Ozone Distributed Object Storage System

This article presents a comprehensive overview of Ozone, a Hadoop‑based distributed object storage system, detailing its architecture, metadata management, scalability enhancements, small‑file handling, erasure coding, lifecycle policies, and future improvements aimed at boosting performance and reliability for large‑scale unstructured data workloads.

Big DataDistributed SystemsHadoop
0 likes · 15 min read
Design and Optimization of the Ozone Distributed Object Storage System
dbaplus Community
dbaplus Community
Jan 12, 2024 · Operations

How a Financial Firm Built a Scalable Edge‑Stored APM System for Microservices

This article describes how a securities company tackled the challenges of distributed‑system observability by designing and deploying a self‑developed application performance monitoring platform that supports flexible integration, dynamic metric collection, edge storage, and cross‑center synchronization, delivering measurable improvements in monitoring coverage, alert effectiveness, and bandwidth usage.

APMDistributed SystemsEdge Storage
0 likes · 16 min read
How a Financial Firm Built a Scalable Edge‑Stored APM System for Microservices
ITPUB
ITPUB
Jan 12, 2024 · Databases

What the 2023 Chinese Government Database Procurement Standard Means for Vendors

The 2023 Chinese government database procurement standard defines unified requirements for government agencies, covering scope, procurement principles, mandatory indicators, response and acceptance forms, and detailed technical criteria for centralized and distributed databases, offering clear guidance for vendors on functional, reliability, security, compatibility, service, and safety expectations.

Distributed SystemsReliabilitydatabases
0 likes · 15 min read
What the 2023 Chinese Government Database Procurement Standard Means for Vendors
Senior Tony
Senior Tony
Jan 11, 2024 · Backend Development

10 Powerful Redis Use Cases Beyond Simple Caching

This guide explores ten practical Redis scenarios—including login authentication, counters, fan following, leaderboards, anti‑scraping, message queues, browser history, distributed locks, user sign‑in tracking, and website UV statistics—showcasing commands and patterns to boost performance in real‑world applications.

Caching AlternativesData StructuresDistributed Systems
0 likes · 12 min read
10 Powerful Redis Use Cases Beyond Simple Caching
Sohu Tech Products
Sohu Tech Products
Jan 10, 2024 · Databases

An Introduction to Redis: Basics, Performance, and Comparison with Memcached

Redis is an open‑source, in‑memory NoSQL database that provides ultra‑fast key‑value storage, rich data structures, persistence, clustering and extensible modules, making it the preferred distributed cache over Memcached, which lacks these features and is now rarely chosen for new projects.

Distributed SystemsIn-Memory DatabaseMemcached Comparison
0 likes · 12 min read
An Introduction to Redis: Basics, Performance, and Comparison with Memcached
Goodme Frontend Team
Goodme Frontend Team
Jan 10, 2024 · Frontend Development

Essential Frontend Picks: Canvas Collision Detection, CSS Animation Tricks, Design‑to‑Code & Distributed Theory

This curated newsletter shares the latest frontend news and article recommendations, covering JavaScript rising stars, ESLint upgrades, upcoming jQuery 4.0, V8's Maglev compiler, canvas collision detection, CSS animation control, design‑to‑code workflows, distributed theory basics, version‑update popups, and anti‑mistake design principles.

DesignDistributed SystemsFrontend
0 likes · 6 min read
Essential Frontend Picks: Canvas Collision Detection, CSS Animation Tricks, Design‑to‑Code & Distributed Theory
MaGe Linux Operations
MaGe Linux Operations
Jan 7, 2024 · Backend Development

How Zookeeper Guarantees Reliable Session Management with Heartbeats

This article explains Zookeeper's session management mechanism, detailing why TCP alone is insufficient for client liveness detection, how Zookeeper implements its own heartbeat protocol, and the internal data structures and algorithms—including expiryMap and SessionTracker—that efficiently track and expire sessions.

Distributed SystemsHeartbeatSession Management
0 likes · 12 min read
How Zookeeper Guarantees Reliable Session Management with Heartbeats
NetEase Cloud Music Tech Team
NetEase Cloud Music Tech Team
Jan 2, 2024 · Backend Development

Cache Design and Optimization Practices for High‑Concurrency Music Library Service

The article details NetEase Cloud Music’s high‑concurrency cache architecture—using lazy‑load, hole‑wrapped objects for penetration protection, placeholder values for missing data, horizontal and vertical scaling with consistent hashing, and asynchronous binlog‑driven invalidation—to achieve sub‑millisecond reads for a read‑heavy, write‑light music library.

Distributed SystemsScale‑Upcache invalidation
0 likes · 12 min read
Cache Design and Optimization Practices for High‑Concurrency Music Library Service
Architect
Architect
Dec 28, 2023 · Backend Development

How to Implement Distributed Multi‑Rule Rate Limiting with Redis and Lua

This article explains how to design and implement a distributed rate‑limiting solution that supports multiple concurrent rules—such as per‑minute and per‑hour limits—by analyzing the shortcomings of simple string counters, introducing atomic Lua scripts and Zset structures, and providing complete Java annotation and AOP code examples.

Distributed SystemsLuaaop
0 likes · 13 min read
How to Implement Distributed Multi‑Rule Rate Limiting with Redis and Lua
Amap Tech
Amap Tech
Dec 28, 2023 · Backend Development

Rate Limiting Algorithms and Their Java Implementations

Rate limiting protects system stability by capping request rates, and this article explains five Java algorithms—Fixed Window, Sliding Window, Leaky Bucket, Token Bucket, and Guava's RateLimiter—showing their principles, pros and cons, and providing sample implementations and a Spring @Limit annotation for practical enforcement.

Distributed SystemsGuavaalgorithm
0 likes · 17 min read
Rate Limiting Algorithms and Their Java Implementations
macrozheng
macrozheng
Dec 28, 2023 · Databases

Mastering Sharding: 21 Core Concepts for Scalable Database Design

This article introduces the essential concepts of sharding architecture—including data partitioning, logical and physical tables, sharding keys, routing strategies, SQL rewriting, distributed primary keys, data masking, transactions, migration, and shadow databases—providing a comprehensive guide for building high‑performance, horizontally scaled database systems.

Broadcast TableDistributed SystemsSQL Routing
0 likes · 21 min read
Mastering Sharding: 21 Core Concepts for Scalable Database Design
System Architect Go
System Architect Go
Dec 26, 2023 · Cloud Native

Distributed Leader Election and Kubernetes Lease

Distributed leader election ensures a single node performs exclusive tasks in high‑availability systems, with common implementations like Etcd, Kafka, Elasticsearch, and Zookeeper using algorithms such as Paxos, Raft, and ZAB, while Kubernetes provides the Lease resource to manage leader election via a distributed lock.

Distributed SystemsGoLease
0 likes · 7 min read
Distributed Leader Election and Kubernetes Lease
DataFunSummit
DataFunSummit
Dec 23, 2023 · Big Data

Common Optimization Points in Building a User Portrait Platform

The presentation by Zhang Xinglong, head of the user understanding center's portrait platform backend, outlines the platform's overall architecture, core module optimizations, advanced crowd creation techniques, bitmap usage, and industry trends, helping the audience understand platform importance, performance improvements, and future directions.

BackendDistributed Systemsoptimization
0 likes · 3 min read
Common Optimization Points in Building a User Portrait Platform
Alibaba Cloud Developer
Alibaba Cloud Developer
Dec 21, 2023 · Backend Development

Mastering Rate Limiting: Algorithms, Java Implementations, and Guava Tips

This article explains why rate limiting is essential for high‑traffic systems, defines common rate‑limiting algorithms (fixed window, sliding window, leaky bucket, token bucket), provides complete Java code examples for each, and demonstrates practical usage with Guava's RateLimiter in real‑world applications.

BackendDistributed SystemsGuava
0 likes · 19 min read
Mastering Rate Limiting: Algorithms, Java Implementations, and Guava Tips
MaGe Linux Operations
MaGe Linux Operations
Dec 18, 2023 · Backend Development

How to Prevent Kafka Message Loss in Critical Transaction Systems

This article explains why Kafka can lose messages in production, broker, and consumer stages, analyzes root causes such as asynchronous batch sends, JVM crashes, and network failures, and provides practical solutions including callbacks, retry mechanisms, replication settings, and manual offset commits to ensure reliable delivery.

Distributed SystemsKafkaMessage Reliability
0 likes · 10 min read
How to Prevent Kafka Message Loss in Critical Transaction Systems
dbaplus Community
dbaplus Community
Dec 18, 2023 · Backend Development

How to Prevent Cache Penetration, Avalanche, Breakdown, Inconsistency, and Concurrency Issues

This guide explains common cache problems such as penetration, avalanche, breakdown, data inconsistency, and concurrent access, and provides practical solutions like Bloom filters, multi‑level caching, random expiration, distributed locks, and transaction mechanisms to keep systems stable and performant.

CacheData ConsistencyDistributed Systems
0 likes · 13 min read
How to Prevent Cache Penetration, Avalanche, Breakdown, Inconsistency, and Concurrency Issues
DataFunTalk
DataFunTalk
Dec 15, 2023 · Big Data

Zhihu Bridge Platform: Internal Marketing Architecture, Challenges, and Optimizations

This article presents a comprehensive overview of Zhihu's Bridge Platform internal marketing module, detailing its background, business logic, product components such as CDP, activity and delivery platforms, architectural layers, performance bottlenecks, optimization techniques—including distributed transactions, bitmap indexing, and vectorized query execution—and future directions toward marketing automation and intelligence.

CDPDistributed SystemsPerformance Optimization
0 likes · 28 min read
Zhihu Bridge Platform: Internal Marketing Architecture, Challenges, and Optimizations
Tencent Cloud Middleware
Tencent Cloud Middleware
Dec 12, 2023 · Cloud Native

How Tencent Cloud Implements Tiered Storage for Kafka: Architecture, Challenges, and Evolution

This article examines the challenges of Kafka's traditional architecture, explains why local‑state heavy deployments cause operational difficulty and resource waste, and details Tencent Cloud's elastic, storage‑compute‑separated designs—including tiered storage, segment state machines, offset constraints, and performance optimizations—while sharing practical implementation insights and future directions.

Cloud NativeData LifecycleDistributed Systems
0 likes · 17 min read
How Tencent Cloud Implements Tiered Storage for Kafka: Architecture, Challenges, and Evolution
AntTech
AntTech
Dec 11, 2023 · Artificial Intelligence

Ant Group Open-Sources OpenASCE: A Distributed Full-Stack Causal Learning System Presented at NeurIPS

At NeurIPS 2023, Ant Group unveiled OpenASCE, the industry's first open‑source distributed full‑link causal learning system, detailing its architecture, large‑scale capabilities, and real‑world applications in credit risk, marketing, and recommendation while emphasizing its role in advancing causal AI research.

AIAnt GroupDistributed Systems
0 likes · 5 min read
Ant Group Open-Sources OpenASCE: A Distributed Full-Stack Causal Learning System Presented at NeurIPS
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Dec 7, 2023 · Backend Development

In-Depth Analysis of XXL‑RPC Framework: Design, Implementation, and Source Code Walkthrough

This article provides a comprehensive overview of the lightweight XXL‑RPC framework, covering fundamental RPC concepts, the framework's architecture built on Spring and Netty, detailed provider and consumer implementations, various call types, and the service registry‑discovery mechanism, concluding with practical insights for developers.

Distributed SystemsMicroservicesNetty
0 likes · 17 min read
In-Depth Analysis of XXL‑RPC Framework: Design, Implementation, and Source Code Walkthrough
Alibaba Cloud Native
Alibaba Cloud Native
Dec 6, 2023 · Cloud Native

How RocketMQ Implements Random Indexing for Cloud‑Native Storage

This article explains RocketMQ's random indexing mechanism, detailing its on‑disk three‑segment hash table structure, the compact format conversion process, multi‑threaded write and query workflows, layered system design, crash‑recovery strategy, and comparisons with RocksDB and InnoDB storage engines.

Distributed SystemsMessage IndexingRocketMQ
0 likes · 16 min read
How RocketMQ Implements Random Indexing for Cloud‑Native Storage
Architecture Digest
Architecture Digest
Dec 1, 2023 · Databases

Understanding ULID: A Lexicographically Sortable Unique Identifier and Its Python Implementation

ULID (Universally Unique Lexicographically Sortable Identifier) offers a 128‑bit, time‑based and random identifier that is URL‑safe, sortable, and more collision‑resistant than UUID, with detailed specifications, binary layout, and Python usage examples including generation, conversion, and component extraction.

Distributed SystemsPythonULID
0 likes · 7 min read
Understanding ULID: A Lexicographically Sortable Unique Identifier and Its Python Implementation
Architecture Digest
Architecture Digest
Nov 27, 2023 · Databases

Fast Import of 1 Billion Records into MySQL: Design, Performance, and Reliability Considerations

To import one billion 1 KB log records into MySQL efficiently, the article examines data size constraints, B‑tree index limits, batch insertion strategies, storage engine choices, file‑reading techniques, task coordination with Redis, Redisson semaphores, and distributed lock handling to ensure ordered, reliable, high‑throughput loading.

Batch InsertBig DataDistributed Systems
0 likes · 18 min read
Fast Import of 1 Billion Records into MySQL: Design, Performance, and Reliability Considerations
Top Architecture Tech Stack
Top Architecture Tech Stack
Nov 27, 2023 · Operations

Designing Multi-Active Cross‑Region Architecture: Scenarios, Patterns, and Practical Techniques

This article explains the motivations, application scenarios, architectural patterns (same‑city, cross‑city, and cross‑country), and concrete design techniques for building multi‑active cross‑region systems that ensure high availability and graceful degradation during extreme failures.

Distributed Systemsdata synchronizationdisaster recovery
0 likes · 32 min read
Designing Multi-Active Cross‑Region Architecture: Scenarios, Patterns, and Practical Techniques
Top Architecture Tech Stack
Top Architecture Tech Stack
Nov 26, 2023 · Operations

Understanding High Availability and High Performance: Complexity, Redundancy, and Decision Strategies

This article examines the inherent complexity of achieving high availability and high performance in distributed systems, explaining redundancy techniques, storage consistency challenges, various state‑decision models, and the trade‑offs involved in scaling single‑machine and cluster architectures.

Distributed SystemsSystem Designhigh availability
0 likes · 27 min read
Understanding High Availability and High Performance: Complexity, Redundancy, and Decision Strategies
Architect
Architect
Nov 23, 2023 · Databases

Inside Our High‑Performance Self‑Built Redis System: Architecture, Features & Ops

This article details the design and implementation of a self‑managed Redis KV cache system spanning tens of terabytes, covering its Proxy‑based architecture, ConfigServer high‑availability via Raft, Redis‑Proxy slot routing, async‑fork optimizations, data migration strategies, and a comprehensive automation platform for deployment, scaling, monitoring, and stability governance.

Distributed Systemsautomationhigh availability
0 likes · 24 min read
Inside Our High‑Performance Self‑Built Redis System: Architecture, Features & Ops
Top Architecture Tech Stack
Top Architecture Tech Stack
Nov 22, 2023 · Operations

Designing Multi‑Active (Active‑Active) Architecture Across Regions: Scenarios, Patterns, and Practical Techniques

This article explains the motivations, application scenarios, architectural patterns, and step‑by‑step design techniques for building geographically distributed active‑active systems that can survive extreme failures while balancing cost, complexity, and data consistency requirements.

Active-ActiveDistributed SystemsSystem Design
0 likes · 32 min read
Designing Multi‑Active (Active‑Active) Architecture Across Regions: Scenarios, Patterns, and Practical Techniques
dbaplus Community
dbaplus Community
Nov 19, 2023 · Backend Development

How YouTube Scaled to 100 Million Daily Views with Just 9 Engineers

An in‑depth look at YouTube’s early scalability strategy reveals how a tiny team of nine engineers built a simple yet powerful tech stack—leveraging MySQL, Lighttpd, Python, commodity hardware, stateless design, replication, partitioning, caching, and strategic outsourcing—to handle billions of daily video views.

Distributed SystemsScalabilityYouTube
0 likes · 10 min read
How YouTube Scaled to 100 Million Daily Views with Just 9 Engineers
Architect
Architect
Nov 16, 2023 · Backend Development

Understanding Transactional Messages in Distributed Systems: RocketMQ and Kafka

This article explains the principles of distributed transaction messages, comparing 2PC, TCC, and transactional messaging, and provides detailed walkthroughs of RocketMQ and Kafka implementations, including their two‑phase processes, broker handling, and source‑code insights for ensuring data consistency in asynchronous systems.

2PCDistributed SystemsMessage Queue
0 likes · 16 min read
Understanding Transactional Messages in Distributed Systems: RocketMQ and Kafka
Didi Tech
Didi Tech
Nov 14, 2023 · Databases

Didi's Multi-Active Redis Architecture: Design, Challenges, and Solutions

To achieve disaster-recovery and cross-data-center resilience, Didi progressed from a simple proxy double-write scheme to a sophisticated MQ-free multi-active Redis design that uses a dedicated syncer, shard-based loop prevention, op-id replay protection, conflict detection, and incremental AOF durability, ensuring low latency, no data loss, and consistent availability.

DidiDistributed Systemsdata replication
0 likes · 11 min read
Didi's Multi-Active Redis Architecture: Design, Challenges, and Solutions
Architecture Digest
Architecture Digest
Nov 11, 2023 · Databases

Redis: From Cache to Distributed Data Store – Benefits, Persistence, and Use Cases

This article explains how Redis evolved from a simple cache to a high‑performance distributed data store, covering its architecture, persistence models, scalability, high‑availability features, complex data structures, and the trade‑offs of using it as a primary database versus a traditional relational system.

Distributed SystemsPersistencecaching
0 likes · 9 min read
Redis: From Cache to Distributed Data Store – Benefits, Persistence, and Use Cases
AntTech
AntTech
Nov 10, 2023 · Artificial Intelligence

Ant Group and Tsinghua University’s Distributed Collaborative Risk‑Defense System Wins Zhejiang Provincial Science & Technology Progress Award

The award‑winning distributed collaborative risk‑defense system, developed by Ant Group, Tsinghua University and Alipay, leverages AI, privacy‑preserving computing and graph analytics to achieve real‑time, high‑efficiency detection and invisible, precise control of hidden risks in massive digital transactions, earning top provincial honors and extensive industry adoption.

AI securityAwardDistributed Systems
0 likes · 5 min read
Ant Group and Tsinghua University’s Distributed Collaborative Risk‑Defense System Wins Zhejiang Provincial Science & Technology Progress Award
dbaplus Community
dbaplus Community
Nov 9, 2023 · Fundamentals

Mastering Java Locks: From Pessimistic to Distributed and Optimizations

This article explains the full spectrum of Java locking mechanisms—including pessimistic, optimistic, distributed, reentrant, spin, read/write, fair vs. non‑fair, JVM lock states, and optimization techniques—detailing their principles, use‑cases, SQL/Redis examples, and performance trade‑offs.

Distributed SystemsJVMLocks
0 likes · 16 min read
Mastering Java Locks: From Pessimistic to Distributed and Optimizations
Architecture Digest
Architecture Digest
Nov 6, 2023 · Backend Development

Using Lua Scripts in Spring Boot with Redis: A Comprehensive Guide

This article explains how to integrate Lua scripts into Spring Boot projects for Redis, covering Lua fundamentals, performance benefits, practical use cases, step‑by‑step implementation with code examples, error handling, security considerations, and best‑practice recommendations for backend developers.

Distributed SystemsLuaPerformance Optimization
0 likes · 19 min read
Using Lua Scripts in Spring Boot with Redis: A Comprehensive Guide
Selected Java Interview Questions
Selected Java Interview Questions
Nov 5, 2023 · Backend Development

Design and Implementation of a High‑Performance Distributed Reconciliation System for Large‑Scale Payment Orders

This article presents a comprehensive design of a distributed reconciliation system that handles tens of millions of daily payment orders by using a six‑module architecture, Kafka for decoupled state transitions, Hive for large‑scale data processing, and Java‑based plug‑in patterns to achieve six‑nine accuracy and significant operational cost savings.

Big DataDistributed SystemsKafka
0 likes · 15 min read
Design and Implementation of a High‑Performance Distributed Reconciliation System for Large‑Scale Payment Orders
Bilibili Tech
Bilibili Tech
Nov 3, 2023 · Big Data

Comprehensive Governance and Optimization Strategies for Large‑Scale Kafka Clusters

To tame a petabyte‑scale Kafka deployment of over 1,000 brokers, the team built a Raft‑based federation controller (Guardian) that adds per‑partition I/O throttling, disk‑aware automatic balancing, multi‑tenant isolation, cross‑IDC migration, request‑queue splitting, tiered storage, auditing, and fully automated rolling upgrades, enabling stable, self‑healing operations.

Big DataCluster GovernanceDistributed Systems
0 likes · 21 min read
Comprehensive Governance and Optimization Strategies for Large‑Scale Kafka Clusters
Architect
Architect
Oct 25, 2023 · Operations

The Importance of Logging and Distributed Log Operations in Modern Architecture

This article explores why logs are essential in software development, outlines when to record them, discusses the value of logging in large-scale distributed systems, and examines the capabilities required of log‑operation tools such as APM, metrics, tracing, ELK, Prometheus, and custom batch querying solutions.

APMDistributed SystemsELK
0 likes · 21 min read
The Importance of Logging and Distributed Log Operations in Modern Architecture
JD Cloud Developers
JD Cloud Developers
Oct 25, 2023 · Backend Development

Master Kafka: Core Concepts, Architecture, and Practical Tips

This article explains Kafka's fundamentals, including topics, partitions, brokers, replication, producer‑consumer workflow, consumer groups, offset management, and common exception handling, while providing code examples and diagrams to help developers understand and effectively use this distributed messaging system.

Distributed SystemsKafkaMessage Queue
0 likes · 21 min read
Master Kafka: Core Concepts, Architecture, and Practical Tips
JD Tech
JD Tech
Oct 25, 2023 · Backend Development

Design and Implementation of JD Logistics Order System Architecture for High Scalability and Availability

The article details JD Logistics' order system redesign using a four‑layer transaction architecture, describing its decoupled backend, unified data model, high‑availability components such as CQRS, Redis, JMQ, HBase, and Elasticsearch, and outlines design advantages, extensible data modeling, future challenges, and overall performance outcomes.

Backend ArchitectureDistributed SystemsOrder Management
0 likes · 10 min read
Design and Implementation of JD Logistics Order System Architecture for High Scalability and Availability
DataFunSummit
DataFunSummit
Oct 24, 2023 · Databases

OushuDB: A Cloud‑Native Real‑Time Lakehouse Database – Architecture, Evolution and Practice

This article introduces OushuDB, a cloud‑native real‑time lakehouse database, tracing the evolution of cloud‑native lakehouse architectures, detailing OushuDB’s multi‑engine, multi‑storage design, and sharing practical insights on compute‑storage separation, high‑availability, and integration with Hadoop, Hive and Hudi.

Distributed Systemscloud-native
0 likes · 20 min read
OushuDB: A Cloud‑Native Real‑Time Lakehouse Database – Architecture, Evolution and Practice
Su San Talks Tech
Su San Talks Tech
Oct 22, 2023 · Backend Development

Mastering Rate Limiting: Algorithms, Scenarios, and Practical Implementations

Rate limiting controls request flow to protect system stability, covering its definition, motivations, common algorithms such as token bucket, leaky bucket, fixed and sliding windows, their pros and cons, single‑machine vs distributed implementations, and practical component choices for backend services.

BackendDistributed Systemsalgorithm
0 likes · 17 min read
Mastering Rate Limiting: Algorithms, Scenarios, and Practical Implementations
Mike Chen's Internet Architecture
Mike Chen's Internet Architecture
Oct 19, 2023 · Backend Development

Understanding RocketMQ Core Architecture and Its Role in High‑Concurrency Systems

This article explains the core components of RocketMQ—including NameServer, Broker, Producer, and Consumer—detailing their functions, deployment diagrams, and how the middleware supports high‑concurrency scenarios such as peak shaving, asynchronous messaging, fault tolerance, and system decoupling.

Backend ArchitectureDistributed SystemsMessage Queue
0 likes · 7 min read
Understanding RocketMQ Core Architecture and Its Role in High‑Concurrency Systems
Architect
Architect
Oct 15, 2023 · Backend Development

Reflections on Microservices: Promises, Realities, and Trade‑offs

The article examines the alluring promises of microservices—such as faster deployments, independent scaling, and team autonomy—while revealing the hidden complexities like deployment chaos, service discovery, data consistency, latency, and the nostalgic appeal of monolithic architectures, ultimately offering three key takeaways for architects.

Backend ArchitectureDistributed SystemsMicroservices
0 likes · 10 min read
Reflections on Microservices: Promises, Realities, and Trade‑offs
Wukong Talks Architecture
Wukong Talks Architecture
Oct 13, 2023 · Backend Development

7 Common Message Queue Scenarios and Their Implementations

This article explains seven typical message‑queue use cases—including ordinary, ordered, delayed, transactional, trace, dead‑letter, and priority messages—detailing their business motivations, implementation challenges, and concrete code examples for Kafka, RocketMQ, Pulsar, and RabbitMQ.

Distributed SystemsKafkaMessage Queue
0 likes · 11 min read
7 Common Message Queue Scenarios and Their Implementations
AntTech
AntTech
Oct 9, 2023 · Databases

TuGraph-DB v4.0: New Features Including ISO GQL Support, Enterprise High Availability, and Graph Learning Engine

TuGraph-DB v4.0, the open‑source graph database from Ant Group, introduces ISO GQL compliance, enterprise‑grade high availability with RAFT‑based leader election, and an integrated graph learning engine compatible with DGL and PyG, enhancing query capabilities, scalability, and AI‑driven analytics.

AIDistributed SystemsGraph Database
0 likes · 5 min read
TuGraph-DB v4.0: New Features Including ISO GQL Support, Enterprise High Availability, and Graph Learning Engine
MaGe Linux Operations
MaGe Linux Operations
Oct 8, 2023 · Big Data

Understanding Kafka: Core Concepts, Architecture, and Performance Secrets

This article explains Kafka’s fundamental role as a message system, detailing topics, partitions, producers, consumers, replica management, consumer groups, the controller, Zookeeper coordination, and performance optimizations such as sequential writes, zero‑copy, log segmentation, and network design, providing a comprehensive overview for big‑data practitioners.

Big DataDistributed SystemsKafka
0 likes · 11 min read
Understanding Kafka: Core Concepts, Architecture, and Performance Secrets
Practical DevOps Architecture
Practical DevOps Architecture
Oct 8, 2023 · Backend Development

Comprehensive RocketMQ Course: Source Code Deep Dive, Advanced Features, Performance Tuning, and Real‑World Projects

This course provides an in‑depth study of RocketMQ, covering its fundamentals, source‑code analysis, advanced features such as transactional and ordered messages, performance optimization, operational monitoring, and hands‑on projects that guide learners through installation, configuration, clustering, and integration with Spring Boot.

Distributed SystemsMessage QueuePerformance Optimization
0 likes · 4 min read
Comprehensive RocketMQ Course: Source Code Deep Dive, Advanced Features, Performance Tuning, and Real‑World Projects
Efficient Ops
Efficient Ops
Oct 7, 2023 · Big Data

Master Kafka Basics: Topics, Partitions, Producers, and Cluster Architecture

This article explains Kafka's role as a messaging system, covering core concepts such as topics, partitions, producers, consumers, messages, cluster architecture, replicas, consumer groups, controller coordination with Zookeeper, and performance optimizations like sequential writes and zero‑copy networking.

Big DataDistributed SystemsKafka
0 likes · 11 min read
Master Kafka Basics: Topics, Partitions, Producers, and Cluster Architecture
Liangxu Linux
Liangxu Linux
Oct 5, 2023 · Fundamentals

Why ULID Beats UUID: A Deep Dive into Lexicographically Sortable IDs

This article explains the limitations of UUID, introduces ULID as a time‑and‑randomness based identifier with 1.21×10⁻²⁴ unique values per millisecond, details its specification, binary layout, and shows how to generate and manipulate ULIDs in Python for distributed systems.

Distributed SystemsPythonULID
0 likes · 7 min read
Why ULID Beats UUID: A Deep Dive into Lexicographically Sortable IDs
Architects' Tech Alliance
Architects' Tech Alliance
Oct 3, 2023 · Fundamentals

Survey on In‑Network Storage Systems

This survey systematically reviews the research progress of in‑network storage systems, covering programmable network hardware characteristics, the two main challenges of building high‑performance solutions, task‑based classification of existing systems, representative designs, and future research directions such as switch‑NIC collaboration, multi‑tenant support, security, and automatic offloading.

CacheConsistencyDistributed Systems
0 likes · 12 min read
Survey on In‑Network Storage Systems
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Sep 28, 2023 · Backend Development

Comprehensive 12306 Railway Ticket System Project for Learning Distributed Backend Development

This article introduces the 12306 railway ticket system project, detailing its purpose for student learning, two deployment versions, extensive backend architecture, technology stack, code repository structure, design challenges, and resources to help learners master high‑concurrency distributed systems.

Distributed SystemsMicroservicesSpringBoot
0 likes · 15 min read
Comprehensive 12306 Railway Ticket System Project for Learning Distributed Backend Development
Efficient Ops
Efficient Ops
Sep 24, 2023 · Big Data

Mastering Kafka: From Basics to Advanced Operations and Performance Tuning

This article provides a comprehensive overview of Apache Kafka, covering its architecture, core concepts such as topics, partitions, and replicas, common operational commands, and practical performance‑tuning tips for high‑throughput, low‑latency streaming workloads.

Distributed SystemsKafkaOperations
0 likes · 23 min read
Mastering Kafka: From Basics to Advanced Operations and Performance Tuning
ITPUB
ITPUB
Sep 22, 2023 · Databases

Where Does Database Innovation Come From? Exploring the Future of Distributed Databases

The article examines the driving forces behind database innovation, emphasizing the role of inherent shortcomings, AI integration, and the emergence of third‑generation distributed databases that aim for minimal usability, controllable latency high availability, and 100% data correctness.

Distributed SystemsInnovationdata correctness
0 likes · 11 min read
Where Does Database Innovation Come From? Exploring the Future of Distributed Databases
Sanyou's Java Diary
Sanyou's Java Diary
Sep 21, 2023 · Big Data

Understanding Kafka: Core Concepts, Architecture, and Reliability Explained

This article provides a comprehensive overview of Kafka, covering its overall architecture, key components such as brokers, producers, consumers, topics, partitions, replicas, and ZooKeeper, as well as logical and physical storage mechanisms, producer and consumer workflows, configuration parameters, partition assignment strategies, rebalancing, and the replication model that ensures data reliability.

Data StreamingDistributed SystemsKafka
0 likes · 18 min read
Understanding Kafka: Core Concepts, Architecture, and Reliability Explained
DevOps Coach
DevOps Coach
Sep 21, 2023 · Operations

Why Observability Engineering Is Essential for Modern Software Systems

The article examines the concept of observability engineering, highlighting its importance for complex distributed systems, the cultural shift toward DevOps collaboration, key principles from the book “Observability Engineering,” and practical guidance for developers, SREs, managers, and executives to improve reliability, performance, and security.

Distributed Systemssoftware reliability
0 likes · 14 min read
Why Observability Engineering Is Essential for Modern Software Systems
Huolala Tech
Huolala Tech
Sep 21, 2023 · Big Data

How We Built a Scalable Data Migration Framework for Billions of Transactions

This article details the design and implementation of a custom, high‑throughput data migration framework that handles petabyte‑scale transaction data, supports heterogeneous source/target schemas, ensures zero‑downtime operation, and provides robust scheduling, checkpointing, and fault‑tolerance mechanisms.

Big DataData MigrationDistributed Systems
0 likes · 17 min read
How We Built a Scalable Data Migration Framework for Billions of Transactions
Wukong Talks Architecture
Wukong Talks Architecture
Sep 21, 2023 · Backend Development

Detecting and Preventing Message Loss in Kafka Message Queues

This article explains how to detect, diagnose, and prevent message loss in Kafka-based message queue systems by covering system decoupling, traffic control, data consistency challenges, producer, broker, and consumer issues, and offering configuration, monitoring, and operational best‑practice solutions.

Data ConsistencyDistributed SystemsKafka
0 likes · 12 min read
Detecting and Preventing Message Loss in Kafka Message Queues
Top Architect
Top Architect
Sep 20, 2023 · Operations

Design and Implementation of a Distributed Log Service: Tianyan vs ELK

This article examines the challenges of building a high‑performance log service for distributed systems, compares the traditional ELK stack with the Tianyan platform, details Tianyan's architecture—including ingest, storage, and consumer components, SDK and Minos collection methods, high‑throughput transmission with Disruptor and Bigpipe, log retrieval, resource isolation, dynamic cleaning, and best‑practice recommendations.

BigpipeDisruptorDistributed Systems
0 likes · 27 min read
Design and Implementation of a Distributed Log Service: Tianyan vs ELK
dbaplus Community
dbaplus Community
Sep 19, 2023 · Cloud Native

How REDck Transformed ClickHouse into a Scalable Cloud‑Native Real‑Time Data Warehouse

REDck, a cloud‑native real‑time data warehouse built on open‑source ClickHouse, overcomes the original MPP architecture’s scaling and maintenance limits by separating compute and storage, introducing unified metadata, multi‑level caching, bucket‑based sharding, and distributed transaction support, delivering petabyte‑scale, 99.9% availability and ten‑fold cost and performance gains for Xiaohongshu’s diverse workloads.

ClickHouseCloud NativeCompute-Storage Separation
0 likes · 22 min read
How REDck Transformed ClickHouse into a Scalable Cloud‑Native Real‑Time Data Warehouse
Architect
Architect
Sep 19, 2023 · Big Data

How Tianyan Beats ELK: Inside a High‑Performance Distributed Log Service

This article analyzes the challenges of logging in distributed services, compares the traditional ELK stack with Baidu's Tianyan platform, and details Tianyan's architecture, data collection, high‑throughput transmission, storage, retrieval, resource isolation, dynamic cleanup, and best‑practice recommendations, complete with code examples and performance insights.

Big DataDistributed SystemsELK
0 likes · 30 min read
How Tianyan Beats ELK: Inside a High‑Performance Distributed Log Service
Su San Talks Tech
Su San Talks Tech
Sep 15, 2023 · Backend Development

What Is Microservices? Architecture, Challenges, and Popular Solutions Explained

This article introduces microservices as a lightweight, autonomous architecture, compares it with monolithic and SOA approaches, outlines the complexities it brings, and reviews the most common open‑source solutions, registration and configuration centers, remote‑call mechanisms, resilience patterns, load‑balancing algorithms, tracing, and monitoring tools.

BackendDistributed Systemsservice discovery
0 likes · 44 min read
What Is Microservices? Architecture, Challenges, and Popular Solutions Explained
Practical DevOps Architecture
Practical DevOps Architecture
Sep 14, 2023 · Cloud Native

Spring Cloud Alibaba Comprehensive Learning Path – Video, Documentation, and Code Examples

This article presents a detailed Spring Cloud Alibaba curriculum covering microservice architecture, service registration with Nacos, rate limiting with Sentinel, distributed transaction solutions, and practical hands‑on labs, providing video tutorials, documentation, and complete code samples for developers.

Distributed SystemsMicroservicesNacos
0 likes · 10 min read
Spring Cloud Alibaba Comprehensive Learning Path – Video, Documentation, and Code Examples
NetEase Cloud Music Tech Team
NetEase Cloud Music Tech Team
Sep 14, 2023 · Backend Development

Building a Multi‑Level Cache Consistency Framework for Live‑Streaming Platforms

This article describes how a social live‑streaming platform designed and implemented a custom multi‑level cache consistency framework, detailing the background challenges, the architecture of a cache pipeline with Zookeeper‑based node discovery and retry‑enabled execution, and the integration SDKs that enable transparent cache clearing across services.

CacheCache ConsistencyDistributed Systems
0 likes · 10 min read
Building a Multi‑Level Cache Consistency Framework for Live‑Streaming Platforms
vivo Internet Technology
vivo Internet Technology
Sep 13, 2023 · Operations

Network Quality Monitoring Center: Architecture, Design, and Implementation for Large-Scale Data Center Latency Measurement

The Network Quality Monitoring Center is a large‑scale system that deploys lightweight agents on every server to issue coordinated ICMP ping probes, a controller to generate and distribute topology‑aware PingLists, and a storage‑analysis module that aggregates latency and loss data for real‑time visualization, alerting and troubleshooting, while addressing load‑balance, ingestion concurrency, and future extensions such as UDP/TCP probes.

Distributed SystemsICMP pingNetwork Monitoring
0 likes · 12 min read
Network Quality Monitoring Center: Architecture, Design, and Implementation for Large-Scale Data Center Latency Measurement
360 Smart Cloud
360 Smart Cloud
Sep 13, 2023 · Backend Development

Introduction to Apache Pulsar Architecture and Features

This article introduces Apache Pulsar, an open‑source cloud‑native distributed messaging platform, detailing its storage‑compute separation architecture, multi‑tenant support, load balancing, fault tolerance, schema handling, functions, IO connectors, tiered storage, cross‑region replication, and operational commands for managing brokers, bookies, and namespaces.

Apache PulsarDistributed SystemsMessage Queue
0 likes · 20 min read
Introduction to Apache Pulsar Architecture and Features
Architect
Architect
Sep 11, 2023 · Databases

How eBay Scaled ClickHouse with Read/Write Separation and Keeper

This article details eBay's event monitoring platform architecture, explains the challenges of high‑load OLAP workloads on ClickHouse clusters, describes the design and implementation of read/write separation and multi‑shard Keeper coordination, and shares concrete configuration snippets, performance observations, and production lessons learned.

ClickHouseDistributed SystemsKeeper
0 likes · 20 min read
How eBay Scaled ClickHouse with Read/Write Separation and Keeper
Mike Chen's Internet Architecture
Mike Chen's Internet Architecture
Sep 7, 2023 · Backend Development

Comprehensive Overview of Message Queues: Types, Core Concepts, and Comparison of Kafka, RocketMQ, and RabbitMQ

This article provides a detailed overview of popular message queue systems, explains their core concepts such as decoupling and eventual consistency, and compares the advantages and disadvantages of Kafka, RocketMQ, RabbitMQ, and other notable MQ solutions for high‑concurrency scenarios.

Backend ArchitectureDistributed SystemsKafka
0 likes · 7 min read
Comprehensive Overview of Message Queues: Types, Core Concepts, and Comparison of Kafka, RocketMQ, and RabbitMQ
Java Architect Essentials
Java Architect Essentials
Sep 6, 2023 · Backend Development

Achieve WebSocket Load Balancing Across Microservice Instances with a Single Annotation

This article explains how to solve the WebSocket message delivery problem in microservice architectures by using a lightweight library that provides an @EnableWebSocketLoadBalanceConcept annotation, detailing its design, connection management, message routing, and customizable selectors for targeted delivery.

Distributed SystemsMicroservicesSpring Cloud
0 likes · 13 min read
Achieve WebSocket Load Balancing Across Microservice Instances with a Single Annotation
Didi Tech
Didi Tech
Sep 5, 2023 · Operations

Observability and Stability Engineering in Didi Ride‑Hailing Platform

At Didi, observability and stability engineering combine automated, AI‑driven alarm generation, distributed tracing, and ChatOps‑based fault handling to manage micro‑service complexity, massive traffic spikes, and cross‑region operations, emphasizing systematic investment, AIOps evolution, and a recruitment call for backend and test engineers.

DidiDistributed SystemsObservability
0 likes · 16 min read
Observability and Stability Engineering in Didi Ride‑Hailing Platform
Mike Chen's Internet Architecture
Mike Chen's Internet Architecture
Aug 28, 2023 · Backend Development

Understanding Dubbo: Core Functions, Architecture Layers, and Call Process of the RPC Framework

This article explains Dubbo's three core capabilities—remote method invocation, intelligent fault tolerance with load balancing, and service registration/discovery—details its main components and ten architectural layers, and walks through the eight-step RPC call flow, providing a comprehensive overview of the backend framework.

Distributed SystemsMicroservicesRPC
0 likes · 7 min read
Understanding Dubbo: Core Functions, Architecture Layers, and Call Process of the RPC Framework
MaGe Linux Operations
MaGe Linux Operations
Aug 26, 2023 · Backend Development

How Tencent’s PC & Mobile Payment Architecture Evolved to Support Billions

This article traces the evolution of Tencent's payment platform from its early PC‑centric design through three mobile payment phases, detailing architectural generations, availability measures, multi‑active strategies, and cloud‑native innovations that enable massive, reliable transaction processing.

AvailabilityCloud NativeDistributed Systems
0 likes · 14 min read
How Tencent’s PC & Mobile Payment Architecture Evolved to Support Billions
JD Cloud Developers
JD Cloud Developers
Aug 24, 2023 · Backend Development

Mastering Java Locks: From ReentrantLock to Distributed Redisson & Zookeeper

This article explains the fundamentals and practical usage of various Java locking mechanisms—including ReentrantLock, synchronized, ReentrantReadWriteLock, and the Atomic and Concurrent families—then explores distributed lock solutions such as Redisson’s multiple lock types and Zookeeper/Curator, comparing their performance, consistency, and suitability for different scenarios.

Distributed SystemsLocksZooKeeper
0 likes · 22 min read
Mastering Java Locks: From ReentrantLock to Distributed Redisson & Zookeeper
政采云技术
政采云技术
Aug 23, 2023 · Big Data

Step-by-Step Guide to Building a Hadoop Big Data Cluster on ARM Architecture

This comprehensive tutorial details the process of deploying a complete Hadoop-based big data ecosystem on ARM architecture, covering the installation and configuration of essential components including Java, Zookeeper, Hadoop, MySQL, Hive, and Spark with practical code examples.

ARM architectureCluster DeploymentDistributed Systems
0 likes · 19 min read
Step-by-Step Guide to Building a Hadoop Big Data Cluster on ARM Architecture
DeWu Technology
DeWu Technology
Aug 21, 2023 · Backend Development

Design and Implementation of DGraph: A High‑Performance Recommendation Engine

DGraph, a C++ recommendation engine launched in 2022 for 得物, combines an index layer with a service layer, uses lock‑free RCU structures, a custom mmap‑based D‑Allocator, RoaringBitmap invert indexes, and a multi‑operator fusion scheduler to achieve high‑performance, eventually consistent, scalable recommendations.

Distributed SystemsMemory ManagementRCU
0 likes · 12 min read
Design and Implementation of DGraph: A High‑Performance Recommendation Engine
Mike Chen's Internet Architecture
Mike Chen's Internet Architecture
Aug 17, 2023 · Backend Development

Comprehensive Introduction to Message Queues

This article provides a thorough overview of message queues, covering their definition, core components, types (point‑to‑point and publish/subscribe), key characteristics such as asynchrony, loose coupling, distribution and reliability, selection criteria among popular MQ products, and typical application scenarios in high‑concurrency systems.

AsynchronousBackend ArchitectureDistributed Systems
0 likes · 9 min read
Comprehensive Introduction to Message Queues
Code Ape Tech Column
Code Ape Tech Column
Aug 15, 2023 · Operations

High‑Availability Architecture for a Billion‑Scale Membership System: Dual‑Center ES, Redis, and MySQL Solutions

This article details the design and implementation of a highly available, high‑performance membership system serving over a billion users, covering dual‑center Elasticsearch clusters, traffic‑isolated three‑cluster ES architecture, Redis dual‑center caching, MySQL partitioned clusters, migration strategies, and refined flow‑control and degradation mechanisms.

Distributed SystemsElasticsearchMySQL
0 likes · 20 min read
High‑Availability Architecture for a Billion‑Scale Membership System: Dual‑Center ES, Redis, and MySQL Solutions
Didi Tech
Didi Tech
Aug 7, 2023 · Backend Development

How Didi Achieved Cross‑Datacenter Elasticsearch Replication for Strong Consistency

This article explains Didi's self‑developed DCDR system that replicates Elasticsearch indices across data‑center clusters, detailing its design goals, core mechanisms, chain construction, historical data recovery, real‑time sync, and data‑quality validation to ensure high availability and strong consistency.

Cross‑Datacenter ReplicationDCDRData Consistency
0 likes · 15 min read
How Didi Achieved Cross‑Datacenter Elasticsearch Replication for Strong Consistency
Architects Research Society
Architects Research Society
Aug 5, 2023 · Big Data

Getting Started with Data Mesh: A Quick‑Start Guide

This guide introduces the concept of a data mesh, explains why modern data‑driven organizations need domain‑driven self‑serve design, outlines its three core principles, and provides a curated reading list to help teams transition from monolithic data lakes to distributed, observable data products.

Data GovernanceDistributed SystemsDomain‑Driven Design
0 likes · 10 min read
Getting Started with Data Mesh: A Quick‑Start Guide
Tencent Cloud Developer
Tencent Cloud Developer
Aug 2, 2023 · Backend Development

How to Do Architecture Design Well: A Tencent Architect's Experience

A Tencent architect explains that effective software architecture requires clear distinctions between systems, subsystems, modules and components, understanding frameworks versus structural design, applying TOGAF classifications, evolving from monoliths to microservices, and following fifteen practical principles while avoiding common misconceptions such as over‑design and technology‑driven choices.

Distributed SystemsMicroservicesSoftware Architecture
0 likes · 20 min read
How to Do Architecture Design Well: A Tencent Architect's Experience
DaTaobao Tech
DaTaobao Tech
Jul 26, 2023 · Backend Development

Design and Implementation of a Netty‑Based Microservice Communication Module

The article walks Java developers through constructing a lightweight Netty‑based RPC framework—using RocketMQ’s NettyRemotingServer and NettyRemotingClient to handle synchronous, asynchronous and one‑way calls, routing request codes to dedicated processors and thread pools, exposing services via dynamic proxies, and outlining extensions such as service‑registry integration for a full microservice communication solution.

Distributed SystemsNettyRPC
0 likes · 28 min read
Design and Implementation of a Netty‑Based Microservice Communication Module
Alibaba Cloud Developer
Alibaba Cloud Developer
Jul 26, 2023 · Backend Development

How a New Cache Design Boosted Logistics Pricing QPS Five‑Fold

This article reviews the performance challenges of the logistics pricing engine, explains the limitations of the previous 1.0 optimizations, and details the design and implementation of a new 2.0 cache strategy using Tair and local caches, key‑value modeling, pre‑heating, updates, and bottleneck mitigation, achieving the required QPS.

BackendDistributed SystemsPerformance Optimization
0 likes · 19 min read
How a New Cache Design Boosted Logistics Pricing QPS Five‑Fold
Java High-Performance Architecture
Java High-Performance Architecture
Jul 24, 2023 · Backend Development

Business Compensation in Distributed Systems: Rollback, Retry, and Consistency

This article explains how distributed systems handle business inconsistencies through compensation mechanisms, detailing rollback versus retry approaches, their implementation patterns such as explicit/implicit rollback, various retry strategies, and the trade‑offs between ACID and BASE consistency models for achieving eventual consistency.

ConsistencyDistributed SystemsMicroservices
0 likes · 11 min read
Business Compensation in Distributed Systems: Rollback, Retry, and Consistency
Tech Architecture Stories
Tech Architecture Stories
Jul 23, 2023 · Backend Development

Beyond Scale: Rethinking Architecture Boundaries for Massive Services

This article reflects on years of designing large‑scale backend systems at Tencent, discussing how to define clear architecture boundaries, ensure high availability, integrate diverse technologies, and use observability and monitoring to continuously evolve and improve massive service architectures.

ArchitectureDistributed SystemsObservability
0 likes · 25 min read
Beyond Scale: Rethinking Architecture Boundaries for Massive Services
Su San Talks Tech
Su San Talks Tech
Jul 22, 2023 · Backend Development

How Bilibili Scaled Its Relationship Chain Service from MySQL to KV and Redis

This article details how Bilibili’s relationship‑chain service evolved from a MySQL‑based design to a KV‑store with asynchronous writes, introduced multi‑layer caching with memcached, Redis hash and KV, added bloom‑filter optimization, and implemented hotspot mitigation to support millions of QPS.

BackendDistributed SystemsScalability
0 likes · 16 min read
How Bilibili Scaled Its Relationship Chain Service from MySQL to KV and Redis
Su San Talks Tech
Su San Talks Tech
Jul 21, 2023 · Backend Development

Mastering RocketMQ Transaction Messages: Theory, Flow, and Real‑World Example

This article explains RocketMQ transaction messages by covering their use cases, underlying mechanisms, and a step‑by‑step implementation that demonstrates how to ensure eventual consistency between message production and local database transactions in distributed Java back‑end systems.

Distributed SystemsMessage QueueRocketMQ
0 likes · 8 min read
Mastering RocketMQ Transaction Messages: Theory, Flow, and Real‑World Example
Sohu Tech Products
Sohu Tech Products
Jul 19, 2023 · Backend Development

Understanding RocketMQ Consumption Logic in Version 4.9.x

This article provides a comprehensive walkthrough of RocketMQ 4.9.x consumption architecture, covering the four core roles, publish‑subscribe model, storage structures, load‑balancing, long‑polling, concurrent and ordered consumption flows, progress persistence, and retry mechanisms, with illustrative diagrams and code snippets.

ConsumerDistributed SystemsMessage Queue
0 likes · 28 min read
Understanding RocketMQ Consumption Logic in Version 4.9.x
21CTO
21CTO
Jul 19, 2023 · Operations

Scaling a Fast‑Growing Supply Chain Platform: Architecture and Ops Insights

This article details how a rapidly expanding B2B fresh‑food company restructured its R&D organization, adopted a matrix management model, and built a comprehensive distributed infrastructure—including task scheduling, service discovery, messaging, logging, file storage, CDN, configuration, sharding, search, caching, and monitoring—to support nationwide warehouse operations and future growth.

DevOpsDistributed SystemsOperations
0 likes · 7 min read
Scaling a Fast‑Growing Supply Chain Platform: Architecture and Ops Insights
Weimob Technology Center
Weimob Technology Center
Jul 18, 2023 · Backend Development

How MOAT Enables Lightweight, Multi‑Dimensional Rate Limiting for Scalable Systems

MOAT is a lightweight, multi‑dimensional rate‑limiting component designed to ensure system stability amid traffic spikes, offering configurable rules, automatic blacklisting, and a closed‑loop control flow, with detailed architecture spanning access, logic, data, and logging layers, plus client‑side SDK integration.

Distributed SystemsMicroservicesPerformance
0 likes · 11 min read
How MOAT Enables Lightweight, Multi‑Dimensional Rate Limiting for Scalable Systems