Tag

Distributed Systems

0 views collected around this technical thread.

macrozheng
macrozheng
Jun 13, 2025 · Backend Development

How to Build a Real‑Time Chat with Spring Boot WebSocket: Step‑by‑Step Guide

This article explains how to integrate WebSocket into a Spring Boot project to create a lightweight instant‑messaging system, covering dependency setup, configuration classes, core server implementation, required modules, common deployment issues, and practical solutions with complete code examples.

Backend DevelopmentDistributed SystemsInstant Messaging
0 likes · 14 min read
How to Build a Real‑Time Chat with Spring Boot WebSocket: Step‑by‑Step Guide
IT Services Circle
IT Services Circle
May 28, 2025 · Backend Development

WeChat Technical Interview Experience: Java and C++ Candidates' Questions and Insights

The article shares detailed WeChat (wxg) interview experiences from Java and C++ candidates, highlighting the high difficulty, the blend of algorithmic depth and system design breadth, and the extensive range of technical questions that ultimately led both candidates to fail.

Distributed SystemsInterviewJava
0 likes · 6 min read
WeChat Technical Interview Experience: Java and C++ Candidates' Questions and Insights
Xiaokun's Architecture Exploration Notes
Xiaokun's Architecture Exploration Notes
May 25, 2025 · Fundamentals

How Consensus, CAP, and BASE Shape High‑Availability Architecture

This article explains the role of consensus algorithms in achieving high‑availability through redundancy and automatic failover, clarifies distributed consistency, explores the CAP theorem and its C component, and introduces the BASE theory as a practical complement for eventual consistency in modern distributed systems.

BASE theoryCAP TheoremConsensus
0 likes · 10 min read
How Consensus, CAP, and BASE Shape High‑Availability Architecture
Architect
Architect
May 21, 2025 · Databases

Designing Short Numeric ID Generation Using MySQL Auto‑Increment and Segment Allocation

The article examines the challenges of generating short, user‑friendly numeric account IDs, evaluates Snowflake and MySQL auto‑increment approaches, discusses deadlock issues with REPLACE INTO, and presents a final segment‑based solution that allocates ID blocks per login server while avoiding waste and concurrency problems.

Distributed SystemsMySQLauto_increment
0 likes · 12 min read
Designing Short Numeric ID Generation Using MySQL Auto‑Increment and Segment Allocation
Full-Stack Internet Architecture
Full-Stack Internet Architecture
May 20, 2025 · Big Data

Why Learn Kafka? Core Benefits, Use Cases, and a Summary

This article explains why Kafka is widely adopted by top companies, outlines its high throughput, scalability, and durability, and describes key real‑time data pipeline, stream processing, and big‑data integration scenarios, concluding that mastering Kafka is essential for modern backend and data engineering roles.

Big DataDistributed SystemsKafka
0 likes · 4 min read
Why Learn Kafka? Core Benefits, Use Cases, and a Summary
FunTester
FunTester
May 20, 2025 · Operations

Baseline Metrics for Initiating Chaos Engineering

The article outlines essential baseline metrics—including application, SEV, alert, and infrastructure indicators—required before launching chaos engineering experiments, describes a multi‑stage experiment sequence across known and unknown system areas, and presents best‑practice guidelines for safely conducting chaos tests in production environments.

Distributed Systemsbaseline metricschaos engineering
0 likes · 9 min read
Baseline Metrics for Initiating Chaos Engineering
FunTester
FunTester
May 19, 2025 · Operations

Chaos Engineering Tools, Theory, and Practices

Chaos engineering, a scientific method for improving system resilience, is explored through an overview of leading tools such as Gremlin, ChaosBlade, Chaos Mesh, Chaos Toolkit, and ChaosMeta, alongside core concepts, real-world case studies, common misconceptions, and the practical value of controlled fault injection in distributed systems.

Distributed Systemschaos engineeringfault injection
0 likes · 12 min read
Chaos Engineering Tools, Theory, and Practices
Xiaokun's Architecture Exploration Notes
Xiaokun's Architecture Exploration Notes
May 18, 2025 · Fundamentals

How Distributed Consensus Overcomes the FLP Impossibility Theorem

This article explores how to build fault‑tolerant distributed systems by formalizing consensus, outlines its core properties, explains the FLP impossibility theorem, and shows how algorithms like Raft sidestep its limits through timing constraints and recovery mechanisms.

ConsensusDistributed SystemsFLP theorem
0 likes · 8 min read
How Distributed Consensus Overcomes the FLP Impossibility Theorem
FunTester
FunTester
May 16, 2025 · Operations

Chaos Engineering: Evolution, Workflow, Advantages, and Practice Principles

Chaos engineering is a discipline that deliberately injects faults into distributed systems to test and improve resilience, tracing its evolution from Netflix's Chaos Monkey to modern platforms, outlining its operational workflow, benefits, and core principles for reliable system design.

Distributed SystemsSREchaos engineering
0 likes · 9 min read
Chaos Engineering: Evolution, Workflow, Advantages, and Practice Principles
Top Architecture Tech Stack
Top Architecture Tech Stack
May 15, 2025 · Backend Development

Understanding Cookie + Session Mechanism and Distributed Session Sharing Solutions

This article explains the Cookie + Session mechanism for maintaining user state, discusses its limitations such as size, performance and security, examines challenges in distributed environments, and reviews common solutions including session replication, sticky load balancing, centralized storage, and the use of ThreadLocal for small‑scale backend applications.

Distributed SystemsWeb Developmentbackend
0 likes · 17 min read
Understanding Cookie + Session Mechanism and Distributed Session Sharing Solutions
Code Ape Tech Column
Code Ape Tech Column
May 14, 2025 · Databases

Designing Short Numeric ID Generation with MySQL: From Snowflake to Segment Allocation

This article explores the evolution from using the Snowflake algorithm to a MySQL‑based short numeric ID generation scheme, detailing challenges such as long IDs, deadlocks with REPLACE INTO, and presenting a segment‑based solution that allocates ID blocks, tracks unused IDs, and avoids concurrency issues.

DeadlockDistributed SystemsMySQL
0 likes · 11 min read
Designing Short Numeric ID Generation with MySQL: From Snowflake to Segment Allocation
Java Architecture Diary
Java Architecture Diary
May 12, 2025 · Big Data

Why Kafka 4.0’s KRaft Mode Is a Game‑Changer for Distributed Messaging

Kafka 4.0 replaces ZooKeeper with the native KRaft (Kafka Raft) consensus layer, simplifying deployment, boosting scalability, cutting latency, and introducing a new incremental consumer rebalance protocol and share‑group model that together reshape how distributed streaming applications are built and operated.

Consumer RebalanceDistributed SystemsJava
0 likes · 9 min read
Why Kafka 4.0’s KRaft Mode Is a Game‑Changer for Distributed Messaging
Xiaokun's Architecture Exploration Notes
Xiaokun's Architecture Exploration Notes
May 11, 2025 · Fundamentals

Why Unreliable Networks Threaten Distributed Systems and How to Mitigate Them

Distributed systems suffer from network unreliability—including packet loss, out‑of‑order delivery, variable latency, and ambiguous node failures—making timeout settings and fault detection challenging, and this article explains these issues, compares synchronous and asynchronous networks, and discusses strategies to balance latency and resource utilization.

Distributed Systemsasynchronous networkfault tolerance
0 likes · 8 min read
Why Unreliable Networks Threaten Distributed Systems and How to Mitigate Them
Xiaokun's Architecture Exploration Notes
Xiaokun's Architecture Exploration Notes
May 11, 2025 · Fundamentals

Why Unreliable Clocks Threaten Distributed Systems—and How to Fix Them

This article examines the unreliability of physical clocks in distributed systems, compares synchronous and asynchronous network timing, explains the roles of wall and monotonic clocks, and explores logical clocks, snapshot isolation, and practical solutions such as Google Spanner's TrueTime to ensure data consistency.

Distributed Systemsclock synchronizationdata consistency
0 likes · 11 min read
Why Unreliable Clocks Threaten Distributed Systems—and How to Fix Them
Xiaokun's Architecture Exploration Notes
Xiaokun's Architecture Exploration Notes
May 11, 2025 · Fundamentals

How Fencing Tokens Ensure Safety and Liveness in Distributed Lock Services

This article explores how fencing tokens can provide safety and liveness guarantees in distributed lock services, illustrating fault scenarios, token-based conflict resolution, and abstract system models that help engineers prioritize correctness while tolerating temporary unavailability.

Distributed Systemsfault tolerancefencing tokens
0 likes · 8 min read
How Fencing Tokens Ensure Safety and Liveness in Distributed Lock Services
Code Ape Tech Column
Code Ape Tech Column
May 9, 2025 · Databases

Efficient Strategies for Importing One Billion Records into MySQL

This article explains how to import 1 billion 1 KB log records stored in HDFS or S3 into MySQL by analyzing single‑table limits, using batch inserts, choosing storage engines, sharding, optimizing file‑reading methods, and coordinating distributed tasks with Redis, Redisson, and Zookeeper to ensure ordered, reliable, and high‑throughput data loading.

Distributed SystemsKafkaMySQL
0 likes · 19 min read
Efficient Strategies for Importing One Billion Records into MySQL
Xiaokun's Architecture Exploration Notes
Xiaokun's Architecture Exploration Notes
May 4, 2025 · Fundamentals

Why Unreliable Clocks Threaten Distributed Systems—and How to Fix Them

This article examines how unreliable physical clocks—both wall and monotonic—affect distributed systems, compares synchronous and asynchronous network timing, illustrates conflicts caused by timestamp drift, and presents logical clocks and Google’s TrueTime as robust solutions for achieving consistent ordering and data reliability.

Distributed SystemsTrueTimeclock synchronization
0 likes · 11 min read
Why Unreliable Clocks Threaten Distributed Systems—and How to Fix Them
Java Captain
Java Captain
May 4, 2025 · Backend Development

Understanding Java Counter Implementations: AtomicLong vs LongAdder

This article explains the principles, advantages, and drawbacks of Java's AtomicLong and LongAdder counters, describes the CAS operation and its ABA problem, and analyzes why Alibaba recommends LongAdder for high‑concurrency, high‑availability scenarios in distributed systems.

AtomicLongCASDistributed Systems
0 likes · 7 min read
Understanding Java Counter Implementations: AtomicLong vs LongAdder
Cognitive Technology Team
Cognitive Technology Team
Apr 30, 2025 · Backend Development

Preventing Message Loss, Duplicate Consumption, and Backlog in RocketMQ: Best Practices and Strategies

This article examines the three major reliability challenges of message queues—loss, duplicate consumption, and backlog—and provides detailed RocketMQ‑specific strategies, including producer acknowledgment, broker replication, idempotent consumer design, monitoring, scaling, and parameter tuning to ensure high‑availability distributed systems.

Distributed SystemsMessage QueueRocketMQ
0 likes · 17 min read
Preventing Message Loss, Duplicate Consumption, and Backlog in RocketMQ: Best Practices and Strategies
macrozheng
macrozheng
Apr 30, 2025 · Fundamentals

Key Questions for a Basic Infrastructure Interview: TCP, Redis, Kafka, CAP & More

This article compiles essential interview questions covering TCP connection termination, multi‑port listening, page load workflow, Redis data structures, Kafka consumer sizing and at‑most‑once semantics, the CAP theorem, Singleton usage, C++ map complexity, and a doubly linked list reversal algorithm, providing concise explanations and code examples.

AlgorithmsBackend DevelopmentDistributed Systems
0 likes · 14 min read
Key Questions for a Basic Infrastructure Interview: TCP, Redis, Kafka, CAP & More