Search

Discover articles.

Search across authors, categories, and technical themes. The layout mirrors the editorial references while staying responsive and fast.

Results

Matches for “distributed systems”

1000 results
Fundamentals May 11, 2025 Xiaokun's Architecture Exploration Notes

Why Unreliable Clocks Threaten Distributed Systems—and How to Fix Them

This article examines the unreliability of physical clocks in distributed systems, compares synchronous and asynchronous network timing, explains the roles of wall and monotonic clocks, and explores logical clocks, snapshot isolation, and practical solutions such as Google Spanner's TrueTime to ensure data consistency.

distributed systemsdata consistencymonotonic clocklogical clockclock synchronization
Fundamentals May 11, 2025 Xiaokun's Architecture Exploration Notes

Why Unreliable Networks Threaten Distributed Systems and How to Mitigate Them

Distributed systems suffer from network unreliability—including packet loss, out‑of‑order delivery, variable latency, and ambiguous node failures—making timeout settings and fault detection challenging, and this article explains these issues, compares synchronous and asynchronous networks, and discusses strategies to balance latency and resource utilization.

distributed systemsfault tolerancenetwork reliabilityasynchronous networksynchronous network
Fundamentals May 4, 2025 Xiaokun's Architecture Exploration Notes

Why Unreliable Clocks Threaten Distributed Systems—and How to Fix Them

This article examines how unreliable physical clocks—both wall and monotonic—affect distributed systems, compares synchronous and asynchronous network timing, illustrates conflicts caused by timestamp drift, and presents logical clocks and Google’s TrueTime as robust solutions for achieving consistent ordering and data reliability.

distributed systemsTrueTimemonotonic clocklogical clockclock synchronization
Fundamentals Apr 20, 2025 Xiaokun's Architecture Exploration Notes

Why Unreliable Networks Threaten Distributed Systems—and How to Mitigate Them

The article explains how network failures such as packet loss, reordering, latency, and ambiguous node failures make distributed systems unreliable, compares synchronous and asynchronous networks, and discusses the trade‑off between timeout settings and resource utilization.

distributed systemslatencynode failurenetwork reliabilityasynchronous networksynchronous network
Fundamentals Apr 3, 2025 Cognitive Technology Team

Understanding CAP Theory and BASE: Data Consistency in Distributed Systems

This article explains the CAP theorem and its practical extension BASE, describing their core concepts, trade‑off combinations, typical components such as Zookeeper, Eureka, and Nacos, and engineering techniques like asynchronous replication, Saga, and idempotent design for building highly available distributed systems.

distributed systemsCAP theoremconsistencyBASEavailabilitypartition tolerance
Fundamentals Mar 26, 2025 DeWu Technology

Consistency Challenges and Solutions in Distributed Systems: CAP, BASE, RPC, and Messaging

To address consistency problems in distributed systems, the article explains CAP and BASE trade‑offs, shows how transactional RPC and messaging—using retries, RocketMQ two‑phase commits, Spring @TransactionalEventListener, or a local message log—can ensure atomic updates, and compares their reliability, latency, and performance impacts.

microservicesBASE theoryCAP theoremdistributed consistencymessage queues
Operations Mar 7, 2025 FunTester

Fault Testing: Proactive Resilience Engineering for Distributed Systems

Fault testing, akin to a shield, deliberately injects failures into distributed and cloud‑native systems to expose weak points, verify recovery mechanisms, and improve overall reliability, ensuring business continuity even under unexpected disruptions.

distributed systemsoperationschaos engineeringresiliencefault testing
Operations Mar 2, 2025 FunTester

Common Fault Propagation Patterns and Prevention Strategies in Distributed Systems

The article examines typical fault propagation scenarios such as avalanche effects, cascading failures, resource exhaustion, data pollution, and dependency cycles in distributed systems, and outlines proactive measures like rate limiting, circuit breaking, isolation, monitoring, and chaos engineering to prevent small issues from escalating into large-scale outages.

distributed systemsmonitoringchaos engineeringfault tolerancerate limitingcircuit breaker
Backend Development Feb 18, 2025 Selected Java Interview Questions

Why Java synchronized Is Insufficient in Distributed Systems and Alternative Lock Solutions

The article explains how Java's synchronized keyword works only within a single JVM, why it fails to provide mutual exclusion across multiple processes in distributed systems, and presents alternative distributed locking mechanisms such as database locks and Redis-based locks with code examples.

JavaConcurrencyRedisdistributed locksynchronizeddatabase lock
Fundamentals Jan 13, 2025 IT Architects Alliance

Strong Consistency vs. Eventual Consistency in Distributed Systems

The article explains the principles, implementation techniques, trade‑offs, and typical use cases of strong consistency and eventual consistency in distributed systems, helping architects decide which model best fits the performance, availability, and correctness requirements of their applications.

distributed systemsCAP theoremconsistencyeventual consistencystrong consistency
Page 1 Next