Tagged articles
43 articles
Page 1 of 1
Cognitive Technology Team
Cognitive Technology Team
Mar 27, 2026 · Operations

How to Build a Rock‑Solid High‑Availability Architecture: Redundancy, Defense, and Smooth Deployments

This article breaks down high‑availability architecture into redundancy, defensive degradation, and release mechanisms, offering concrete techniques, real‑world failure case studies, and step‑by‑step configurations to ensure continuous service even under heavy load or component failures.

Kubernetesci/cdcircuit breaker
0 likes · 16 min read
How to Build a Rock‑Solid High‑Availability Architecture: Redundancy, Defense, and Smooth Deployments
Su San Talks Tech
Su San Talks Tech
Jul 7, 2025 · Operations

Mastering High Availability: Redundancy & Automatic Failover in Modern Internet Architecture

This article explains how to achieve high availability in internet systems by designing redundant components and automatic failover mechanisms across layers such as load balancers, reverse proxies, microservices, middleware, databases, and messaging, illustrating concepts with diagrams of architectures, clustering, leader election, and practical tools like keepalived, Zookeeper, Redis Sentinel, and Kafka.

MicroservicesOperationsfailover
0 likes · 19 min read
Mastering High Availability: Redundancy & Automatic Failover in Modern Internet Architecture
IT Architects Alliance
IT Architects Alliance
Jan 6, 2025 · Operations

Ensuring High Reliability in Distributed Systems: Redundancy, Fault Detection, Replication, and Resilience Strategies

The article explores how distributed systems achieve high reliability through redundant design, precise fault detection and recovery, data replication and synchronization, coordinated fault tolerance and load balancing, distributed transaction handling, comprehensive monitoring, elastic scaling, security safeguards, and robust disaster‑recovery planning.

Reliabilityfault tolerancemonitoring
0 likes · 18 min read
Ensuring High Reliability in Distributed Systems: Redundancy, Fault Detection, Replication, and Resilience Strategies
IT Architects Alliance
IT Architects Alliance
Dec 29, 2024 · Operations

Design Principles and Key Technologies for High‑Availability Systems

The article explains why 24/7 high‑availability systems are essential for modern enterprises and details core design principles, layered architecture, and critical technologies such as redundancy, load balancing, caching, elastic scaling, monitoring, and fault‑tolerance to ensure continuous, reliable service.

System Designcloud computinghigh availability
0 likes · 23 min read
Design Principles and Key Technologies for High‑Availability Systems
Cognitive Technology Team
Cognitive Technology Team
Nov 15, 2024 · Operations

Building Redundancy in Applications to Avoid Single Points of Failure

The article explains how to design resilient applications by identifying critical paths, adding redundant components, using formulas for overall availability, and applying best‑practice recommendations such as multi‑zone/region deployment, load‑balanced VMs, database replication, and thorough testing of failover mechanisms.

cloud architecturehigh availabilityload balancing
0 likes · 6 min read
Building Redundancy in Applications to Avoid Single Points of Failure
Mike Chen's Internet Architecture
Mike Chen's Internet Architecture
May 16, 2024 · Operations

High Availability Architecture: Eight Common Solutions for Large‑Scale Websites

This article explains the concept of high‑availability architecture and details eight practical solutions—including redundant servers, load balancers, data backup strategies, security measures, redundancy patterns, automated operations, and monitoring/alert systems—to help large‑scale sites achieve continuous, fault‑tolerant service.

load balancingredundancy
0 likes · 7 min read
High Availability Architecture: Eight Common Solutions for Large‑Scale Websites
Top Architecture Tech Stack
Top Architecture Tech Stack
Nov 26, 2023 · Operations

Understanding High Availability and High Performance: Complexity, Redundancy, and Decision Strategies

This article examines the inherent complexity of achieving high availability and high performance in distributed systems, explaining redundancy techniques, storage consistency challenges, various state‑decision models, and the trade‑offs involved in scaling single‑machine and cluster architectures.

Distributed SystemsSystem Designhigh availability
0 likes · 27 min read
Understanding High Availability and High Performance: Complexity, Redundancy, and Decision Strategies
Open Source Linux
Open Source Linux
Oct 25, 2023 · Operations

Scaling Servers for Millions: Load Balancing, Sharding, CDN Strategies

This guide explains how to design and expand server infrastructure to handle millions of concurrent users by using load balancers, database sharding, caching, CDNs, hardware selection criteria, and redundancy techniques, ensuring high availability and performance 24/7.

CDNdatabase shardingload balancing
0 likes · 11 min read
Scaling Servers for Millions: Load Balancing, Sharding, CDN Strategies
Open Source Linux
Open Source Linux
Apr 11, 2023 · Operations

Boost Network Reliability: Link Aggregation, Switch Stacking, and HSRP Explained

This article introduces the concept of link aggregation for combining multiple data channels into a higher‑bandwidth logical link, demonstrates configuration steps on Cisco‑style switches, explains switch stacking for increased port capacity and redundancy, and outlines HSRP hot‑standby routing to ensure continuous network availability.

EtherChannelHSRPlink aggregation
0 likes · 7 min read
Boost Network Reliability: Link Aggregation, Switch Stacking, and HSRP Explained
Laravel Tech Community
Laravel Tech Community
Oct 13, 2022 · Backend Development

Designing a Scalable Backend for a Nationwide ID Query Service

The article outlines a simple yet scalable backend architecture that can handle 20 million daily ID queries by partitioning a billion‑record dataset across multiple 16 GB virtual machines, using direct‑index lookups, modest bandwidth, and basic redundancy mechanisms to achieve ample performance headroom.

Distributed SystemsScalabilityredundancy
0 likes · 6 min read
Designing a Scalable Backend for a Nationwide ID Query Service
dbaplus Community
dbaplus Community
Oct 8, 2022 · Operations

Designing High‑Availability Internet Architecture: Redundancy and Automatic Failover

This article explains how to achieve high availability in internet systems by layering architecture, using redundancy and automatic failover across access, proxy, microservice, middleware, and storage components, and discusses practical techniques, common pitfalls, and operational safeguards for resilient services.

MicroservicesOperationsautomatic failover
0 likes · 19 min read
Designing High‑Availability Internet Architecture: Redundancy and Automatic Failover
Programmer DD
Programmer DD
Oct 8, 2022 · Fundamentals

Eight Timeless Computer Architecture Principles Every Designer Should Know

This article outlines eight enduring ideas—from designing for Moore's Law and using abstraction to speeding up common cases, leveraging parallelism, pipelining, prediction, memory hierarchy, and redundancy—that have shaped computer architecture over the past six decades.

Moore's LawParallelismPipeline
0 likes · 11 min read
Eight Timeless Computer Architecture Principles Every Designer Should Know
Liangxu Linux
Liangxu Linux
Sep 22, 2022 · Operations

How to Choose the Right Server: Key Specs and Bandwidth Calculations

This article explains how to select a server by reviewing popular brands and detailing essential parameters such as bandwidth capacity, CPU characteristics, chipset architecture, memory requirements, storage options, network interfaces, redundancy features, and scalability considerations, including a practical bandwidth‑to‑online‑users calculation.

Bandwidth CalculationCPU specsNetwork Card
0 likes · 10 min read
How to Choose the Right Server: Key Specs and Bandwidth Calculations
Open Source Linux
Open Source Linux
Aug 15, 2022 · Operations

How to Choose the Right Server: Key Brands and Essential Specs Explained

This guide explains what a server is, lists the most common server brands, and walks through the crucial hardware parameters—bandwidth, CPU, chipset, memory, storage, network cards, redundancy, hot‑swap and scalability—helping you make an informed server purchase decision.

Bandwidth CalculationCPU specsScalability
0 likes · 9 min read
How to Choose the Right Server: Key Brands and Essential Specs Explained
Architect's Alchemy Furnace
Architect's Alchemy Furnace
May 10, 2022 · Operations

How to Build Truly High‑Availability Systems: Redundancy, Failover, and Layered Architecture

High availability (HA) is essential for distributed systems, requiring redundancy and automatic failover across each architectural layer—from client to proxy, gateway, business logic, cache, and storage—to minimize downtime, achieve desired “nines” of uptime, and prevent cascading failures such as service snowballing.

Distributed SystemsSystem Architecturefailover
0 likes · 14 min read
How to Build Truly High‑Availability Systems: Redundancy, Failover, and Layered Architecture
Architects' Tech Alliance
Architects' Tech Alliance
Dec 29, 2021 · Fundamentals

Core Switch vs. Regular Switch: Differences, Advantages, and Key Technologies

This article explains how core switches differ from ordinary switches in port count, network layer placement, performance features such as large cache, high capacity, virtualization, TRILL, FCOE, and how technologies like link aggregation, redundancy, stacking, and HSRP enhance data‑center reliability and scalability.

HSRPcore switchlink aggregation
0 likes · 10 min read
Core Switch vs. Regular Switch: Differences, Advantages, and Key Technologies
Alibaba Cloud Developer
Alibaba Cloud Developer
Jul 2, 2021 · Fundamentals

Why Data Loss Happens: Hidden CPU Silent Errors and How to Prevent Them

This article explains the concepts of data loss and corruption, outlines common bit‑flip sources in disks, memory, network and CPUs, describes how silent CPU data errors are discovered and verified, and presents multi‑layer design strategies—including redundancy, checksums, logging and recovery—to ensure data is neither lost nor corrupted.

CPU SDEStorage Reliabilitydata integrity
0 likes · 17 min read
Why Data Loss Happens: Hidden CPU Silent Errors and How to Prevent Them
Architects' Tech Alliance
Architects' Tech Alliance
Feb 8, 2021 · Fundamentals

Overview of UPS Technologies and Redundant Power System Architectures

This article explains the fundamentals of UPS systems, describing standby, line‑interactive, and double‑conversion topologies, their operational principles, economic mode based on ITIC curves, and various redundancy configurations such as parallel N+1, centralized and distributed bypass, and dual‑bus architectures for data‑center reliability.

Data centerElectrical EngineeringUPS
0 likes · 10 min read
Overview of UPS Technologies and Redundant Power System Architectures
FunTester
FunTester
Dec 12, 2020 · Operations

Why Redundancy Is the Key to Effective Disaster Recovery in IT Systems

The article explains that disaster recovery for information systems relies on redundancy across hardware, energy, and data, classifies natural, human, and technical disasters, defines critical metrics such as RTO and RPO, and outlines the technologies, architectures, and maturity levels needed to ensure business continuity.

RPORTObusiness continuity
0 likes · 29 min read
Why Redundancy Is the Key to Effective Disaster Recovery in IT Systems
IT Architects Alliance
IT Architects Alliance
Dec 2, 2020 · Operations

Understanding High Availability: Sources of Complexity and Decision Strategies

The article explains high availability as a source of system complexity, describing how redundancy, hardware and software failures, external disasters, and state‑decision mechanisms such as dictatorial, negotiated, and democratic approaches affect both compute and storage layers, and discusses trade‑offs like the CAP theorem.

CAP theoremDistributed SystemsSystem Design
0 likes · 12 min read
Understanding High Availability: Sources of Complexity and Decision Strategies
Architects' Tech Alliance
Architects' Tech Alliance
Jun 6, 2020 · Fundamentals

Core Switch vs. Regular Switch: Key Differences, Advantages, and Deployment Practices

The article explains what distinguishes core switches from ordinary switches, outlines their architectural roles, port and performance differences, and describes advanced features such as large buffers, high capacity, virtualization, TRILL, FCOE, link aggregation, redundancy, stacking, and HSRP for reliable data‑center networking.

HSRPcore switchlink aggregation
0 likes · 12 min read
Core Switch vs. Regular Switch: Key Differences, Advantages, and Deployment Practices
Efficient Ops
Efficient Ops
Mar 31, 2020 · Information Security

Can You Really Destroy Alipay’s Storage? Inside Financial Data Center Redundancy

This article explores the layered redundancy of financial data centers, explaining hot and cold backups, multi‑site architectures, power supply safeguards, fire‑suppression systems, and why simply attacking a single component is unlikely to cripple services like Alipay.

Backup StrategiesData Center Securityfinancial systems
0 likes · 9 min read
Can You Really Destroy Alipay’s Storage? Inside Financial Data Center Redundancy
Efficient Ops
Efficient Ops
Mar 18, 2020 · Fundamentals

Master RAID in One Minute: Quick Guide to Disk Array Types & Benefits

This concise guide explains RAID fundamentals, covering hardware vs. software implementations, various RAID levels, deployment methods, and the strengths and weaknesses of each configuration, all illustrated with clear diagrams for rapid comprehension.

RAIDSoftware RAIDdisk array
0 likes · 5 min read
Master RAID in One Minute: Quick Guide to Disk Array Types & Benefits
Programmer DD
Programmer DD
Dec 8, 2019 · Operations

Can Your Money Survive a Bombed Alipay Server? Inside Data Center Redundancy

The article explores how Alipay’s financial data is protected through multi‑site data centers, hot and cold backups, and disaster‑recovery mechanisms, explaining why destroying a single server—or even multiple facilities—won’t instantly erase users’ funds, and outlining the lengths required to truly cripple the system.

BackupData centerdisaster recovery
0 likes · 10 min read
Can Your Money Survive a Bombed Alipay Server? Inside Data Center Redundancy
Architecture Digest
Architecture Digest
Nov 16, 2019 · Operations

What Happens If Alipay’s Data Centers Are Physically Destroyed? A Deep Dive into Redundancy and Disaster Recovery

The article examines how Alipay’s financial data would survive a physical destruction of its servers by explaining multi‑site data center architectures, hot and cold backups, power redundancy, fire‑suppression systems, and the role of partner banks in data recovery, highlighting the extensive resilience measures in modern financial infrastructures.

AlipayData centerOperations
0 likes · 8 min read
What Happens If Alipay’s Data Centers Are Physically Destroyed? A Deep Dive into Redundancy and Disaster Recovery
21CTO
21CTO
Sep 26, 2017 · Operations

Why You Should Never Trust Any Component in Your System—and How to Protect It

In programming and operations, every element—from services and dependencies to requests, machines, data centers, power, networks, and humans—can fail unexpectedly, so you must assume distrust and implement defensive measures such as monitoring, redundancy, rate limiting, fallback strategies, backups, and automated deployment.

OperationsReliabilitySecurity
0 likes · 9 min read
Why You Should Never Trust Any Component in Your System—and How to Protect It
Qunar Tech Salon
Qunar Tech Salon
Jan 16, 2017 · Backend Development

Scalable Web Architecture and Distributed Systems

This article explains the key design principles, components, and techniques—such as availability, performance, reliability, scalability, cost, redundancy, partitioning, caching, proxies, indexing, load balancing, and queuing—required to build large‑scale, high‑performance, and fault‑tolerant web and distributed systems, illustrated with an image‑hosting example.

Web Performancecachingredundancy
0 likes · 37 min read
Scalable Web Architecture and Distributed Systems
Efficient Ops
Efficient Ops
Oct 8, 2016 · Operations

How to Boost Server Resource Utilization: Strategies, Trade‑offs, and Metrics

This article explains why servers often run far below their theoretical capacity, defines the concept of highest usable resource utilization, and offers practical and advanced techniques—such as multithreading, workload consolidation, resource layering, and overselling—to improve utilization while weighing performance, cost, and reliability impacts.

OperationsPerformance OptimizationResource Efficiency
0 likes · 9 min read
How to Boost Server Resource Utilization: Strategies, Trade‑offs, and Metrics
21CTO
21CTO
Sep 12, 2015 · Fundamentals

Unlocking RAID: How Different Levels Balance Speed, Redundancy, and Cost

This article provides a comprehensive overview of RAID technology, explaining its purpose, the various standard levels from RAID 0 to RAID 6, hybrid configurations, non‑standard implementations like DRFS, and both software and firmware/driver based deployment methods.

FilesystemRAIDperformance
0 likes · 13 min read
Unlocking RAID: How Different Levels Balance Speed, Redundancy, and Cost