Tag

availability

0 views collected around this technical thread.

Cognitive Technology Team
Cognitive Technology Team
Apr 3, 2025 · Fundamentals

Understanding CAP Theory and BASE: Data Consistency in Distributed Systems

This article explains the CAP theorem and its practical extension BASE, describing their core concepts, trade‑off combinations, typical components such as Zookeeper, Eureka, and Nacos, and engineering techniques like asynchronous replication, Saga, and idempotent design for building highly available distributed systems.

BASECAP theoremDistributed Systems
0 likes · 5 min read
Understanding CAP Theory and BASE: Data Consistency in Distributed Systems
Efficient Ops
Efficient Ops
Feb 6, 2025 · Operations

Inside Alipay’s Full‑Ecosystem Availability Monitoring: Architecture and Practices

At the 2024 GOPS Global Operations Conference in Shanghai, Alipay’s monitoring lead Tang Liang presented the challenges, architecture, risk‑prevention practices, and implementation details of the company’s full‑ecosystem availability monitoring system, highlighting its role in DevOps, SRE, and AIOps initiatives.

AIOpsDevOpsMonitoring
0 likes · 4 min read
Inside Alipay’s Full‑Ecosystem Availability Monitoring: Architecture and Practices
Architecture and Beyond
Architecture and Beyond
Feb 6, 2025 · Operations

Analyzing DeepSeek’s Availability Issues and Applying Traditional Internet Reliability Strategies to AIGC

This article examines DeepSeek’s frequent service interruptions, contrasts the inherent reliability challenges of AIGC products with traditional internet applications, and proposes adopting proven isolation, rate‑limiting, and elastic‑scaling techniques to improve AI service availability and user experience.

AIGCDeepSeekavailability
0 likes · 12 min read
Analyzing DeepSeek’s Availability Issues and Applying Traditional Internet Reliability Strategies to AIGC
Mike Chen's Internet Architecture
Mike Chen's Internet Architecture
Jul 27, 2024 · Fundamentals

Understanding Distributed BASE: Basically Available, Soft State, and Eventual Consistency

This article explains the core concepts of the BASE theory—Basically Available, Soft State, and Eventual Consistency—illustrating how distributed systems trade off consistency for higher availability and performance, and provides practical examples and typical application scenarios.

BASEDistributed SystemsSoft State
0 likes · 15 min read
Understanding Distributed BASE: Basically Available, Soft State, and Eventual Consistency
Tencent Cloud Developer
Tencent Cloud Developer
Jul 17, 2024 · Operations

Combining FMEA and Chaos Engineering to Improve Software Architecture Availability

By integrating the proactive, static risk assessment of Failure Mode and Effects Analysis with the dynamic fault‑injection validation of chaos engineering, the article demonstrates how cloud‑native architectures—illustrated through a Tencent‑based e‑commerce case—can systematically identify, quantify, and mitigate availability risks, leading to continuous, measurable resilience improvements.

Chaos EngineeringFMEAavailability
0 likes · 16 min read
Combining FMEA and Chaos Engineering to Improve Software Architecture Availability
Efficient Ops
Efficient Ops
May 29, 2024 · Operations

Essential Operations Metrics Every IT Team Should Track

In today’s competitive business landscape, tracking key operations metrics—such as availability, failure rate, MTTR, MTBF, response time, throughput, error rate, and various utilization and data integrity measures—helps organizations monitor performance, reduce costs, ensure reliability, and maintain regulatory compliance.

IT performanceMonitoringavailability
0 likes · 7 min read
Essential Operations Metrics Every IT Team Should Track
Efficient Ops
Efficient Ops
May 12, 2024 · Operations

From Firefighting to Fire‑Starting: Mastering Operations for System Reliability

The article outlines a three‑stage evolution of operations—from rapid incident response to proactive fault‑injection—while offering practical guidance on improving availability, visualizing changes, and aligning technical metrics with business value to elevate the role of operations engineers.

SREavailabilityfault injection
0 likes · 7 min read
From Firefighting to Fire‑Starting: Mastering Operations for System Reliability
Efficient Ops
Efficient Ops
Feb 1, 2024 · Operations

How Tencent’s Public Gateway Overcomes Extreme Availability Challenges

The article details Tencent's Public Gateway (TGW) architecture, its forwarding and control planes, and presents two real‑world extreme failure cases— a NIC batch bug and a special IPv6 packet causing core dumps—along with the multi‑level disaster‑recovery design and mitigation strategies employed to ensure high availability.

Tencent Cloudavailabilitydisaster recovery
0 likes · 8 min read
How Tencent’s Public Gateway Overcomes Extreme Availability Challenges
Tencent Cloud Developer
Tencent Cloud Developer
Sep 13, 2023 · Cloud Native

Designing and Implementing a Payment Fund Account System

The article details how to design and implement a cloud‑native payment fund account system on Tencent Cloud, covering account definitions, fund flow and multiple account types, TDSQL storage, separated fund and account services, robust security, distributed transactions, auditing, reconciliation, and high‑availability measures for high‑concurrency merchant payments.

Distributed DatabasePayment SystemSecurity
0 likes · 35 min read
Designing and Implementing a Payment Fund Account System
政采云技术
政采云技术
Apr 27, 2023 · Backend Development

Understanding CAP Theorem, BASE Theory, and Their Implementation with Zookeeper (CP) and Eureka (AP)

This article explains the CAP theorem and its trade‑offs, introduces the BASE model as a practical compromise, and demonstrates how Zookeeper implements a CP registration center while Eureka adopts an AP approach, illustrating the impact on consistency, availability, and partition tolerance in distributed systems.

BASE theoryCAP theoremDistributed Systems
0 likes · 12 min read
Understanding CAP Theorem, BASE Theory, and Their Implementation with Zookeeper (CP) and Eureka (AP)
Continuous Delivery 2.0
Continuous Delivery 2.0
Mar 9, 2023 · Fundamentals

Ten Essential Software Architecture Quality Attributes

The article explains ten key non‑functional quality attributes of software architecture—such as scalability, availability, consistency, resilience, usability, observability, security, persistence, agility, and maintainability—describing their meanings, typical implementation techniques, and why selecting the right attributes is crucial for any system.

Securityavailabilitynon-functional requirements
0 likes · 9 min read
Ten Essential Software Architecture Quality Attributes
NetEase Yanxuan Technology Product Team
NetEase Yanxuan Technology Product Team
Nov 14, 2022 · Operations

Quantifying Internet Service Availability: Classic Metrics and the New User‑Uptime Indicator

The article reviews classic availability metrics such as Success‑Ratio, Incident‑Ratio, MTTR/MTTF, Error‑Budget, and SLA/SLO, then introduces User‑Uptime—a per‑user success time proportion that ignores long idle periods—and its windowed variant, showing how it complements existing indicators for more user‑centric reliability insight.

MonitoringSREavailability
0 likes · 27 min read
Quantifying Internet Service Availability: Classic Metrics and the New User‑Uptime Indicator
Architects Research Society
Architects Research Society
May 6, 2022 · Fundamentals

Understanding the CAP Theorem and How PACELC Extends It

The article explains the CAP theorem’s three properties—consistency, availability, and partition tolerance—its implications for distributed systems, highlights its limitations, introduces the PACELC extension that adds latency versus consistency trade‑offs when no partition exists, and provides real‑world database examples.

CAP theoremDistributed SystemsPACELC
0 likes · 7 min read
Understanding the CAP Theorem and How PACELC Extends It
Architect
Architect
Jan 13, 2022 · Databases

Understanding the CAP Theorem: Origins, Principles, and Implications for Distributed Databases

This article explains the origin of the CAP theorem, its formal definitions of consistency, availability, and partition tolerance, examines how it applies to relational and NoSQL databases, discusses trade‑offs and practical strategies for balancing these properties in distributed system design.

CAP theoremDistributed Systemsavailability
0 likes · 31 min read
Understanding the CAP Theorem: Origins, Principles, and Implications for Distributed Databases
DevOps Cloud Academy
DevOps Cloud Academy
Aug 24, 2021 · Cloud Computing

Key Considerations for Designing Cloud Applications: Scalability, Availability, Manageability, and Feasibility

The article outlines four essential cloud‑application design dimensions—scalability, availability, manageability, and feasibility—providing discussion points and questions for each to guide stakeholders toward robust, cost‑effective, and secure cloud solutions through comprehensive evaluation of capacity, platform constraints, load handling, SLA commitments, disaster recovery, performance tuning, and security considerations.

availabilitycloud computingfeasibility
0 likes · 12 min read
Key Considerations for Designing Cloud Applications: Scalability, Availability, Manageability, and Feasibility
Selected Java Interview Questions
Selected Java Interview Questions
Jun 6, 2021 · Fundamentals

Understanding the CAP Theorem: Consistency, Availability, and Partition Tolerance

The article explains the origins of the CAP theorem, defines its three core properties, illustrates why a distributed system can satisfy at most two of them, discusses practical workarounds such as eventual consistency, and provides references for further study.

CAP theoremDistributed Systemsavailability
0 likes · 5 min read
Understanding the CAP Theorem: Consistency, Availability, and Partition Tolerance
Wukong Talks Architecture
Wukong Talks Architecture
Dec 30, 2020 · Fundamentals

Understanding CAP, ACID, and BASE Theories Through the Metaphor of Tai Chi and Distributed Systems

This article uses the story of Tai Chi from the novel *The Heaven Sword and Dragon Saber* to explain the CAP theorem, ACID properties, BASE theory, and two‑phase commit in distributed systems, illustrating how consistency, availability, and partition tolerance correspond to the hard and soft aspects of Tai Chi.

ACIDBASECAP
0 likes · 14 min read
Understanding CAP, ACID, and BASE Theories Through the Metaphor of Tai Chi and Distributed Systems
Selected Java Interview Questions
Selected Java Interview Questions
Dec 28, 2020 · Backend Development

Eureka vs Zookeeper: AP vs CP Trade‑offs in Service Registry Design

The article compares Eureka and Zookeeper as service registry solutions, explaining how Eureka follows an AP model with high availability and eventual consistency, while Zookeeper adopts a CP model prioritizing strong consistency, and discusses their suitable scenarios, limitations, and design considerations for distributed systems.

CAP theoremDistributed SystemsEureka
0 likes · 10 min read
Eureka vs Zookeeper: AP vs CP Trade‑offs in Service Registry Design