Tagged articles
20 articles
360 Zhihui Cloud Developer
Jun 6, 2025 · Fundamentals

How Erasure Coding Cuts Storage Costs in Ozone: A Deep Dive

This article explains how Erasure Coding (EC) improves data reliability and dramatically reduces storage overhead in Ozone by leveraging hot‑cold data characteristics, intelligent tiering, dynamic EC ratios, and repair throttling, while also discussing performance trade‑offs and limitations.

Data Reliability · Ozone · Storage Optimization
0 likes · 9 min read
Architects' Tech Alliance
Nov 3, 2024 · Industry Insights

Exploring the Future of Storage Class Memory: Technologies, Challenges, and Research Directions

This article introduces a newly released comprehensive guide on SSD flash technology while providing an in‑depth analysis of emerging Storage Class Memory (SCM) technologies—such as PCM, ReRAM, MRAM, and NRAM—detailing their principles, current research challenges, and potential system‑level innovations.

Data Reliability · Hardware Prototyping · Memory Technologies
0 likes · 18 min read
Didi Tech
Aug 31, 2023 · Big Data

Data Stability Construction and Fault Governance Practices at Didi Customer Service

Didi’s multi‑year data‑stability program for its customer‑service platform progressed through three stages: fault‑centered engineering, business‑aligned cross‑team work, and capability normalization. It instituted pre‑, mid‑ and post‑fault safeguards, clear ownership, automated alerts, and repair tools, which cut fault count by 42% and more than doubled mean‑time‑to‑repair while boosting team communication and satisfaction.

Automation · Data Reliability · Data Warehouse
0 likes · 16 min read
Programmer DD
Jun 7, 2023 · Cloud Native

Why Apache Pulsar Is the Next‑Gen Cloud‑Native Streaming Platform

This article explains how Apache Pulsar combines messaging, storage, and lightweight function computing into a cloud‑native streaming platform, detailing its architecture, storage‑compute separation, tiered storage, pluggable protocols, reliability guarantees, and rich ecosystem compared with traditional queues and Kafka.

Apache Pulsar · Cloud Native · Data Reliability
0 likes · 10 min read
vivo Internet Technology
Apr 5, 2023 · Databases

Understanding MySQL Replication: Principles, Mechanisms, and Practical Applications

MySQL replication copies data changes from a primary server to one or more replicas using binlog events—supporting statement, row, or mixed formats and GTID positioning—to provide real‑time backup, read‑write separation, high‑availability failover, and integration pipelines via asynchronous, semi‑synchronous, or centralized binlog services.

Binlog · Data Reliability · GTID
0 likes · 30 min read
Bilibili Tech
Mar 14, 2023 · Big Data

Bilibili HDFS Erasure Coding Strategy and Implementation

Bilibili reduced petabyte‑scale storage costs by back‑porting erasure‑coding patches to its HDFS 2.8.4 cluster, deploying a parallel EC‑enabled cluster, adding a data‑proxy service, intelligent routing and block‑checking, and automating cold‑data migration, while noting write overhead and planning native acceleration.

Big Data · Data Reliability · Distributed Systems
0 likes · 14 min read
Top Architect
Apr 23, 2022 · Big Data

Ensuring No Duplicate and No Loss in Baidu Log Middle Platform: Architecture, Challenges, and Solutions

This article explains the design, implementation, and future plans of Baidu's log middle platform, detailing its lifecycle management, service architecture, data reliability goals of eliminating duplication and loss, and the technical measures taken across SDKs, servers, and streaming pipelines to achieve near‑100% data integrity.

Backend Architecture · Big Data · Data Reliability
0 likes · 15 min read
Architect
Apr 18, 2022 · Big Data

Ensuring Data Accuracy and Reliability in Baidu's Log Middle Platform

This article describes Baidu's log middle platform architecture, its data lifecycle management, integration status, terminology, service overview, and the core challenges of ensuring data accuracy, along with the optimizations implemented for persistent storage, service decomposition, and SDK reporting to achieve near‑100% reliability with no duplication and no loss.

Backend Architecture · Big Data · Data Reliability
0 likes · 15 min read
High Availability Architecture
Apr 11, 2022 · Big Data

Ensuring Data Accuracy and Reliability in Baidu Log Platform: Architecture, Challenges, and Solutions

This article introduces the current state of Baidu's log platform, explains its lifecycle from data collection to downstream applications, analyzes the challenges of achieving near‑zero duplication and loss, and presents architectural optimizations and best‑practice recommendations to improve data stability and accuracy across the system.

Big Data · Data Reliability · System Architecture
0 likes · 19 min read
JavaEdge
Apr 6, 2022 · Big Data

When Does Kafka Lose Data? Proven Strategies to Prevent Message Loss

This article explains Kafka's message delivery semantics, identifies the exact scenarios where data can be lost in producer, broker, and consumer components, and provides concrete configuration and coding practices to ensure reliable, at‑least‑once or exactly‑once delivery in production environments.

Broker · Consumer · Data Reliability
0 likes · 18 min read
vivo Internet Technology
Jul 28, 2021 · Industry Insights

How to Quantify Data Reliability in Distributed Storage Systems

This article analyzes the quantitative model for data reliability in distributed storage, covering factors such as disk count, replication factor, recovery time, annualized failure rate, and copyset configuration, and derives formulas to estimate yearly data loss probability for both replica and erasure‑coding schemes.

AFR · Data Reliability · copyset
0 likes · 16 min read
UCloud Tech
Apr 20, 2021 · Fundamentals

How UCloud Cuts Archive Storage Costs by 80% with SMR Disks and Smart IO Scheduling

UCloud’s US3 archive storage leverages high‑density SMR drives combined with JBOD architecture and a custom IO‑scheduling algorithm to slash hardware CAPEX by up to 80%, reduce OPEX electricity costs, and maintain data reliability through embedded metadata and dual‑head redundancy.

Cost Optimization · Data Reliability · IO scheduling
0 likes · 11 min read
DataFunTalk
Dec 23, 2019 · Databases

Cassandra Deployment and Optimization at 360 Cloud Storage

This article details how 360 adopted Cassandra for its cloud drive, describing Cassandra’s decentralized architecture, the reasons for its selection over HBase, large‑scale deployment challenges, performance optimizations, reliability improvements, disk utilization techniques, and the evolution of the system from 2010 to present.

Big Data · Data Reliability · Scalability
0 likes · 15 min read
AntTech
Aug 6, 2019 · Databases

How OceanBase Guarantees Data Reliability and Service High‑Availability

The article explains how OceanBase, a distributed enterprise‑grade database, achieves strong data reliability and rapid service recovery on ordinary PC servers by combining Paxos‑based consensus, enhanced redo‑log verification, periodic checkpoint checks, and fine‑grained fail‑over mechanisms, surpassing traditional hardware‑dependent databases.

Data Reliability · OceanBase · Paxos
0 likes · 17 min read
Alibaba Cloud Developer
Jul 30, 2019 · Big Data

How to Build a Systematic Data Quality Model for Big Data Testing

This article presents a comprehensive data quality model derived from ISO 9126, maps its characteristics to data testing, outlines practical testing methods and tool requirements, and demonstrates how to integrate quality checks into the data development lifecycle for reliable, efficient big‑data pipelines.

Data Quality · Data Reliability · ISO 9126
0 likes · 28 min read
Big Data Technology & Architecture
Jul 17, 2019 · Backend Development

Preventing Message Loss in RabbitMQ and Kafka: Principles, Scenarios, and Solutions

This article explains the core principles of message queues, outlines common data‑loss scenarios for RabbitMQ and Kafka, and provides practical techniques—such as transactions, confirm mode, persistence settings, and replication configurations—to ensure reliable, loss‑free messaging.

Backend Development · Data Reliability · Kafka
0 likes · 9 min read
Tencent Cloud Developer
Jun 4, 2018 · Cloud Computing

In-depth Analysis and Practice of Tencent Cloud EB-level Object Storage Architecture

At the 2018 Tencent Cloud + Future summit, Liu Jinming detailed Tencent Cloud COS’s three‑tier EB‑level object storage architecture, covering the network, application, and data layers, and highlighted its 99.95% availability, eleven‑nines durability, end‑to‑end encryption, scalable performance, tiered pricing, and real‑world media and security use cases.

Data Reliability · Performance Optimization · Tencent Cloud
0 likes · 10 min read
Tencent TDS Service
Apr 27, 2017 · Databases

Cutting WeChat SQLite Corruption in Half: Strategies and Lessons

Facing a 0.02% SQLite corruption rate that threatened years of chat history, the WeChat mobile team identified three main causes—insufficient space, power loss, and sync failures—and implemented space management, full sync settings, and master‑table backups, halving damage and doubling repair success.

Backup · Data Reliability · Database Optimization
0 likes · 9 min read