Tagged articles
3 articles
DataFunTalk
Jul 8, 2021 · Big Data

Design and Evolution of ByteDance's Multi‑Datacenter HDFS Architecture

This article explains how ByteDance extended the Apache HDFS architecture with a multi‑datacenter design, introducing components such as DanceNN, NNProxy, and BookKeeper to achieve scalable storage, cross‑datacenter data placement, and rack‑level disaster recovery for petabyte‑scale workloads.

ByteDance · HDFS · big data storage
13 min read
360 Zhihui Cloud Developer
Apr 8, 2018 · Operations

How DPRD Improves Data Placement Over CRUSH in Distributed Storage

This article introduces the DPRD hierarchical data placement strategy, explains its roots in the CRUSH algorithm, highlights CRUSH's migration inefficiencies, and details how DPRD keeps replica movement close to the theoretical minimum while maintaining a balanced distribution when a distributed storage system expands or shrinks.

CRUSH algorithm · Ceph · DPRD strategy
11 min read
21CTO
Aug 24, 2015 · Databases

Mastering Distributed Consistency: Strategies Behind NoSQL Replication

This article systematically explores the distributed characteristics of NoSQL databases, covering consistency trade‑offs, replication techniques, anti‑entropy protocols, data placement strategies, failure detection, and coordinator election, while illustrating each concept with diagrams and practical examples.

Anti-entropy · Consistency · Distributed Systems
32 min read