Tag

data placement

0 views collected around this technical thread.

DataFunTalk
DataFunTalk
Jul 8, 2021 · Big Data

Design and Evolution of ByteDance's Multi‑Datacenter HDFS Architecture

This article explains how ByteDance extended the Apache HDFS architecture with a multi‑datacenter design, introducing components such as DanceNN, NNProxy, and BookKeeper to achieve scalable storage, cross‑datacenter data placement, and rack‑level disaster recovery for petabyte‑scale workloads.

ByteDanceHDFSbig data storage
0 likes · 13 min read
Design and Evolution of ByteDance's Multi‑Datacenter HDFS Architecture
360 Zhihui Cloud Developer
360 Zhihui Cloud Developer
Apr 8, 2018 · Operations

How DPRD Improves Data Placement Over CRUSH in Distributed Storage

This article introduces the DPRD hierarchical data placement strategy, explains its roots in the CRUSH algorithm, highlights CRUSH's migration inefficiencies, and details how DPRD achieves near‑theoretical replica movement and balanced distribution during both expansion and shrinkage of a distributed storage system.

CRUSH algorithmCephDPRD strategy
0 likes · 11 min read
How DPRD Improves Data Placement Over CRUSH in Distributed Storage