Tagged articles
12 articles
Page 1 of 1
JD Cloud Developers
JD Cloud Developers
Jul 16, 2025 · Databases

How JD Ads Cut Storage Costs 87% with Apache Doris Hot‑Cold Tiering

This article details JD Advertising's journey from a 1 PB Apache Doris data lake to a multi‑level hot‑cold tiering architecture, describing two tiering strategies, the performance and schema‑change challenges faced during the upgrade to Doris 2.0, and the optimizations that reduced storage costs by about 87% while boosting query throughput.

Apache DorisSchema Changecold data
0 likes · 19 min read
How JD Ads Cut Storage Costs 87% with Apache Doris Hot‑Cold Tiering
Baidu Intelligent Cloud Tech Hub
Baidu Intelligent Cloud Tech Hub
Sep 25, 2024 · Big Data

How Cold‑Hot Data Separation Boosts Cost Efficiency in Baidu Palo for Apache Doris

This article explains the principles, configuration steps, monitoring metrics, leader selection, data migration granularity, compaction, invalid data cleanup, and cache mechanisms of cold‑hot data separation in Baidu Intelligent Cloud's Palo for Apache Doris, illustrating how tiered storage reduces costs while maintaining query performance.

Apache DorisData TieringPalo
0 likes · 21 min read
How Cold‑Hot Data Separation Boosts Cost Efficiency in Baidu Palo for Apache Doris
DataFunTalk
DataFunTalk
Sep 22, 2023 · Big Data

Design and Practice of Baidu's Tape Library Storage Architecture Based on the Aries Cloud Storage System

This article presents a comprehensive overview of Baidu Intelligent Cloud's tape‑library solution, detailing tape and tape‑library fundamentals, the Aries cloud storage stack, data and access models, the end‑to‑end data flow, key architectural design choices, implementation details, and a real‑world case study demonstrating large‑scale cold‑data storage, backup, and retrieval performance.

ariescold datadata archiving
0 likes · 28 min read
Design and Practice of Baidu's Tape Library Storage Architecture Based on the Aries Cloud Storage System
Code Ape Tech Column
Code Ape Tech Column
Sep 7, 2023 · Databases

Hot and Cold Data Separation: Concepts, Scenarios, and Implementation Methods

The article explains the principle of hot‑cold data separation, when it should be applied, how to distinguish hot versus cold data, and three practical implementation approaches—code modification, binlog listening, and scheduled scanning—to improve database performance and maintain consistency.

Backend ArchitectureData Lifecyclecold data
0 likes · 9 min read
Hot and Cold Data Separation: Concepts, Scenarios, and Implementation Methods
Baidu Intelligent Cloud Tech Hub
Baidu Intelligent Cloud Tech Hub
Sep 4, 2023 · Big Data

How Baidu’s Aries Cloud Storage Leverages Tape Libraries for Massive Cold Data Archiving

This article explains Baidu Intelligent Cloud’s tape‑library based cold‑data storage architecture, covering tape media basics, the Aries cloud storage system, its modular design, data flow, write and retrieval processes, and a real‑world deployment case that demonstrates cost‑effective petabyte‑scale archival.

ariescloud storagecold data
0 likes · 31 min read
How Baidu’s Aries Cloud Storage Leverages Tape Libraries for Massive Cold Data Archiving
Tencent Cloud Developer
Tencent Cloud Developer
Aug 31, 2023 · Cloud Computing

Tape Storage Technology: Enterprise Deep Archive and the Berg Cold‑Data Engine

Magnetic tape, once the music‑distribution workhorse, remains essential for enterprise deep‑archive thanks to its low cost, high capacity, and durability, with LTO and IBM 3592 cartridges housed in large libraries, while cloud object‑storage deep‑archive tiers and Tencent’s Berg cold‑data engine provide API‑driven ingestion, retrieval, erasure‑coding, and fault‑tolerant management for truly cold workloads that tolerate hours‑long latency.

Berg enginecloud storagecold data
0 likes · 27 min read
Tape Storage Technology: Enterprise Deep Archive and the Berg Cold‑Data Engine
Code Ape Tech Column
Code Ape Tech Column
Jan 24, 2022 · Databases

Hot and Cold Data Separation: Concepts, Scenarios, and Implementation Methods

The article explains the hot‑cold data separation pattern, describing its purpose, when to use it, how to distinguish hot versus cold data, and three practical implementation approaches—code modification, binlog listening, and scheduled scanning—to improve performance and maintain data consistency in large‑scale systems.

Data Lifecyclecold datadatabase partitioning
0 likes · 10 min read
Hot and Cold Data Separation: Concepts, Scenarios, and Implementation Methods
Alibaba Cloud Developer
Alibaba Cloud Developer
Jan 12, 2022 · Databases

How to Slash Cloud Data Warehouse Costs with ADB PG Disk Optimization

This article explains how enterprises can dramatically reduce cloud‑native data‑warehouse expenses by understanding ADB PG/Greenplum architecture, applying disk‑reservation and lock‑write safeguards, and implementing practical optimizations such as table compression, hot‑cold tiering, vacuuming, redundant‑index cleanup, replication conversion, and isolated temporary‑table spaces.

ADB PGCost reductionGreenplum
0 likes · 25 min read
How to Slash Cloud Data Warehouse Costs with ADB PG Disk Optimization
Big Data Technology & Architecture
Big Data Technology & Architecture
Jun 16, 2020 · Big Data

Hot and Cold Data Separation in Big Data Systems

The article explains the concept of hot and cold data, why separating them reduces cost, and presents heterogeneous and homogeneous architectural solutions—including Elasticsearch, HBase, AWS S3, and cloud‑based UltraWarm—illustrated with network‑behavior and e‑commerce order system case studies.

AWS S3Big Data ArchitectureData Lifecycle
0 likes · 11 min read
Hot and Cold Data Separation in Big Data Systems