Tagged articles
20 articles
Shuge Unlimited
Apr 29, 2026 · Databases

Milvus Storage Tuning in Practice: 25× Query Speedup and Three Tricks to Cut Memory Usage by Half

This article walks through Milvus 2.3‑2.6.x storage optimizations—Mmap, tiered storage, and clustering compaction—explaining their principles, configuration hierarchy, benchmark results, and concrete deployment templates that together can boost query performance up to 25‑fold while halving memory consumption.

Milvus · Storage Optimization · clustering compaction
0 likes · 24 min read
JD Cloud Developers
Jul 16, 2025 · Databases

How JD Ads Cut Storage Costs 87% with Apache Doris Hot‑Cold Tiering

This article details JD Advertising's journey from a 1 PB Apache Doris data lake to a multi‑level hot‑cold tiering architecture, describing two tiering strategies, the performance and schema‑change challenges faced during the upgrade to Doris 2.0, and the optimizations that reduced storage costs by about 87% while boosting query throughput.

Apache Doris · Schema Change · cold data
0 likes · 19 min read
JD Retail Technology
Oct 29, 2024 · Big Data

JD Unified Storage Practice: Cross‑Region and Tiered Storage on HDFS

This article details JD's large‑scale HDFS unified storage implementation, covering cross‑region storage challenges, topology design, asynchronous block replication, flow‑control mechanisms, tiered storage strategies, automatic hot‑cold data migration, and the resulting performance and cost improvements for big‑data workloads.

Big Data · Cross-Region Storage · Data Management
0 likes · 20 min read
DataFunSummit
Oct 4, 2024 · Big Data

JD Retail HDFS Unified Storage: Cross‑Region and Tiered Storage Practices

This article presents JD Retail's large‑scale HDFS deployment, detailing its unified storage architecture, cross‑region data replication challenges and solutions, tiered storage strategies for hot, warm and cold data, and the operational modules that together improve performance, reliability and cost efficiency in a big‑data environment.

Big Data · Cross-Region Storage · Distributed File System
0 likes · 21 min read
dbaplus Community
Jul 2, 2024 · Cloud Native

How Xiaohongshu Cut Kafka Storage Costs by 60% with a Cloud‑Native Tiered Architecture

Facing exploding Kafka scale, Xiaohongshu’s data‑storage team adopted a cloud‑native design that introduces tiered hot‑cold storage, containerization, and a custom load‑balancing service, achieving dramatic storage‑cost reductions, minute‑level cluster migrations, high‑performance data access, and automated resource scheduling.

autoscaling · cloud-native · containerization
0 likes · 20 min read
Xiaohongshu Tech REDtech
May 23, 2024 · Cloud Native

Cloud-Native Architecture and Tiered Storage for Xiaohongshu Kafka: Cost Reduction, Elastic Migration, and Performance Optimization

Xiaohongshu's big-data storage team built a cloud-native architecture with tiered storage, containerized Kafka, and a custom load balancer, cutting storage costs by up to 60%, enabling minute‑level elastic migration, improving scaling efficiency tenfold, and boosting performance through caching and batch reads.

Cost Optimization · Kafka · elastic scaling
0 likes · 20 min read
Tencent Cloud Middleware
Mar 13, 2024 · Cloud Native

What’s New in RocketMQ 5.x? A Deep Dive into Cloud‑Native Features, Proxy, Pop, and Tiered Storage

This article explores Apache RocketMQ 5.x’s new cloud‑native capabilities—including the Proxy component, gRPC client, Pop consumption model, timer‑based delayed messages, tiered storage, distributed rate‑limiting, and containerized deployment on Tencent Cloud—while outlining architecture changes, practical usage patterns, and future directions.

RocketMQ · containerization · distributed rate limiting
0 likes · 32 min read
DataFunSummit
Feb 6, 2024 · Big Data

Exploring ByteDance's EB‑Scale HDFS: Architecture, Multi‑Datacenter Challenges, Tiered Storage, and Data Protection Practices

This article presents an in‑depth overview of ByteDance's EB‑scale HDFS, covering its new features, multi‑datacenter architecture, tiered storage implementation, data management services, capacity and fault‑tolerance strategies, as well as practical data‑protection mechanisms and related Q&A.

Big Data · Data Protection · HDFS
0 likes · 22 min read
dbaplus Community
Dec 20, 2023 · Operations

Scaling Kafka to 1000+ Nodes: Governance, Auto‑Balancing & Tiered Storage

This article outlines how a large‑scale Kafka deployment of over a thousand machines across dozens of clusters was engineered for stability and efficiency through a custom Guardian controller that adds partition‑level throttling, automatic balancing, multi‑tenant isolation, cross‑IDC management, tiered storage, audit capabilities, and fully automated operational workflows.

Cluster Management · Kafka · Operations
0 likes · 21 min read
Programmer DD
Jun 7, 2023 · Cloud Native

Why Apache Pulsar Is the Next‑Gen Cloud‑Native Streaming Platform

This article explains how Apache Pulsar combines messaging, storage, and lightweight function computing into a cloud‑native streaming platform, detailing its architecture, storage‑compute separation, tiered storage, pluggable protocols, reliability guarantees, and rich ecosystem compared with traditional queues and Kafka.

Apache Pulsar · Cloud Native · Data Reliability
0 likes · 10 min read
Wukong Talks Architecture
Apr 15, 2023 · Backend Development

Design and Implementation of RocketMQ Tiered Storage

The article explains how RocketMQ 5.1.0 introduces a tiered storage module that offloads messages to cheaper media, describes its design, architecture layers, quick‑start configuration, upload and read mechanisms, prefetch cache, fault recovery, current development plans, and remaining challenges.

Message Queue · RocketMQ · tiered storage
0 likes · 13 min read
DataFunTalk
Jun 5, 2022 · Big Data

JD Big Data Platform: Cross‑Region and Tiered Storage Architecture and Practices

This article presents JD's large‑scale big‑data platform, detailing its overall architecture, the challenges of cross‑region storage, the design of a unified cross‑domain data synchronization mechanism, and the implementation of tiered storage to improve performance, cost efficiency, and data reliability across multi‑datacenter clusters.

Big Data · Data Platform · HDFS
0 likes · 15 min read
dbaplus Community
Mar 17, 2021 · Big Data

How We Cut PBs of Waste and Optimized HDFS with Tiered Storage and Cloud Migration

This article summarizes a three‑part technical talk covering cost governance for offline Hadoop clusters, a large‑scale data‑center migration with architecture upgrades, and a tiered storage strategy using EC and COS to reduce storage costs and improve performance in a cloud‑native big‑data environment.

Big Data Migration · COS · Cloud Native
0 likes · 10 min read
ITPUB
Mar 28, 2019 · Big Data

Why Pravega Matters: Native Stream Storage for Low‑Latency, Exactly‑Once Data Pipelines

Pravega, Dell’s native stream storage project, addresses the challenges of modern low‑latency, exactly‑once stream processing by combining tiered storage, Apache BookKeeper, and seamless Flink integration, offering a unified solution that reduces development, storage, and operational costs compared to traditional message systems like Kafka.

Apache Flink · Exactly-Once · Kafka Comparison
0 likes · 10 min read
Architecture Digest
Mar 31, 2016 · Operations

Why Hyper‑Converged Architecture Improves I/O Performance: From Traditional SAN to Tiered Storage

The article explains how traditional SAN storage creates CPU‑I/O bottlenecks, how Google’s distributed file system inspired hyper‑converged designs that fuse compute and storage, and why tiered storage with SSD and HDD offers scalable, high‑performance infrastructure for modern data‑center workloads.

Data center · Distributed File System · I/O performance
0 likes · 11 min read
MaGe Linux Operations
Apr 7, 2015 · Big Data

How Hadoop’s Tiered Storage Optimizes Data Based on Temperature

This article explains Hadoop’s tiered storage concept, describing how data is classified by temperature—hot, warm, cold, frozen—and automatically moved across disk and archive layers to optimize cost and performance, with examples from Hadoop versions and eBay’s large‑scale deployment.

Big Data · Data Temperature · HDFS
0 likes · 9 min read