Tagged articles
93 articles
JD Retail Technology
Jan 13, 2026 · Backend Development

Deep Dive into Kafka, RocketMQ, and JMQ Storage Architectures

This article compares the storage models, data organization, indexing, read/write processes, and performance trade‑offs of three major message queues—Kafka, RocketMQ, and JMQ—providing detailed technical insights for architects and engineers making storage‑related design decisions.

Backend Engineering · JMQ · Kafka
0 likes · 16 min read
Architects' Tech Alliance
Nov 30, 2025 · Cloud Computing

How DPU Redefines Data Center Storage for AI and Cloud Workloads

This article analyzes the technical principles, architectural innovations, and real‑world scenarios of Data Processing Units (DPUs), showing how they resolve storage‑CPU mismatches, eliminate excessive east‑west traffic, and accelerate failure recovery, thereby becoming a core infrastructure for AI and cloud computing.

AI · DPU · Data center
0 likes · 15 min read
DataFunSummit
Oct 29, 2025 · Big Data

How Huolala Scaled to 40PB: Inside Their Evolving Big Data Storage Architecture

Huolala, founded in 2013, runs a massive cross‑cloud hybrid big‑data storage platform of over 40 PB across 3,000+ machines, evolving through four online‑storage phases, robust HA design, performance‑cost optimizations, AI vector storage, and a cost‑governance system that saved more than half of its storage expenses.

AI vector storage · Big Data · Cost Optimization
0 likes · 18 min read
Architect's Guide
Sep 14, 2025 · Databases

Why Database Indexes Speed Up Queries: From Storage Basics to Binary Search

This article explains how databases store data on various storage devices, why indexes dramatically improve query performance through sorted structures and binary search, and outlines practical SQL optimization techniques while warning about the trade‑offs of excessive indexing.

Binary Search · Clustered Index · Database Index
0 likes · 11 min read
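
As a rough illustration of the idea in the summary above (a sorted index searched by binary search instead of a full scan), here is a minimal Python sketch. The `rows` table, the `lookup` helper, and the sample data are all hypothetical and not taken from the article; real database indexes are usually on-disk B+ trees rather than in-memory lists.

```python
import bisect

# Hypothetical table: rows stored in insertion order, not sorted by key.
rows = [(42, "carol"), (7, "alice"), (19, "bob"), (88, "dave")]

# Build a sorted index of (key, row_position) pairs, plus a parallel key list.
index = sorted((key, pos) for pos, (key, _) in enumerate(rows))
keys = [key for key, _ in index]


def lookup(key):
    """Find a row by key with binary search over the sorted index."""
    i = bisect.bisect_left(keys, key)
    if i < len(keys) and keys[i] == key:
        return rows[index[i][1]]
    return None  # key is not present


print(lookup(19))
print(lookup(5))
```

The index costs extra space and must be updated on every write, which is the trade-off the summary refers to when it warns against excessive indexing.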
Efficient Ops
Sep 9, 2025 · Fundamentals

Inside 3FS: How Distributed File Systems Hide Complexity and Scale

3FS is an open‑source distributed file system that abstracts multiple machines into a single namespace, offering massive scalability, fault tolerance, and high throughput through components like Meta, Mgmtd, Storage, and Client, and leveraging the CRAQ protocol for strong consistency and efficient reads and writes.

3FS · CRAQ · Distributed File System
0 likes · 12 min read
Architecture & Thinking
Aug 26, 2025 · Backend Development

How RocketMQ Achieves Million‑TPS with Sequential Writes and Multi‑Level Indexes

RocketMQ tackles the high‑performance, high‑reliability challenges of distributed messaging by combining sequential disk writes, memory caching, and multi‑level indexing, detailing its storage logic, core structures, zero‑copy techniques, replication modes, static topic scaling, and practical tuning guidelines for optimal throughput.

Message Queue · RocketMQ · storage architecture
0 likes · 8 min read
Tencent Cloud Middleware
Jul 11, 2025 · Backend Development

Inside RabbitMQ Classic Queue: How Its Storage Architecture Impacts Performance

This article provides a detailed, step‑by‑step analysis of RabbitMQ's classic queue storage architecture, covering directory layout, index and data file formats, write and read workflows, file compaction, and practical operational tips such as publisher confirms and manual acknowledgments to improve reliability and throughput.

Classic Queue · RabbitMQ · message broker
0 likes · 13 min read

Why Data Replication Matters: Architectures, Formats, and Consistency Models

This article explores the principles of data replication, comparing shared memory, shared disk, and non‑shared storage architectures, detailing replication formats, consistency challenges, and various replication strategies such as synchronous, asynchronous, semi‑synchronous, and majority‑based approaches, helping engineers choose the right trade‑offs.

Asynchronous Replication · Consistency · data replication
0 likes · 12 min read
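
The replication strategies named in the summary above differ mainly in how many acknowledgements a write waits for before it is reported as committed. The sketch below is one possible way to express that; the function names and the exact rule used for each mode (in particular, semi-synchronous taken to mean "leader plus at least one replica") are assumptions for illustration, not definitions from the article.

```python
def acks_required(mode, replicas):
    """Number of acknowledgements (leader included) a write waits for.

    `replicas` is the total number of copies, leader included.
    """
    if mode == "synchronous":
        return replicas              # every copy must confirm
    if mode == "asynchronous":
        return 1                     # only the leader's local write
    if mode == "semi-synchronous":
        return min(2, replicas)      # assumption: leader plus one replica
    if mode == "majority":
        return replicas // 2 + 1     # strict majority of all copies
    raise ValueError(f"unknown mode: {mode}")


def is_committed(mode, replicas, acks):
    """True once enough acknowledgements have arrived for this mode."""
    return acks >= acks_required(mode, replicas)


for mode in ("synchronous", "asynchronous", "semi-synchronous", "majority"):
    print(mode, acks_required(mode, 5))
```

Fewer required acknowledgements means lower write latency but a larger window in which a leader failure can lose data.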
Liangxu Linux
May 22, 2025 · Cloud Computing

Master Ceph: Step‑by‑Step Guide to Deploy a Scalable Distributed Storage Cluster

Learn how to design, configure, and deploy a Ceph distributed storage cluster using ceph‑deploy, covering storage fundamentals, Ceph architecture, component roles, network planning, OS preparation, mon, mgr, osd setup, and dashboard activation, with detailed commands and best‑practice recommendations for production environments.

Ceph · Dashboard · distributed storage
0 likes · 28 min read
Architects' Tech Alliance
Apr 26, 2025 · Industry Insights

Why Distributed Storage Is the Next Backbone of the Digital Economy

This article analyzes the evolution of distributed storage—from traditional compute‑storage separation to edge‑centric, AI‑enabled architectures—covering service models, key technologies such as CXL and erasure coding, reliability strategies, performance optimizations, vendor landscapes, and emerging green and intelligent trends.

AI storage · CXL · NVMe-oF
0 likes · 13 min read
Linux Cloud Computing Practice
Apr 10, 2025 · Cloud Computing

Unlock Scalable, Reliable Storage: A Complete Guide to Deploying Ceph

This article provides a comprehensive overview of Ceph distributed storage, covering storage fundamentals, Ceph architecture, advantages, version lifecycle, and step‑by‑step deployment using ceph‑deploy, including environment preparation, monitor and OSD setup, manager configuration, and dashboard activation.

Ceph · Cluster Management · Dashboard
0 likes · 28 min read
Architects' Tech Alliance
Mar 25, 2025 · Industry Insights

How Near‑Memory Computing Can Power Edge LLMs: A 2025 Storage Framework

The article analyzes the challenges of deploying large language models on cloud servers—such as latency, security, and constant connectivity—and explains how near‑memory computing architectures (PNM, PIM, CIM) can integrate storage and processing to enable efficient, high‑performance edge AI deployments, outlining the trade‑offs of each approach.

Near-Memory Computing · artificial intelligence · edge AI
0 likes · 5 min read
dbaplus Community
Jan 21, 2025 · Databases

How Bilibili Scaled Its Comment System with Multi‑Level Storage and Automatic Failover

Bilibili’s comment service, a critical component for user interaction, faces massive read‑write traffic that can overwhelm TiDB, so the team built a multi‑level storage architecture using Redis sorted‑sets for indexes and a custom Taishan KV store, adding automatic degradation, consistency mechanisms, and hedging policies to ensure high availability and performance.

Comment System · Data Consistency · failover
0 likes · 12 min read
High Availability Architecture
Jan 17, 2025 · Databases

Design and Implementation of a Multi‑Level Comment Storage System for Bilibili

This article presents a comprehensive design of Bilibili's comment service architecture, detailing the transition from TiDB to a multi‑level storage system based on Taishan KV, the data models, consistency mechanisms, retry and versioning strategies, and a hedging‑based degradation policy to ensure high availability under heavy traffic.

Comment System · Taishan KV · TiDB
0 likes · 13 min read
Bilibili Tech
Dec 13, 2024 · Databases

Design and Implementation of a Multi-Level Storage Architecture for Bilibili Comment Service

The paper proposes a multi‑level storage architecture for Bilibili’s comment service that replaces TiDB with a custom KV store (Taishan) and Redis caching, introduces unstructured indexes, CAS‑based consistency, real‑time and offline reconciliation, and a hedged degradation strategy to boost reliability, read throughput, and scalability during traffic spikes.

Comment System · Data Consistency · NoSQL
0 likes · 13 min read
Baidu Geek Talk
Nov 6, 2024 · Cloud Computing

Baidu Canghai Storage Unified Technology Base: Architecture and Evolution of Metadata, Namespace, and Data Layers

Baidu’s Canghai Storage unifies metadata, hierarchical namespace, and data layers into a Meta‑Aware, three‑generation architecture that scales to trillions of metadata items and zettabyte‑scale data, using a distributed transactional KV store, single‑machine‑distributed namespace, and online erasure‑coding micro‑services to deliver high performance, low cost, and seamless scalability.

Big Data · Distributed Systems · NewSQL
0 likes · 18 min read
Architects' Tech Alliance
Sep 15, 2024 · Industry Insights

How to Build a Super‑Scale AI Cluster: From GPU Power to DPU‑Driven Architecture

This article analyzes the technical roadmap for upgrading AI super‑large GPU clusters to support trillion‑parameter multimodal models, covering single‑chip performance, super‑node scaling, DPU‑based compute fusion, energy‑efficient designs, converged storage, high‑throughput networking, and fault‑tolerant checkpoint strategies.

AI compute · DPU · GPU clusters
0 likes · 18 min read
DaTaobao Tech
Aug 9, 2024 · Cloud Native

Deep Dive into RocketMQ: Storage Architecture and Consumer Processing Flow

RocketMQ stores messages in a sequential CommitLog while using a ConsumerQueue index for fast random access, and employs two looping threads—RebalanceService for load‑balancing queue assignments and PullMessageService for continuously pulling messages and dispatching them to user callbacks via a thread‑pooled consumer pipeline.

Consumer · Distributed Systems · Java
0 likes · 12 min read
Architect
Aug 3, 2024 · Big Data

Kafka Storage Architecture Design: Deep Analysis and Implementation

This article thoroughly examines Kafka's storage architecture, analyzing its design motivations, storage mechanisms, log formats, partitioning, indexing, cleanup strategies, and performance optimizations such as sequential log writes, sparse indexing, page cache, and zero‑copy, providing insights applicable to other storage systems.

Kafka · Log Segments · Sparse Index
0 likes · 22 min read
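
To make the sparse-indexing idea from the summary above concrete, the following Python sketch indexes only every few records of a log segment and then combines a binary search over the index with a short linear scan. It is a simplified model, not Kafka's code: the in-memory `segment`, the record-count `INDEX_INTERVAL`, and the `find` helper are assumptions for illustration (Kafka itself spaces index entries by bytes written and stores file positions).

```python
import bisect

# Hypothetical log segment: (offset, payload) records in increasing offset order.
segment = [(100 + i, f"msg-{i}") for i in range(20)]

# Assumption for the sketch: add an index entry for every 4th record.
INDEX_INTERVAL = 4

# Sparse index: (offset, position of that record inside the segment).
sparse_index = [
    (segment[pos][0], pos) for pos in range(0, len(segment), INDEX_INTERVAL)
]
index_offsets = [offset for offset, _ in sparse_index]


def find(target_offset):
    """Locate a record: binary search in the sparse index, then scan forward."""
    i = bisect.bisect_right(index_offsets, target_offset) - 1
    if i < 0:
        return None  # target is before the first record of this segment
    _, pos = sparse_index[i]
    while pos < len(segment) and segment[pos][0] <= target_offset:
        if segment[pos][0] == target_offset:
            return segment[pos]
        pos += 1
    return None  # target is past the end of the segment


print(find(110))
```

The index stays small enough to keep in memory, at the cost of scanning at most one interval's worth of records per lookup.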
Architect
May 30, 2024 · Backend Development

Inside Pulsar’s Bookie: A Deep Dive into Storage Architecture and Write/Read Paths

This article dissects Pulsar’s storage layer by examining the Bookie component, detailing its architecture, the sequential write‑ahead log, ledger management, journal handling, caching mechanisms, and the complete read/write call chains with concrete code examples and performance considerations.

Distributed Systems · Java · Ledger
0 likes · 33 min read
Sanyou's Java Diary
Feb 1, 2024 · Backend Development

Message Queues Unveiled: From Decoupling to Platformization and Core Architectures

This article traces the two‑decade evolution of message queues—from early decoupling solutions like ActiveMQ, through high‑throughput designs such as Kafka, to modern platformized systems like RocketMQ and Pulsar—while explaining fundamental concepts, partitioning, and storage architectures that underpin today’s distributed messaging platforms.

Distributed Systems · Kafka · Message Queue
0 likes · 17 min read
Baidu Geek Talk
Jan 31, 2024 · Cloud Computing

Cloud Computing in the AI-Native Era: Baidu Cloud's AI-Native Architecture and Latest Product Innovations

In his 2023 keynote, Baidu Vice President Xie Guangjun unveiled an AI‑native cloud architecture featuring 7th‑gen G7 servers, Kunlun R300 and Ascend 910B instances, a gateway with X86, programmable switches and FPGA, unified storage (TafDB, Aries, BOS, CDS, PFS), Baige 3.0 AI infrastructure, an intelligent computing network, GaiaDB 4.0, BMR Spark 3.2, SugarBot natural‑language analytics, distributed edge and private cloud, and video and low‑code platforms.

Baidu Cloud · Database Innovation · Intelligent Computing
0 likes · 19 min read
DataFunTalk
Aug 30, 2023 · Big Data

Design and Implementation of Baidu Cloud Block Storage EC System for Large‑Scale Data

This article presents Baidu Cloud's block storage architecture, comparing replication and erasure‑coding fault‑tolerance methods, detailing the challenges of applying EC to mutable block data, and describing a two‑layer append‑engine solution with selective 3‑replica caching, cost‑benefit compaction, and performance optimizations for low‑cost, high‑throughput storage.

Big Data · append engine · block storage
0 likes · 14 min read
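
The contrast between replication and erasure coding in the summary above can be illustrated with the simplest possible code, a single XOR parity block. This Python sketch is not the scheme described in the article (production systems typically use Reed-Solomon style codes that tolerate several failures); the block contents and the `xor_blocks` helper are made up for the example.

```python
def xor_blocks(blocks):
    """XOR equally sized byte blocks together, byte by byte."""
    out = bytearray(len(blocks[0]))
    for block in blocks:
        for i, byte in enumerate(block):
            out[i] ^= byte
    return bytes(out)


# Three hypothetical data blocks plus one parity block (a 3+1 layout).
data = [b"AAAA", b"BBBB", b"CCCC"]
parity = xor_blocks(data)

# Simulate losing data[1] and rebuilding it from the survivors and the parity.
recovered = xor_blocks([data[0], data[2], parity])
print(recovered == data[1])

# Storage overhead: 4 blocks stored for 3 blocks of data,
# versus 9 blocks stored under 3-way replication.
print((len(data) + 1) / len(data), 3 * len(data) / len(data))
```

The lower overhead is what makes erasure coding attractive for cost, while the need to read other blocks to rebuild or update data is why mutable block storage makes it harder to apply.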
DataFunTalk
May 21, 2023 · Databases

Graph Database Storage Techniques and Practices with Galaxybase

This article introduces RDF and property graph models, explains the core goals of graph database storage, compares mainstream storage solutions such as array, linked‑list and LSM‑Tree approaches, and presents practical deployment experiences of the Galaxybase distributed graph database.

Distributed Systems · Galaxybase · Property Graph
0 likes · 23 min read
ITPUB
Oct 26, 2022 · Big Data

Why Kafka Stores Data the Way It Does: Inside Its Architecture

This article provides an in‑depth technical analysis of Kafka’s storage architecture, covering its design goals, storage mechanisms, log segment layout, sparse indexing, log cleanup policies, and the performance techniques such as sequential writes, page cache, and zero‑copy that enable high‑throughput streaming.

Big Data · Log Segments · Sparse Index
0 likes · 22 min read
Architects' Tech Alliance
Oct 15, 2022 · Fundamentals

Key Technologies of Enterprise Flash Storage Systems: Global Load Balancing, Multi‑Stream Partitioning, and End‑to‑End I/O Prioritization

The article explains core flash storage techniques—including global load balancing, multi‑stream partitioning, and end‑to‑end I/O priority—by describing Active‑Active architectures, hash‑based request distribution, cache mirroring, data classification, and latency management to improve performance, reliability, and SSD lifespan.

I/O Priority · Multi‑Stream Partition · SSD
0 likes · 9 min read
Architects' Tech Alliance
Oct 10, 2022 · Fundamentals

All‑Flash Storage System Architecture and Key Functions (Dorado Flash Product Example)

The article explains the fully interconnected architecture of an all‑flash storage system, covering redundant FRU modules, RDMA‑based high‑speed networking, intelligent disk enclosures, SSD structure, wear‑leveling, bad‑block management, data redundancy, and the differences between SAS and NVMe protocols.

All-Flash Storage · NVMe · RDMA
0 likes · 12 min read
DataFunTalk
Aug 9, 2022 · Databases

Graph Database Storage Technologies and Practices: Concepts, Core Goals, Technical Solutions, and Galaxybase Case Study

This article introduces graph database fundamentals, explains why graph databases are needed, outlines core storage goals such as index‑free adjacency, compares array, linked‑list and LSM‑tree storage schemes, and presents the design, performance advantages, and real‑world applications of the Galaxybase distributed graph database.

Big Data · Distributed Systems · Galaxybase
0 likes · 20 min read
Baidu Geek Talk
Jul 26, 2022 · Industry Insights

How Baidu’s Canghai Storage Powers High‑Performance Computing: Challenges and Solutions

This article analyzes the storage challenges of high‑performance computing—including traditional HPC, AI‑driven HPC, and high‑performance data analysis—examines Baidu’s internal practices, and presents the Canghai storage platform with its object storage, parallel file system (PFS) and RapidFS solutions that address throughput, latency, and scalability requirements.

AI training · High‑performance computing · cloud storage
0 likes · 31 min read
vivo Internet Technology
Apr 20, 2022 · Backend Development

FastDFS Overview: Principles, Architecture, Upload/Download Process, Synchronization, and Storage Management

FastDFS is a lightweight, open‑source distributed file system written in C that uses a three‑component architecture—client, tracker server for load‑balancing and discovery, and storage servers with push‑based binlog replication—to handle high‑concurrency upload/download of small to medium files, support group‑wide synchronization, optional trunk storage, Nginx anti‑leech integration, and extensible deduplication via FastDHT.

Distributed File System · FastDFS · Synchronization
0 likes · 15 min read
Su San Talks Tech
Nov 4, 2021 · Backend Development

How RocketMQ’s Storage Architecture Powers High Throughput and Low Latency

This article explains RocketMQ’s disk‑based storage design—including CommitLog, ConsumeQueue, and Index files—detailing sequential writes, memory‑mapped I/O, flexible flush strategies, and memory‑level read/write separation that together enable unlimited backlog, high throughput, and low latency messaging.

Memory Mapped Files · RocketMQ · Sequential Write
0 likes · 13 min read
IT Architects Alliance
Sep 21, 2021 · Fundamentals

Analyzing Android 6.0 Storage Architecture and Communication Design

This article examines Android 6.0’s storage architecture, detailing the roles and interactions of MountService, Vold, and the kernel, illustrating module and process structures, communication flows via sockets, and highlighting performance considerations such as thread blocking and ANR issues.

Android · MountService · Process Communication
0 likes · 14 min read
Tencent Database Technology
Sep 6, 2021 · Cloud Native

Cloud‑Native ClickHouse Architecture and Design Overview

This article presents a comprehensive design of a cloud‑native ClickHouse OLAP system, detailing its three‑layer architecture, storage‑compute separation, unified metadata management, high‑availability mechanisms, elastic scaling, cost reductions, and future enhancements for multi‑replica and MPP query support.

ClickHouse · Cloud Native · Distributed Systems
0 likes · 19 min read
21CTO
Sep 1, 2021 · Databases

How BaikalDB Redefines Cloud‑Native Distributed Databases for Modern Business Needs

This article examines the evolving data‑storage demands of large‑scale commercial advertising systems, traces the development of BaikalDB from early MySQL sharding to a heterogeneous, cloud‑native distributed database, and explains its storage, compute, and scheduling designs that deliver high reliability, low cost, and millisecond‑level performance.

BaikalDB · SQL Compatibility · advertising systems
0 likes · 27 min read
High Availability Architecture
Aug 6, 2021 · Databases

Design and Evolution of BaikalDB: A Distributed Database for Commercial Product Systems

This article examines the requirements of commercial advertising systems for data storage, traces the evolution from single‑node MySQL to BaikalDB’s cloud‑native, MySQL‑compatible distributed architecture, and details its storage, compute, and scheduling designs, highlighting key features such as Raft replication, RocksDB storage, and hybrid OLTP/OLAP support.

BaikalDB · RocksDB · SQL
0 likes · 25 min read
Baidu Geek Talk
Aug 4, 2021 · Databases

How Baidu’s BaikalDB Redefined Distributed Storage for Massive Ad Platforms

This article analyzes the evolution of Baidu's advertising data storage, detailing the business-driven requirements, the design and development of the BaikalDB distributed database, its architecture across storage, compute and scheduling layers, key features such as Raft replication and multi‑index support, and the lessons learned for building cloud‑native, high‑performance databases.

BaikalDB · Cloud Native · Raft replication
0 likes · 27 min read
DataFunTalk
Jul 20, 2021 · Databases

Time‑Series Database Series: Trends, Design Principles, and Comparative Analysis of OpenTSDB, InfluxDB, and Apache IoTDB

This article explores the evolution and current landscape of time‑series databases, detailing design principles, storage structures such as B‑Tree, B+Tree, and LSM‑Tree, and providing an in‑depth comparison of OpenTSDB, InfluxDB, and the emerging Apache IoTDB, while also discussing practical deployment considerations and industry use cases.

Apache IoTDB · B+Tree · InfluxDB
0 likes · 38 min read
ITFLY8 Architecture Home
Jun 18, 2021 · Fundamentals

What Makes Distributed File Systems Tick? Design Principles and Architecture Explained

This article examines the core concepts, requirements, architectural models, persistence strategies, scalability, high‑availability mechanisms, performance optimizations, security models, and practical considerations of distributed file systems such as HDFS, GFS, and Ceph, offering a comprehensive guide for engineers and researchers.

Data Consistency · Distributed File System · Scalability
0 likes · 21 min read
IT Architects Alliance
Jun 11, 2021 · Backend Development

Understanding Message Queues: From Redis List to Kafka and Pulsar

This article explains the evolution of message‑queue middleware by comparing the basic double‑ended queue implementation, Redis list usage, Kafka’s partitioned log architecture with segment files and sparse indexes, and Pulsar’s compute‑storage separation using BookKeeper, highlighting their designs, strengths, and trade‑offs.

Distributed Systems · Kafka · Message Queue
0 likes · 28 min read
IT Architects Alliance
Jun 6, 2021 · Fundamentals

Design Principles and Architecture of Distributed File Systems

This article provides a comprehensive overview of distributed file systems, covering their historical evolution, essential requirements, architectural models with and without central nodes, persistence strategies, scalability, high availability, performance optimization, security mechanisms, and additional considerations such as space allocation, file deletion, small‑file handling, and deduplication.

Consistency · Distributed File System · Scalability
0 likes · 21 min read
DataFunTalk
Jun 4, 2021 · Fundamentals

Intel Optane Persistent Memory: Architecture, Programming Model, and Development Kit Overview

This article introduces Intel® Optane™ Persistent Memory (PMem), explains its performance characteristics, usage modes, the SNIA NVM programming model, and details the PMDK libraries and related tools that enable developers to build high‑performance, fault‑tolerant applications on persistent memory.

Intel Optane · Memory Management · NVM Programming
0 likes · 16 min read
Programmer DD
Jun 1, 2021 · Fundamentals

What Makes Distributed File Systems Tick? Design Principles and Architecture Explained

This article explores the core concepts, design requirements, architectural models, scalability, high availability, performance optimization, and security considerations of distributed file systems, comparing centralized and decentralized approaches while highlighting practical solutions for persistence, consistency, and fault tolerance.

Consistency · Distributed File System · Scalability
0 likes · 21 min read
Open Source Linux
May 26, 2021 · Fundamentals

How Hard Drives Store Data: Inside the Mechanics and Architecture

This article explains the types, components, operation principles, data layout, read/write process, locality-based prefetching, fragmentation, capacity calculations, and RAID configurations of modern hard disk drives, providing a comprehensive technical overview for anyone interested in storage fundamentals.

Disk I/O · RAID · hard disk
0 likes · 21 min read
Open Source Linux
Jan 27, 2021 · Fundamentals

Unlocking Ceph: A Deep Dive into Distributed Storage Architecture and Features

This article provides a comprehensive overview of Red Hat Ceph’s distributed object‑storage architecture, covering storage pools, authentication, placement groups, the CRUSH algorithm, replication, erasure coding, internal operations, high‑availability mechanisms, client interfaces, and encryption, illustrated with diagrams and practical details.

CRUSH · Ceph · Replication
0 likes · 40 min read
Architects' Tech Alliance
Dec 7, 2020 · Fundamentals

Overview of Lustre Parallel File System Architecture and Performance Characteristics

The article provides a comprehensive overview of the Lustre parallel file system architecture, its core components, POSIX compliance, scalability, high‑performance networking, security features, data layout mechanisms, and performance considerations for large and small files, along with practical optimization tips for HPC environments.

HPC · Lustre · POSIX
0 likes · 17 min read
Architects' Tech Alliance
Dec 3, 2020 · Fundamentals

IBM GPFS (Spectrum Scale) Overview: History, Architecture, Features, and High‑Performance Computing Use Cases

This article provides a comprehensive overview of IBM's General Parallel File System (GPFS), detailing its historical development, architectural models—including SAN, NSD, and Share‑Nothing Cluster—its operational capabilities, performance advantages, scalability, high‑availability features, and its role in large‑scale high‑performance computing environments.

Distributed File System · GPFS · High‑performance computing
0 likes · 12 min read
Tencent Cloud Developer
Dec 2, 2020 · Big Data

WeChat Pay Log System at Scale: Practices with Hermes

WeChat Pay’s Hermes‑based log system ingests trillions of entries daily and stores petabytes across a 200‑node HDFS cluster with four‑nines availability. LSM‑style writes, separate inverted indexes, and hot‑cold tiering cut memory, disk, and cost by up to 70% and keep 95% of queries under five seconds.

HDFS · Hermes · Log Analytics
0 likes · 7 min read
Architects' Tech Alliance
Nov 30, 2020 · Industry Insights

Cut Storage Costs and Boost Disaster Recovery with Deduplication and Encryption

Data deduplication eliminates redundant data blocks to lower storage and bandwidth costs, while source‑ and transmission‑level encryption safeguards data in transit and at rest; the article also compares hardware vs software deduplication, various storage architectures (DAS, SAN, NAS, object and distributed storage) and their trade‑offs.

Backup · NAS · SAN
0 likes · 15 min read
dbaplus Community
Nov 19, 2020 · Big Data

How Banks Can Tame Petabytes of Unstructured Data: Architecture and Best Practices

This article presents a comprehensive design and deployment plan for a bank's unstructured data service platform, covering data growth challenges, lifecycle management, three‑tier storage architecture, Elasticsearch indexing, fault‑tolerant disaster recovery, monitoring, and future development directions.

Elasticsearch · disaster recovery · storage architecture
0 likes · 19 min read
Architects' Tech Alliance
Nov 7, 2020 · Fundamentals

Design Considerations and Architecture of Distributed File Systems

This article examines the evolution, core requirements, architectural models (centralized and decentralized), persistence strategies, scalability, high availability, performance optimization, security mechanisms, and additional design trade‑offs of distributed file systems, providing a comprehensive overview for architects and engineers.

Consistency · Distributed File System · Scalability
0 likes · 19 min read
OPPO Kernel Craftsman
Sep 18, 2020 · Fundamentals

Linux SCSI Subsystem Architecture and Design Framework

The article thoroughly examines the Linux SCSI subsystem architecture, illustrating how abstract hardware topology—hosts, channels, IDs, and LUNs—is modeled in software through structs such as scsi_host and scsi_device, and detailing initialization, scanning, path establishment, power management, and block‑layer integration to guide kernel storage driver design.

Block Layer · Data Structures · Linux kernel
0 likes · 20 min read
Big Data Technology & Architecture
Sep 14, 2020 · Big Data

Distributed File Systems: Overview, Design Requirements, Architecture Models, and Key Considerations

This article provides a comprehensive overview of distributed file systems, covering their historical evolution, essential design requirements, centralized and decentralized architecture models, persistence, scalability, high availability, performance optimization, security, and additional practical aspects such as space allocation, file deletion, small‑file handling, and deduplication.

Data Consistency · Distributed File System · Scalability
0 likes · 21 min read
Architects' Tech Alliance
Jul 15, 2020 · Fundamentals

Design Considerations and Architecture of Distributed File Systems

This article provides a comprehensive overview of distributed file systems, covering their historical evolution, essential requirements such as POSIX compliance, persistence, scalability, and security, and compares centralized (e.g., GFS) and decentralized (e.g., Ceph) architectures, along with strategies for high availability, performance optimization, and handling small files.

Consistency · Distributed File System · Performance Optimization
0 likes · 21 min read
Architects' Tech Alliance
Mar 6, 2020 · Fundamentals

Design Principles and Architecture of Distributed File Systems

This article provides a comprehensive overview of distributed file systems, covering their historical evolution, essential requirements, architectural models with and without a central node, persistence strategies, scalability, high availability, performance optimizations, security mechanisms, and practical considerations for small‑file workloads.

Consistency · Distributed File System · Performance Optimization
0 likes · 20 min read
Architects' Tech Alliance
Feb 16, 2020 · Fundamentals

Design Principles and Architecture of Distributed File Systems

This article provides a comprehensive overview of distributed file systems, covering their historical evolution, essential requirements such as POSIX compliance, persistence, scalability and security, architectural models with and without a central node, replication strategies, load balancing, high availability, performance optimizations, and practical considerations for small‑file handling.

Consistency · Distributed File System · Scalability
0 likes · 20 min read
UCloud Tech
Sep 2, 2019 · Backend Development

How UFS’s Performance‑Optimized Architecture Cuts I/O Latency to Sub‑10ms

This article explains how UCloud's performance‑oriented UFS leverages NVMe SSDs, upgrades to NFSv4, redesigns business indexing, and introduces a novel append‑only storage engine with stream/extent architecture to achieve sub‑10 ms latency and high IOPS for demanding AI and analytics workloads.

Distributed File System · NFSv4 · NVMe SSD
0 likes · 17 min read
DevOps Cloud Academy
Aug 11, 2019 · Big Data

Overview of MFS Distributed File System Architecture Similar to GoogleFS

The article explains the MFS (MooseFS) distributed file system, detailing its four components (Master, Metalogger, Chunkserver, and Client) along with hardware recommendations, metadata handling, replication strategies, and FUSE‑based client mounting, providing a comprehensive guide to building a GoogleFS‑like storage cluster.

Big Data · Distributed File System · MFS
0 likes · 5 min read
Architects' Tech Alliance
Nov 21, 2018 · Fundamentals

Unlocking NVMe: How PCIe‑Based SSDs Achieve Ultra‑Low Latency and High IOPS

This article explains the NVMe (Non‑Volatile Memory Express) standard, its logical device interface, key attributes, queue architecture, namespace concepts, multi‑path I/O, SR‑IOV support, and how it compares to traditional SCSI storage, providing a comprehensive technical overview for modern data‑center and client systems.

Enterprise Storage · IO Queues · NVMe
0 likes · 10 min read
Architects' Tech Alliance
May 8, 2018 · Fundamentals

Welcome to the NVMe Era: Pure Storage’s Take on Dell’s New PowerMax

The article summarizes a blog post by Pure Storage’s Matt Kixmoeller, which critiques Dell EMC’s PowerMax as a rebranded, partially NVMe‑optimized VMAX, highlights Pure’s own NVMe‑centric strategy, points out performance and efficiency shortcomings, and discusses the broader impact of NVMe on enterprise storage design.

Dell EMC · Enterprise Storage · FlashArray
0 likes · 7 min read
Architects' Tech Alliance
Dec 18, 2017 · Fundamentals

GPFS Technical Practice Sharing and Building‑Block Design Overview

This article provides a comprehensive overview of IBM GPFS, covering its architecture, management components, networking models, cluster and storage design, as well as practical guidance on building‑block configurations for performance and capacity scaling in high‑performance computing environments.

Building Block · Cluster Management · Distributed File System
0 likes · 13 min read
Efficient Ops
Jul 16, 2017 · Cloud Computing

Why PB‑Level Object Storage Is Essential and How to Choose the Right Solution

With data volumes soaring to petabyte scales, the article explains why object storage is the only viable solution for massive storage needs, outlines procurement considerations, design principles, and operational challenges, and offers practical guidance for building, evaluating, and scaling PB‑level storage systems.

Big Data · cloud computing · object storage
0 likes · 38 min read
ITFLY8 Architecture Home
Jul 1, 2017 · Fundamentals

Designing Distributed File Systems: Solving Local FS Limits

Distributed file systems extend traditional local storage by partitioning data across multiple servers, using a master node for metadata and coordination, handling namespace, replication, load balancing, caching, and client interfaces, thereby overcoming file size, quantity, and concurrency constraints of ext3, reiserfs, and similar local filesystems.

Distributed File System · Replication · caching
0 likes · 15 min read
Architects' Tech Alliance
Jan 19, 2017 · Fundamentals

Ceph Architecture, Design Principles, Features, and BlueStore Deep Dive

This article shares a series of Sage Weil’s Ceph video lectures that thoroughly explain Ceph’s architecture, design philosophy, key features, and the BlueStore storage engine, providing links to additional resources and background on Ceph’s development history for enthusiasts seeking an in‑depth understanding.

BlueStore · Ceph · Object Store
0 likes · 4 min read
Architects' Tech Alliance
Jul 20, 2016 · Operations

How Distributed Indexing Improves Backup Performance and Scalability

The article explains how traditional centralized backup indexes become performance bottlenecks as data grows, and details Simpana's two‑level distributed indexing architecture—primary and secondary indexes—showing how it enhances backup speed, reduces network load, and simplifies recovery across multi‑site environments.

Backup · Data Recovery · Operations
0 likes · 7 min read
Art of Distributed System Architecture Design
Apr 15, 2016 · Cloud Computing

How Meizu Optimized Cloud Sync: Architecture, Protocols, and Performance Insights

This article provides an in‑depth technical overview of Meizu's cloud sync service, covering its business scenarios, the evolution of its core SyncML‑based protocols, modular architecture across multiple data centers, performance optimizations such as traffic reduction and connection handling, as well as high‑availability, security, and future SDK‑API extensions.

Distributed Systems · Performance Optimization · cloud sync
0 likes · 25 min read
Java High-Performance Architecture
Feb 29, 2016 · Backend Development

How Kafka Stores and Retrieves Messages: Inside Partitions, Segments, and Index Files

Kafka persists messages on disk by organizing each topic into multiple partitions, each of which is divided into segments consisting of a paired .index and .log file; this structure enables efficient storage and offset-based lookup, with a binary search over the segment indexes quickly locating a specific message.

Kafka · Message Queue · storage architecture
0 likes · 5 min read
21CTO
Nov 13, 2015 · Operations

Inside Taobao’s High‑Performance Distributed File System (TFS): Architecture & Scaling

Taobao File System (TFS) is a highly available, high‑performance distributed storage solution built on Linux servers, featuring name‑server and data‑server clusters, block‑level replication, HA mechanisms, client caching, seamless scaling, multi‑data‑center disaster recovery, and open‑source support for C++, Java, and Nginx integration.

Distributed File System · Taobao · high availability
0 likes · 15 min read
Architect
Nov 4, 2015 · Backend Development

Design Principles and Architecture of the FastDFS Distributed File System

The article explains FastDFS, an open‑source lightweight distributed file system built from tracker servers, storage servers, and client libraries, detailing its component roles, file upload/download workflows, storage organization, small‑file merging, HTTP support, and current limitations such as data safety and load balancing.

Distributed File System · FastDFS · storage architecture
0 likes · 16 min read