Tagged articles
469 articles
Page 2 of 5
AntTech
Mar 22, 2024 · Cloud Native

LightPool: A Cloud‑Native NVMe‑oF Based High‑Performance Storage Pool Architecture for Distributed Databases

The article introduces LightPool, an open‑source, cloud‑native storage‑pool architecture presented at HPCA 2024, which leverages NVMe‑over‑Fabric, Kubernetes CSI integration, and a lightweight user‑space engine to deliver high‑performance, elastic, and highly available storage for large‑scale distributed databases while reducing cost and improving resource utilization.

NVMe-oF · cloud-native · distributed-database
0 likes · 12 min read
MaGe Linux Operations
Mar 22, 2024 · Fundamentals

Mastering LVM on Linux: From Basics to Advanced Operations

This comprehensive guide explains what LVM is, its core concepts, installation steps, common commands, how to create and manage physical volumes, volume groups, and logical volumes, extend and shrink volumes, handle snapshots, and migrate volume groups across machines, all illustrated with practical command‑line examples.

LVM · Linux · Logical Volume Manager
0 likes · 30 min read
Alibaba Cloud Infrastructure
Mar 21, 2024 · Cloud Native

LightPool: An NVMe‑oF‑Based High‑Performance and Lightweight Storage Pool Architecture for Cloud‑Native Distributed Databases

The article presents LightPool, a cloud‑native storage‑pooling solution that leverages NVMe‑over‑Fabric, Kubernetes‑based scheduling, and a lightweight user‑space engine to deliver high‑performance, low‑cost, and highly available storage for large‑scale distributed databases while eliminating traditional bottlenecks.

Kubernetes · NVMe-oF · cloud-native
0 likes · 13 min read
AntData
Mar 21, 2024 · Cloud Computing

LightPool: A Cloud‑Native NVMe‑oF Based High‑Performance Storage Pool Architecture for Distributed Databases

The article introduces LightPool, an open‑source cloud‑native storage‑pool architecture built on NVMe‑over‑Fabric that delivers high performance, low cost, and high availability for large‑scale distributed databases, and explains its design, scheduling, storage engine, and hot‑upgrade/migration capabilities presented at the 30th IEEE HPCA conference.

LiteIO · NVMe-oF · cloud-native
0 likes · 13 min read
Architects' Tech Alliance
Mar 17, 2024 · Industry Insights

Why Hyper‑Converged Infrastructure Beats Traditional VMware + FC SAN: 4 Key Differences

The article compares hyper‑converged infrastructure with the traditional VMware + FC SAN stack, highlighting four architectural differences and showing how hyper‑convergence improves reliability, concurrency performance, scalability, operational simplicity, and total cost of ownership for modern data‑center workloads.

Cost · Data center · Hyper-Converged
0 likes · 8 min read
Liangxu Linux
Mar 7, 2024 · Operations

How Upgrading EBS Volumes Boosted etcd Write Performance by 30%

A technical deep‑dive shows how a team managing dozens of Kubernetes clusters diagnosed a write‑ahead‑log bottleneck in etcd, measured IOPS and latency with etcdctl and fio, upgraded gp2 volumes to gp3, and discovered diminishing returns beyond 3000 IOPS while explaining the role of fdatasync in storage performance.

AWS · EBS · IOPS
0 likes · 11 min read
MaGe Linux Operations
Mar 3, 2024 · Operations

Master Linux I/O Schedulers, fio Workloads, and RAID Performance

This guide explains how to choose Linux I/O schedulers, generate realistic storage workloads with fio, configure various RAID levels using mdadm and LVM, and monitor performance with tools like top, iostat, iotop, blktrace, and atop, providing practical command examples and best‑practice recommendations.

I/O scheduler · LVM · Linux
0 likes · 24 min read
Aikesheng Open Source Community
Feb 27, 2024 · Cloud Native

Detailed Overview of LiteIO Architecture, Components, and Volume Lifecycle

This article provides a comprehensive technical overview of LiteIO, describing its core and CSI components, the complete volume lifecycle within Kubernetes, Disk‑Agent responsibilities, common implementation pitfalls, storage‑pool construction methods, and the design of the node‑disk‑controller, scheduler, and CSI modules.

CSI · Cloud Native · Distributed Systems
0 likes · 13 min read
MaGe Linux Operations
Jan 30, 2024 · Cloud Native

How to Auto‑Recover Lost s3fs Mounts in a Huawei OBS CSI Plugin

This article explains why a Huawei OBS CSI plugin loses its s3fs process after a restart, causing "Transport endpoint is not connected" errors, and provides a step‑by‑step solution using client‑go to rebuild the mount and trigger kubelet remount via a liveness probe.

CSI · CloudNative · Kubernetes
0 likes · 7 min read
Tencent Architect
Jan 23, 2024 · Fundamentals

XFS Deep Dive: Layout, Inode Management, and Read/Write Operations

This article analyzes the XFS filesystem implementation in the Linux kernel, covering its on‑disk layout, superblock and allocation‑group structures, inode and free‑space B+ trees, operation sets (iops, fops, aops), file creation, write and read paths, logging, block layer interactions, and useful XFS utilities.

Filesystem · Kernel · XFS
0 likes · 12 min read
Architects' Tech Alliance
Jan 14, 2024 · Fundamentals

Overview of CPU, GPU, and Storage Fundamentals in the Xinchuang Industry

This article introduces the Xinchuang (information technology innovation) industry, outlines its hardware components, and provides concise explanations of CPU concepts, instruction sets, GPU architecture and operation, as well as storage classifications, while also linking to related research reports and promotional resources.

CPU · GPU · Information Technology
0 likes · 8 min read
Aikesheng Open Source Community
Jan 8, 2024 · Databases

MySQL General Tablespaces: A Powerful Storage Option

This article explains what MySQL general tablespaces are, outlines their main features and benefits, provides step‑by‑step commands for creating, managing, and assigning tables to them, and shows monitoring queries and practical usage scenarios for improved storage efficiency and performance.

InnoDB · encryption · general tablespace
0 likes · 12 min read
Architects' Tech Alliance
Dec 9, 2023 · Fundamentals

Impact of IO Patterns on NVMe SSD Performance and Optimization Strategies

The article examines how different I/O patterns, such as sequential writes, request size interference, and read/write conflicts, affect NVMe SSD performance, explains the underlying mechanisms like write amplification and GC behavior, and proposes software-level optimizations—including large‑block writes, Optane caching, and OpenChannel/Object SSD designs—to improve throughput and latency.

IO Pattern · NVMe · Performance Optimization
0 likes · 10 min read
Baidu Geek Talk
Nov 29, 2023 · Databases

How Baidu Built an HTAP Table Storage System to Tackle Massive Data Analytics

This article examines Baidu Search's content storage team's HTAP table storage system, detailing the challenges of supporting massive OLAP workloads on an OLTP‑oriented backend, the architectural split into Neptune and Saturn, storage‑engine optimizations such as row partitioning and dynamic columns, and a SQL‑like KQL framework for compute and scheduling.

HTAP · KQL · OLAP
0 likes · 13 min read
Open Source Linux
Nov 21, 2023 · Fundamentals

Understanding RAID Levels: Choose the Right Storage Solution for Performance and Reliability

RAID combines multiple physical disks into virtual drives, offering various levels—RAID 0, 1, 1ADM, 5, 6, 10, 10ADM, 1E, 50, and 60—each balancing performance, fault tolerance, and capacity, with detailed processing flows, storage calculations, and best‑practice recommendations for optimal deployment.

RAID · data redundancy · fault tolerance
0 likes · 20 min read
dbaplus Community
Nov 20, 2023 · Operations

Can VictoriaLogs Really Beat Elasticsearch, Loki, and ClickHouse? A Deep Dive

VictoriaLogs, a log‑storage system marketed as a cost‑effective, high‑performance alternative, is compared against Elasticsearch/OpenSearch, Grafana Loki, and ClickHouse, highlighting its lower RAM and disk usage, faster queries, simplified setup, and specialized features such as LogsQL, Bloom filters, and custom compression.

ClickHouse · Grafana Loki · Log Management
0 likes · 9 min read
Architects' Tech Alliance
Nov 10, 2023 · Fundamentals

Factors Affecting NVMe SSD Performance: Hardware, Software, and Environmental Influences

The article analyzes how hardware components, software mechanisms, and environmental conditions collectively impact NVMe SSD performance, explaining why real‑world results often differ from vendor specifications and highlighting the roles of NAND type, controller architecture, PCIe bandwidth, garbage collection, temperature, and wear‑leveling.

Garbage Collection · Hardware · NVMe
0 likes · 12 min read
Architects' Tech Alliance
Nov 5, 2023 · Fundamentals

Understanding RAID Fault Tolerance, Consistency Checks, Hot Spare, Rebuild, and Data Protection Features

This article explains RAID fault‑tolerance mechanisms, consistency verification, hot‑spare and emergency backup, rebuild processes, virtual‑disk read/write policies, power‑loss protection, disk striping, mirroring, foreign configurations, power‑saving and pass‑through features, providing a comprehensive overview of modern storage system capabilities.

RAID · disk striping · fault tolerance
0 likes · 16 min read
Architects' Tech Alliance
Oct 21, 2023 · Operations

Understanding NVMe, NVMe‑oF, and RDMA for High‑Performance Storage

This article explains how the emergence of ultra‑fast SSDs and NVMe reshapes storage architecture, details the NVMe protocol and its extension over fabrics, compares NVMe‑oF transport options such as FC, TCP, and RDMA‑based RoCE, and discusses network requirements and performance trade‑offs for modern data‑center deployments.

Data center · NVMe · RDMA
0 likes · 17 min read
Architect's Guide
Oct 19, 2023 · Databases

Understanding Database Indexes: Storage Principles, Types, and Optimization Techniques

This article explains how computer storage works, why database indexes dramatically speed up queries, the mechanics of binary search, the differences between clustered and non‑clustered indexes, common pitfalls of over‑indexing, and practical SQL optimization strategies to avoid full table scans and index invalidation.

Binary Search · Clustered Index · SQL Optimization
0 likes · 12 min read
Architects' Tech Alliance
Oct 18, 2023 · Industry Insights

What’s Driving the Revival of the Storage Chip Market and the Rise of Domestic Alternatives?

This article provides a comprehensive overview of the storage chip market, detailing the different storage media, memory hierarchy, DRAM/ROM technologies, DDR standards, NAND flash classifications, 3D NAND advancements, and the industry’s supply‑chain structure, while highlighting the emerging domestic substitution trend.

DDR · Industry analysis · Memory
0 likes · 11 min read
DataFunTalk
Oct 17, 2023 · Databases

Evolution of MatrixOne: From NewSQL MPP to a Modern HTAP Architecture

This article details the evolution of the open‑source distributed database MatrixOne, describing its early NewSQL/MPP design, the scalability, performance and cost challenges it faced, and the architectural redesign that introduced decoupled compute, transaction, and storage layers with HTAP support using TAE, S3, and Logtail.

Compute · HTAP · MatrixOne
0 likes · 20 min read
Architects' Tech Alliance
Sep 29, 2023 · Artificial Intelligence

AI Compute Landscape: GPUs, Networking, and Storage

The article analyzes the AI compute ecosystem—highlighting GPUs as the core engine, network bandwidth as a bottleneck, and storage memory walls—while also promoting comprehensive server and storage e‑books for deeper technical insight.

AI · Compute · E‑book
0 likes · 4 min read
Architect
Sep 28, 2023 · Databases

How to Pick the Best Storage Engine for High‑Throughput Browsing Records: Redis, MySQL or Tair?

This article walks through a real‑world e‑commerce scenario where billions of daily browsing events generate over 100K TPS writes, evaluates storage options on reliability, cost, read/write performance and implementation difficulty, and ultimately recommends Tair after detailed analysis of List, Sorted‑Set and Hash structures, code examples, and concurrency controls.

Tair · concurrency · data modeling
0 likes · 15 min read
Ops Development Stories
Aug 17, 2023 · Operations

Understanding Server Storage Types: HDD, SSD, RAID & Network Storage Explained

This article explains the different server storage types—including mechanical HDDs, solid‑state SSDs, various interfaces, RAID arrays, and network‑attached storage—while covering their structures, performance characteristics, Linux device handling, and how to choose the right solution based on cost, capacity, speed, and reliability.

Linux · RAID · SSD
0 likes · 9 min read
Liangxu Linux
Jul 20, 2023 · Fundamentals

How to Choose the Right Desktop PC Components for Your Needs

This guide explains how to select desktop computer parts—including CPU, GPU, motherboard, memory, storage, power supply, and cooling—by evaluating usage, performance tiers, specifications, brand options, and compatibility, while also noting which components can be safely bought second‑hand.

CPU · GPU · Memory
0 likes · 9 min read
21CTO
Jul 15, 2023 · Backend Development

From ActiveMQ to Pulsar: The Evolution of Message Queues Explained

This article traces the development of message queues from early decoupling solutions like ActiveMQ and RabbitMQ, through high‑throughput designs such as Kafka and RocketMQ, to modern platform‑centric systems like Pulsar, while detailing core concepts, architecture diagrams, storage mechanisms and trade‑offs.

Backend · Kafka · Message Queue
0 likes · 15 min read
Efficient Ops
Jul 12, 2023 · Cloud Native

Mastering Kubernetes Persistent Storage: PV, PVC, and CSI Explained

Learn how Kubernetes abstracts persistent storage using PersistentVolumes (PV) and PersistentVolumeClaims (PVC), why HostPath is limited, the role of network storage, and how to implement CSI drivers with practical YAML examples and kubectl commands for creating and managing PVs and PVCs.

CSI · Kubernetes · PVC
0 likes · 6 min read
Architects' Tech Alliance
Jun 26, 2023 · Fundamentals

Understanding Linux Ext Filesystems, RAID, and LVM

This article explains the structure of Linux Ext (2/3/4) filesystems, detailing superblocks, inode tables and data blocks, then describes block groups, the role of superblocks, and outlines the differences between hardware and software RAID as well as the principles and risks of using LVM for flexible storage management.

Filesystem · LVM · Linux
0 likes · 5 min read
DataFunSummit
Jun 21, 2023 · Databases

Forum on Building Ultra‑Scale Storage Systems: Insights from Baidu, Meituan, Ant Group, Xiaomi and Baidu Cloud

The forum gathers senior experts from Baidu, Meituan, Ant Group, Xiaomi and Baidu Cloud to share practical experiences and future trends on constructing ultra‑large‑scale file, block, KV and NoSQL storage systems, focusing on low‑cost, high‑performance solutions and architectural challenges.

Distributed Systems · KV storage · block storage
0 likes · 8 min read
Architects' Tech Alliance
Jun 14, 2023 · Artificial Intelligence

AI Compute Landscape: GPUs, Network, and Storage as Core Engines

The article analyzes how large language models like ChatGPT are reshaping the software ecosystem by positioning AI compute—driven by GPUs, high‑speed networking, and advanced storage solutions such as HBM and 3D‑stacked memory—as the foundational engine for future information systems, highlighting current market trends and technical challenges.

AI · Compute · GPU
0 likes · 4 min read
Alibaba Cloud Native
Jun 9, 2023 · Cloud Native

Accelerate AI & Big Data on Kubernetes with Elastic File Client & Fluid

This article explains how the Elastic File Client (EFC) and Fluid together provide a cloud‑native, high‑performance storage solution for AI and big‑data workloads on Kubernetes, detailing architecture challenges, core features, performance benchmarks, and a step‑by‑step deployment guide.

AI · Big Data · Cloud Native
0 likes · 16 min read
Architects' Tech Alliance
Jun 1, 2023 · Fundamentals

Overview of the Xinchuang (Information Technology Innovation) Industry: CPU, GPU, and Storage Fundamentals

This article provides a comprehensive overview of the Xinchuang industry, detailing the fundamental concepts, architectures, and classifications of CPUs, GPUs, and storage devices, and explains how these core hardware components support the goal of achieving self‑controlled, secure information technology in China.

CPU · GPU · Hardware
0 likes · 6 min read
AI Cyberspace
May 22, 2023 · Cloud Computing

Step‑by‑Step Guide to Deploy a Ceph Cluster with ceph‑deploy on CentOS 7

This tutorial walks through the complete process of preparing three CentOS 7 nodes, installing ceph‑deploy, configuring storage and network topology, and sequentially deploying monitors, managers, OSDs, MDS, RGW, enabling the Ceph dashboard, performing verification tests, and finally uninstalling the cluster.

Ceph · Ceph-deploy · Cluster
0 likes · 30 min read
Ops Development Stories
May 17, 2023 · Operations

Deploy Ceph Offline on ZTE NewStart CGSL: Step‑by‑Step Guide

This article provides a comprehensive, step‑by‑step tutorial for installing Ceph on ZTE NewStart CGSL using an offline RPM repository, covering yum repository setup, Ceph package installation, monitor, OSD, and MGR node configuration, and includes all necessary command examples.

Ceph · Linux · ZTE NewStart
0 likes · 16 min read
ITPUB
May 10, 2023 · Cloud Native

How Meituan’s MStore Achieves Scalable Storage‑Compute Separation in Cloud‑Native Environments

This article explains how Meituan’s storage team designed the MStore distributed storage platform to separate storage and compute, addressing scaling, cost, and reliability challenges of monolithic architectures, and details its cloud‑native components, data model, performance optimizations, observability, and the derived EBS block‑storage service.

Distributed Systems · MStore · cloud-native
0 likes · 16 min read
ITPUB
Apr 10, 2023 · Backend Development

How Bilibili Scales Its Like Service: Architecture, Storage, and Disaster Recovery

This article details the design of Bilibili's like system, covering business capabilities, multi‑layer storage, traffic handling, disaster‑recovery strategies, and future plans to ensure a reliable like service under high traffic for videos, posts, comments, and more.

Backend · disaster recovery · storage
0 likes · 15 min read
DataFunSummit
Mar 28, 2023 · Big Data

Core Technologies, Performance Metrics, Challenges, and Future Trends of Cloud‑Native Big Data – Expert Interview

In this expert interview, a chief big‑data architect from NetEase explains the core technology layers, key performance indicators, major challenges and mitigation strategies, the business value, and emerging trends of cloud‑native big data platforms, highlighting scheduling, storage, and mixed‑deployment considerations.

Scheduling · big-data · storage
0 likes · 15 min read
System Architect Go
Mar 21, 2023 · Cloud Native

Understanding and Using Kubernetes Volume Snapshots

This article explains the concepts, architecture, configuration, and practical use cases of Kubernetes volume snapshots, including how to define snapshot classes, create snapshots, clone PVCs, and perform consistent backups across different storage providers and clusters.

CSI · CloudNative · Kubernetes
0 likes · 11 min read
Alibaba Cloud Big Data AI Platform
Mar 13, 2023 · Big Data

Unlocking Big Data with Alibaba Cloud’s Native Data Lake Solution

Alibaba Cloud’s cloud‑native data lake analysis solution combines fully managed storage (OSS‑HDFS), a one‑stop lake management platform (Data Lake Formation), and multimodal compute capabilities, delivering high performance, massive scalability, and low cost for big‑data and AI workloads across offline, real‑time, and lake‑house scenarios.

Analytics · Big Data · Cloud Native
0 likes · 11 min read
Open Source Linux
Feb 27, 2023 · Fundamentals

Mastering RAID: From Basics to Linux Software RAID 5 Setup

This article explains the fundamentals of RAID, compares common RAID levels, describes hardware RAID components, and provides a step‑by‑step guide for configuring a Linux software RAID 5 array with mdadm, including commands, monitoring, and recovery procedures.

Hardware · Linux · RAID
0 likes · 11 min read
DataFunTalk
Feb 18, 2023 · Big Data

Xiaomi Data Governance Evolution: Cost Governance Practices for HDFS and HBase

The article outlines Xiaomi's data governance journey, focusing on storage‑service cost governance, describing the transition from simple cost‑centered governance to big‑data‑driven asset management, and detailing concrete HDFS and HBase practices that achieved significant resource and cost reductions.

Big Data · Data Governance · HBase
0 likes · 15 min read
Architect's Guide
Jan 24, 2023 · Databases

Understanding Database Indexes: How They Accelerate Query Performance

This article explains the evolution of data storage, the fundamentals of computer storage devices, how database indexes function like a book's table of contents, the role of binary search, the benefits and drawbacks of indexes, clustered versus non‑clustered indexes, and common SQL optimization techniques.

Clustered Index · SQL · database
0 likes · 11 min read
DataFunSummit
Jan 20, 2023 · Cloud Native

Design and Architecture of JuiceFS: A Cloud‑Native Distributed File System

This article reviews the evolution of file storage, outlines the challenges of the cloud era, and details JuiceFS's design philosophy, architecture, key capabilities, and real‑world use cases such as Kubernetes, AI, big‑data analytics, and NAS migration to the cloud.

AI · Cloud Native · Distributed File System
0 likes · 22 min read
StarRing Big Data Open Lab
Jan 18, 2023 · Big Data

How Distributed Technologies Power Modern Big Data Platforms

This article explains how distributed storage, computing, and resource‑management technologies have evolved—from early Google File System research to Hadoop, Spark, and Kubernetes—enabling enterprises to tackle the 4 Vs of big data while reducing cost, improving performance, and supporting real‑time analytics.

Resource Management · computing · storage
0 likes · 17 min read
StarRing Big Data Open Lab
Jan 6, 2023 · Big Data

Designing a Unified Enterprise Data Storage and Compute Platform

This article explains how enterprises can build a unified data storage and compute foundation, covering strategic goals, functional and architectural requirements, and the layered design of business support, storage‑compute, and resource management to enable scalable, secure, and high‑performance data platforms.

Compute · Data Platform · enterprise architecture
0 likes · 15 min read
Java High-Performance Architecture
Jan 4, 2023 · Databases

Why Do Database Indexes Speed Up Queries? A Deep Dive into Storage and Optimization

This article explains how databases store data on physical devices, how indexes work like a book’s table of contents to accelerate queries, covers storage fundamentals, binary search, clustered vs non‑clustered indexes, and practical SQL optimization tips to avoid full‑table scans and index pitfalls.

SQL Optimization · indexing · storage
0 likes · 12 min read
Top Architect
Jan 4, 2023 · Cloud Computing

Diskless Architecture for Modern Data Centers: Challenges, Technologies, and Industry Practices

The article outlines the evolution of data‑center architectures, identifies capacity, efficiency, and performance challenges of traditional storage‑compute models, and presents the emerging Diskless architecture—leveraging DPU, CXL, RDMA, and high‑throughput networking—to achieve decoupled, pool‑based resources and improve overall data‑center utilization.

CXL · Compute · DPU
0 likes · 12 min read
Architecture Digest
Jan 4, 2023 · Databases

YouTube Backend Architecture: Databases, Scaling, and Vitess

This article examines YouTube’s massive backend infrastructure, detailing how the platform stores billions of videos using MySQL, Vitess for horizontal scaling, sharding, replication, disaster management, cloud‑native deployment on Kubernetes, and the supporting storage systems such as GFS and BigTable.

Backend · Cloud Native · Scalability
0 likes · 13 min read
Ops Development Stories
Dec 29, 2022 · Operations

Master Ceph Cache Tiering: Principles, Modes, and Deployment Guide

This article explains the fundamentals of Ceph cache tiering, covering cache and buffer concepts, the two cache‑pool modes (write‑back and read‑forward), step‑by‑step deployment, configuration parameters, and proper procedures for creating, tuning, and safely removing cache pools.

Cache Tiering · Ceph · Linux
0 likes · 14 min read
Architects' Tech Alliance
Dec 11, 2022 · Fundamentals

Fundamentals of CPU, GPU, and Storage in the Xinchuang Industry

This article provides a comprehensive overview of the Xinchuang industry’s hardware fundamentals, detailing CPU architecture and operation, instruction set classifications, GPU concepts and workflows, storage categories, and the distinction between independent and integrated GPUs, while also noting related promotional resources.

CPU · GPU · Hardware
0 likes · 8 min read
ITPUB
Dec 7, 2022 · Databases

Can Modern OSes Replace Database Kernels? Insights on OS‑Database Synergy

The article examines whether today’s powerful operating systems can assume tasks traditionally handled by database kernels, tracing the evolution from early OS‑dependent databases to modern systems that leverage OS file cache, VMM, and storage optimizations, while also discussing scenarios where direct DB control remains necessary.

Linux · Operating System · PostgreSQL
0 likes · 8 min read
Liangxu Linux
Nov 10, 2022 · Cloud Native

How to Quickly Set Up a Podman Environment on CentOS 8

This guide walks you through installing Podman on CentOS 8, configuring user namespaces, creating and building a custom UBI‑based image with a Dockerfile, running containers, verifying installed tools, using interactive shells, and mounting host directories for persistent storage.

CentOS · Container · Dockerfile
0 likes · 6 min read
IT Services Circle
Nov 5, 2022 · Fundamentals

UFS File-Based Optimization (FBO) Standard and Its Controversial Linux Kernel Patch

JEDEC’s new UFS File‑Based Optimization (FBO) 1.0 standard defines an interface for host‑device communication to defragment file‑related LBA ranges, but a Xiaomi engineer’s Linux kernel patch implementing it was sharply criticized by kernel veteran Christoph Hellwig as impractical and unrelated to the kernel.

File-Based Optimization · JEDEC · Linux kernel
0 likes · 4 min read
21CTO
Nov 4, 2022 · Fundamentals

Can File‑Based Optimization Save UFS Performance? Inside the Linux Kernel Debate

JEDEC’s new UFS File‑Based Optimization (FBO) 1.0 standard aims to improve flash storage performance by defragmenting LBA ranges tied to specific files, but a Xiaomi engineer’s patch proposal to implement it in Linux was sharply rejected by kernel veteran Christoph Hellwig, who called the idea fundamentally flawed.

File-Based Optimization · Linux kernel · UFS
0 likes · 4 min read
Architects' Tech Alliance
Nov 3, 2022 · Fundamentals

Rapid Growth and Technical Overview of 10‑Gigabit Ethernet Deployment

Since the IEEE 802.3ae standard was approved in 2002, 10‑Gigabit Ethernet has expanded dramatically due to falling port prices, new fiber technologies, rising bandwidth demands, and its ability to support high‑performance data center, storage, and clustering applications, making it a key networking solution today.

10GbE · Data center · High Bandwidth
0 likes · 12 min read
Baidu Geek Talk
Nov 3, 2022 · Cloud Native

Challenges and Solutions for AI Storage Systems in Cloud‑Native Training

The talk outlines how AI training’s growing data and compute demands create storage bottlenecks across four evolutionary stages, identifies four core problems—massive data, data‑flow, resource scheduling, and compute acceleration—and proposes hardware, software (parallel file systems, caching), and cloud‑native orchestration (Fluid, Baidu Canghai) solutions that combine object‑storage lakes with high‑performance acceleration layers to achieve near‑full GPU utilization.

AI · Cloud Native · Data Lake
0 likes · 37 min read
Open Source Linux
Oct 26, 2022 · Fundamentals

Choosing the Right RAID Level: Pros, Cons, and Best Use Cases

This guide explains what RAID is, its role in server storage, compares common RAID levels (0, 1, 5, 6, 10) in terms of fault tolerance, performance, and capacity, and offers recommendations for selecting the most suitable RAID configuration based on data safety, speed, and cost considerations.

Data Protection · RAID · fault tolerance
0 likes · 8 min read
DataFunTalk
Oct 19, 2022 · Big Data

Understanding Flink Table Store: Design, Usage, and Roadmap

Flink Table Store, an Apache Flink subproject, provides a unified stream‑batch storage layer with SQL‑based table APIs, addressing real‑time and offline data needs, detailing its design goals, usage patterns, architectural layers, implementation choices, and upcoming roadmap.

Flink · LSM‑Tree · Streaming
0 likes · 14 min read
Selected Java Interview Questions
Oct 17, 2022 · Cloud Native

Comprehensive Docker Guide: Concepts, Installation, Management, and Practical Use Cases

This article provides a detailed overview of Docker, covering its definition, core components, advantages over virtual machines, common use cases, installation steps, storage configuration, image management commands, network modes, data volumes, private registry setup, backup procedures, and MySQL deployment, all illustrated with practical command examples.

Container · DevOps · Docker
0 likes · 18 min read
Architecture Digest
Sep 12, 2022 · Fundamentals

Understanding Low‑Code Implementation Principles: Front‑End Rendering, Back‑End Data Models, and Future Directions

This article explains the core concepts of low‑code platforms, emphasizing visual editing as a mandatory feature, comparing declarative and imperative approaches, detailing front‑end JSON‑to‑React rendering, reviewing five back‑end storage strategies, discussing business‑logic implementation, workflow engines, and predicting the future evolution of low‑code solutions.

Backend · Declarative · amis
0 likes · 36 min read
Liangxu Linux
Sep 7, 2022 · Operations

Mastering lsscsi: Understand SCSI, Install, Options, and Debugging

This guide explains the SCSI protocol and common disk interfaces, details how to install and use the lsscsi command on various Linux distributions, describes its options and output formats, and provides examples of querying devices, interpreting results, and enabling SCSI logging for debugging.

Linux · SCSI · lsscsi
0 likes · 13 min read
dbaplus Community
Sep 4, 2022 · Databases

How to Choose the Right Database: Relational vs NoSQL Storage Guide

This article presents a comprehensive framework for selecting a storage solution, covering business and technical metrics, comparing relational databases with various NoSQL types, and offering concrete pros, cons, and scenario‑based recommendations to help engineers make informed data‑store decisions.

NoSQL · Relational · Selection
0 likes · 18 min read
Architects' Tech Alliance
Aug 1, 2022 · Fundamentals

Specialized Data Processor (DPU) Performance Benchmark Methodology and Implementation

The whitepaper introduces a comprehensive DPU performance benchmark methodology, detailing evaluation criteria across network, storage, compute, and security dimensions, and provides guidance on building test systems, selecting workloads, and designing metrics to enable fair, open, and objective assessment of diverse DPU products.

Compute · DPU · Security
0 likes · 10 min read
ByteDance Web Infra
Jul 22, 2022 · Fundamentals

Challenges and Optimizations for Large Git Repositories

This article examines why Git struggles with monolithic repositories exceeding 100 GB, outlines storage, performance, stability, and reliability challenges, and presents preventive, mid‑process, and post‑process strategies—including LFS, shallow and partial clones, commit‑graph, bitmap, multi‑pack index, and bundle techniques—to efficiently manage and maintain large Git monorepos.

Git · large-repo · optimization
0 likes · 16 min read
NetEase Cloud Music Tech Team
Jun 29, 2022 · Artificial Intelligence

Music FeatureBox: A Custom Feature Store for Machine Learning at NetEase Cloud Music

Music FeatureBox is NetEase Cloud Music’s custom feature store that centralizes metadata, unifies offline and online feature storage across multiple engines, provides a cross‑language DSL for extraction, ensures training‑inference data consistency, and offers built‑in monitoring, thereby streamlining feature engineering and accelerating the platform’s machine‑learning lifecycle.

DataHub · Feature Store · MLOps
0 likes · 17 min read
DataFunTalk
Jun 25, 2022 · Big Data

Alluxio Metadata and Data Synchronization: Design, Implementation, and Optimization

This article provides a comprehensive overview of Alluxio's metadata and data synchronization mechanisms, covering its unified namespace, mounting strategies, consistency models, various write modes, read workflows, metadata sync techniques, performance optimizations, and recommended configurations for different deployment scenarios.

Alluxio · Data Consistency · metadata synchronization
0 likes · 26 min read
Architects' Tech Alliance
Jun 4, 2022 · Operations

Comprehensive Survey of Large‑Scale RDMA Technologies and Practices

This article provides a detailed overview of large‑scale RDMA technology, covering basic concepts, major protocols, network‑level techniques such as congestion control, lossless‑to‑lossy evolution and multipath, virtualization, communication libraries for AI training and storage, performance tuning, monitoring, and real‑world deployment experiences.

AI · RDMA · Virtualization
0 likes · 16 min read
DataFunSummit
Jun 2, 2022 · Databases

An In‑Depth Overview of Apache BookKeeper: Architecture, Features, and Use Cases

This article provides a comprehensive technical overview of Apache BookKeeper, covering its role as a distributed append‑only log service, core concepts, high‑availability mechanisms, storage‑media evolution, comparisons with Raft, and community resources, while illustrating its use in Pulsar and large‑scale data platforms.

Apache BookKeeper · Distributed Log · Pulsar
0 likes · 12 min read
dbaplus Community
May 23, 2022 · Cloud Native

Why Kubernetes Is the Core of Cloud‑Native Architecture: A Deep Dive

This article provides a comprehensive overview of Kubernetes, covering its revolutionary role in cloud computing, master‑worker architecture, core concepts such as Pods and Controllers, storage and networking models, workload types, and extensibility through CRI, CNI, and CSI standards.

Cloud Native · Kubernetes · Networking
0 likes · 18 min read
Open Source Linux
May 20, 2022 · Fundamentals

Why SSDs Outperform HDDs: Inside the Mechanics of Modern Storage

This article explains why solid‑state drives outperform mechanical hard drives by detailing their distinct operating principles, covering HDD magnetic storage, SSD floating‑gate transistors, and how these differences affect data writing, reading, erasing, and overall performance, especially for random access.

HDD · Hardware · SSD
0 likes · 7 min read