Tagged articles
174 articles
Page 1 of 2
Java Tech Enthusiast
Java Tech Enthusiast
Apr 27, 2026 · Industry Insights

Why Optical Drives Have Disappeared from Modern PCs

The article explains how optical drives, once essential for software installation, media playback, and data backup, have been phased out by faster, smaller, and more reliable alternatives such as USB flash drives, cloud storage, SSDs, and digital distribution platforms, leaving the drive as a niche, legacy component.

SSDUSB flash drivecloud storage
0 likes · 8 min read
Why Optical Drives Have Disappeared from Modern PCs
IT Services Circle
IT Services Circle
Apr 25, 2026 · Industry Insights

Massive Cloud‑Storage Crackdown: All Overseas Dramas Banned

Starting April 10, major Chinese cloud‑storage services began a sweeping purge of unauthorized overseas TV series, deleting millions of links, imposing bans on violators, and signaling a new era of strict compliance driven by the government‑led “Jianwang 2025” operation aimed at protecting copyright, cultural security, and user data.

cloud storagecontent moderationcopyright enforcement
0 likes · 8 min read
Massive Cloud‑Storage Crackdown: All Overseas Dramas Banned
360 Zhihui Cloud Developer
360 Zhihui Cloud Developer
Apr 7, 2026 · Cloud Native

Boost OpenStack Storage Efficiency with Ceph RBD Erasure Coding

This article explains how to integrate Ceph's erasure‑coded RBD pools with OpenStack, covering the design principles, storage pool layout, performance trade‑offs, and step‑by‑step configuration for Nova and Cinder to achieve higher storage utilization while maintaining high availability.

CephHybrid ArchitectureOpenStack
0 likes · 13 min read
Boost OpenStack Storage Efficiency with Ceph RBD Erasure Coding
ITPUB
ITPUB
Dec 10, 2025 · Cloud Native

What’s the Best MinIO Alternative? RustFS, Garage, Ceph, and SeaweedFS Compared

With MinIO moving to maintenance mode, this article evaluates four open‑source S3‑compatible storage projects—RustFS, Garage, Ceph, and SeaweedFS—detailing their performance, licensing, maturity, and ideal use cases, and offers guidance for small teams versus large enterprises.

CephGarageMinIO alternative
0 likes · 11 min read
What’s the Best MinIO Alternative? RustFS, Garage, Ceph, and SeaweedFS Compared
Volcano Engine Developer Services
Volcano Engine Developer Services
Dec 5, 2025 · Artificial Intelligence

Why Vectors Power Scalable AI Search and How S3 Vectors Redefines Storage

This article explains how high‑dimensional vectors enable semantic AI search, compares exact and approximate nearest‑neighbor algorithms, examines the challenges of large‑scale vector storage, and evaluates AWS S3 Vectors' architecture, pricing, and hybrid solutions for cost‑effective, high‑performance retrieval.

AI semanticsANNS3 Vectors
0 likes · 17 min read
Why Vectors Power Scalable AI Search and How S3 Vectors Redefines Storage
macrozheng
macrozheng
Nov 20, 2025 · Backend Development

How to Build a Spring Boot OSS Starter Compatible with Amazon S3

This guide walks through creating a reusable Spring Boot starter that abstracts object storage services (OSS) via the Amazon S3 API, covering project setup, Maven dependencies, configuration properties, template interfaces, implementation, auto‑configuration, packaging, and a simple test case.

Amazon S3OSSStarter
0 likes · 15 min read
How to Build a Spring Boot OSS Starter Compatible with Amazon S3
JavaGuide
JavaGuide
Sep 27, 2025 · Cloud Computing

Alibaba Cloud OSS Browser 2.0: New UI and Features Compared to 1.0

The article reviews Alibaba Cloud's OSS Browser 2.0, highlighting its refreshed interface, three login options, batch and folder upload up to 10 GB, online preview/edit capabilities, and compares its stability and usability against the older 1.0 client that struggled on Windows Server.

Alibaba CloudGUI clientOSS
0 likes · 4 min read
Alibaba Cloud OSS Browser 2.0: New UI and Features Compared to 1.0
Baidu Intelligent Cloud Tech Hub
Baidu Intelligent Cloud Tech Hub
Sep 22, 2025 · Cloud Computing

How Mantle Breaks the Hierarchical Namespace Bottleneck in Cloud Object Storage

The Mantle system, presented in a SOSP'25 paper by Baidu's storage team and collaborators, delivers a distributed hierarchical namespace for cloud object storage that overcomes traditional scalability and performance limits, enabling massive data lake workloads with dramatically reduced latency and vastly increased throughput.

Distributed SystemsSOSPcloud storage
0 likes · 8 min read
How Mantle Breaks the Hierarchical Namespace Bottleneck in Cloud Object Storage
Code Wrench
Code Wrench
Jul 1, 2025 · Databases

How to Build a Low‑Cost, Scalable SQL Server Backup Using Amazon S3

This guide explains why Amazon S3 is an economical, scalable storage option for SQL Server backups and provides a step‑by‑step tutorial—including IAM policy creation, bucket setup, backup job scripting, synchronization, and cleanup—to achieve automated, reliable, and cost‑effective data protection.

AWS S3AutomationDatabase Backup
0 likes · 7 min read
How to Build a Low‑Cost, Scalable SQL Server Backup Using Amazon S3
Java Tech Enthusiast
Java Tech Enthusiast
Jun 21, 2025 · Information Security

Why Your OneDrive Account Might Get Locked and How to Protect Your Data

A Reddit user’s Microsoft account was abruptly blocked, rendering decades of OneDrive‑stored photos and files inaccessible, which underscores the risks of relying on a single cloud backup and highlights the importance of multi‑point data protection strategies like the 3‑2‑1 rule.

3-2-1 ruleOneDriveaccount lock
0 likes · 4 min read
Why Your OneDrive Account Might Get Locked and How to Protect Your Data
Architects' Tech Alliance
Architects' Tech Alliance
Jun 3, 2025 · Artificial Intelligence

Comprehensive Analysis of RDMA Technology: Principles, Features, Products, and Applications in HPC, AI, and Cloud Storage

The article provides an in‑depth technical overview of Remote Direct Memory Access (RDMA), covering its zero‑copy, kernel‑bypass, and protocol‑offload features, hardware and software ecosystems, and its impact on high‑performance computing, artificial intelligence, cloud storage, finance, and edge computing.

Hardware accelerationHigh‑performance computingNetwork Protocols
0 likes · 10 min read
Comprehensive Analysis of RDMA Technology: Principles, Features, Products, and Applications in HPC, AI, and Cloud Storage
Open Source Linux
Open Source Linux
Dec 6, 2024 · Cloud Computing

How Live Streaming Works: From Encoder to Viewer in Real Time

Live streaming faces challenges due to real‑time video transmission and heavy computation, but by using globally distributed edge servers, transcoding streams into multiple resolutions, segmenting them into short clips, packaging them into formats like HLS, caching via CDNs, and optionally storing them in the cloud, the process delivers video seamlessly to viewers and enables replay.

CDNReal-time VideoVideo Transcoding
0 likes · 2 min read
How Live Streaming Works: From Encoder to Viewer in Real Time
Baidu Tech Salon
Baidu Tech Salon
Nov 8, 2024 · Cloud Computing

Design and Evolution of Baidu Canghai Storage Unified Technology Stack

Baidu Canghai Storage’s unified technology stack—comprising a meta‑aware distributed metadata layer, a hybrid single‑node‑distributed namespace, and an online erasure‑coding data layer—delivers AI‑driven, high‑performance, low‑cost, ZB‑scale cloud storage by modularizing metadata, namespace, and data services for object, file, and block workloads.

BaiduDistributed SystemsMicroservices
0 likes · 16 min read
Design and Evolution of Baidu Canghai Storage Unified Technology Stack
Baidu Geek Talk
Baidu Geek Talk
Nov 6, 2024 · Cloud Computing

Baidu Canghai Storage Unified Technology Base: Architecture and Evolution of Metadata, Namespace, and Data Layers

Baidu’s Canghai Storage unifies metadata, hierarchical namespace, and data layers into a Meta‑Aware, three‑generation architecture that scales to trillions of metadata items and zettabyte‑scale data, using a distributed transactional KV store, single‑machine‑distributed namespace, and online erasure‑coding micro‑services to deliver high performance, low cost, and seamless scalability.

Big DataDistributed SystemsNewSQL
0 likes · 18 min read
Baidu Canghai Storage Unified Technology Base: Architecture and Evolution of Metadata, Namespace, and Data Layers
Baidu Intelligent Cloud Tech Hub
Baidu Intelligent Cloud Tech Hub
Nov 4, 2024 · Cloud Computing

How Baidu’s Unified Storage Platform Tackles AI‑Era Data Challenges

This article details Baidu’s unified storage architecture—covering its metadata, hierarchical namespace, and data layers—explaining how meta‑aware design, custom partitioning, flexible engines, and micro‑service based erasure coding together meet the scalability, performance, and cost demands of modern AI‑driven cloud storage workloads.

Microservicescloud storageerasure coding
0 likes · 17 min read
How Baidu’s Unified Storage Platform Tackles AI‑Era Data Challenges
Test Development Learning Exchange
Test Development Learning Exchange
Sep 1, 2024 · Fundamentals

Python Utility Scripts for Data Cleaning, Translation, File Sync, Cloud Backup, and More

This article presents a collection of Python utility scripts that demonstrate how to clean CSV data, translate text files, synchronize folders, upload files to S3, count directory contents, classify files by type, perform OCR on images, convert video to audio, extract images from webpages, and generate text summaries using modern libraries.

AIcloud storagedata-cleaning
0 likes · 6 min read
Python Utility Scripts for Data Cleaning, Translation, File Sync, Cloud Backup, and More
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
May 23, 2024 · Artificial Intelligence

Building a Multi‑Agent Bookmark Assistant Bot with Coze: From File Upload to AI‑Powered Search

This tutorial walks through creating a Coze bot that uses multi‑agent orchestration, memory variables, triggers, and large‑language‑model integration to upload bookmark files, extract and clean data, classify sites, generate importable HTML bookmarks, and provide AI‑driven search functionality, complete with Python code examples and deployment tips.

AI AgentsBookmark ManagementBot Development
0 likes · 24 min read
Building a Multi‑Agent Bookmark Assistant Bot with Coze: From File Upload to AI‑Powered Search
Test Development Learning Exchange
Test Development Learning Exchange
Apr 12, 2024 · Operations

Python Data Backup Scripts and Tools Overview

This article introduces various Python-based data backup techniques, covering standard library modules such as shutil, zipfile, and tarfile, as well as database dump tools like pg_dump and mysqldump, and cloud storage options using awscli or boto3, with example code snippets for each method.

OperationsPythonScripting
0 likes · 4 min read
Python Data Backup Scripts and Tools Overview
MaGe Linux Operations
MaGe Linux Operations
Mar 8, 2024 · Cloud Computing

Choosing Between Cloud, On-Premises, and Cloud‑Near Storage: Which Wins?

This article compares the advantages and disadvantages of cloud storage, on‑premises storage, and the hybrid cloud‑near storage model, explaining how each impacts scalability, cost, control, security, and integration for modern data platforms, and helps organizations select the most suitable solution.

Cost OptimizationData PlatformHybrid storage
0 likes · 12 min read
Choosing Between Cloud, On-Premises, and Cloud‑Near Storage: Which Wins?
Mike Chen's Internet Architecture
Mike Chen's Internet Architecture
Mar 6, 2024 · Cloud Computing

Understanding IaaS: Definition, Features, Core Technologies, and Application Scenarios

This article provides a comprehensive overview of IaaS, detailing its definition, core characteristics, underlying technologies such as virtualization and automation, and common use cases, while highlighting benefits like cost reduction, elasticity, high availability, and security in cloud environments.

AutomationIaaSInfrastructure as a Service
0 likes · 8 min read
Understanding IaaS: Definition, Features, Core Technologies, and Application Scenarios
Baidu Geek Talk
Baidu Geek Talk
Jan 29, 2024 · Databases

BTS (Baidu Table Storage): Architecture and Core Technologies

BTS (Baidu Table Storage) is Baidu Intelligent Cloud’s high‑performance, low‑cost semi‑structured NoSQL service that evolved from single‑table to multi‑model (wide tables, time‑series, soon documents), featuring a three‑layer compute‑storage separation architecture, multi‑level caching, hot‑backup HA, and supporting massive IoT, AI, autonomous‑driving and monitoring workloads.

BTSBaidu Table StorageDatabase Architecture
0 likes · 21 min read
BTS (Baidu Table Storage): Architecture and Core Technologies
21CTO
21CTO
Sep 20, 2023 · Information Security

How a Misconfigured Azure SAS Token Exposed 38 TB of Microsoft Data

Microsoft inadvertently exposed 38 TB of private data, including employee passwords, private keys, and over 30,000 internal Teams messages, due to a misconfigured Azure SAS token in a public GitHub repository, prompting security researchers to alert the company and prompting Microsoft to revoke the token and tighten SAS best practices.

Azure SASMicrosoftcloud storage
0 likes · 6 min read
How a Misconfigured Azure SAS Token Exposed 38 TB of Microsoft Data
Architects' Tech Alliance
Architects' Tech Alliance
Sep 17, 2023 · Cloud Computing

Trends and Market Outlook of Cloud Storage, Software‑Defined Storage, and Hyper‑Convergence

The article analyzes the rapid growth of the global cloud storage market, explains core technologies such as software‑defined storage, distributed storage and hyper‑convergence, reviews major vendors and market forecasts, and discusses the evolving ecosystem of operating systems and open‑source initiatives driving future infrastructure development.

Software-Defined Storagecloud storagehyper-convergence
0 likes · 12 min read
Trends and Market Outlook of Cloud Storage, Software‑Defined Storage, and Hyper‑Convergence
DataFunTalk
DataFunTalk
Sep 15, 2023 · Cloud Computing

Design and Architecture of Baidu CFS Large‑Scale Distributed File System and Metadata Service

The talk from DataFun Summit 2023 explains how Baidu's CFS storage builds a trillion‑file‑scale distributed file system by revisiting file system fundamentals, POSIX limitations, historical storage architectures, and introducing a lock‑free metadata service with single‑shard primitives, data‑layout optimizations, and a simplified client‑centric architecture that achieves high scalability and performance.

CFSDistributed File SystemPOSIX
0 likes · 31 min read
Design and Architecture of Baidu CFS Large‑Scale Distributed File System and Metadata Service
Baidu Intelligent Cloud Tech Hub
Baidu Intelligent Cloud Tech Hub
Sep 4, 2023 · Big Data

How Baidu’s Aries Cloud Storage Leverages Tape Libraries for Massive Cold Data Archiving

This article explains Baidu Intelligent Cloud’s tape‑library based cold‑data storage architecture, covering tape media basics, the Aries cloud storage system, its modular design, data flow, write and retrieval processes, and a real‑world deployment case that demonstrates cost‑effective petabyte‑scale archival.

ariescloud storagecold data
0 likes · 31 min read
How Baidu’s Aries Cloud Storage Leverages Tape Libraries for Massive Cold Data Archiving
Tencent Cloud Developer
Tencent Cloud Developer
Aug 31, 2023 · Cloud Computing

Tape Storage Technology: Enterprise Deep Archive and the Berg Cold‑Data Engine

Magnetic tape, once the music‑distribution workhorse, remains essential for enterprise deep‑archive thanks to its low cost, high capacity, and durability, with LTO and IBM 3592 cartridges housed in large libraries, while cloud object‑storage deep‑archive tiers and Tencent’s Berg cold‑data engine provide API‑driven ingestion, retrieval, erasure‑coding, and fault‑tolerant management for truly cold workloads that tolerate hours‑long latency.

Berg enginecloud storagecold data
0 likes · 27 min read
Tape Storage Technology: Enterprise Deep Archive and the Berg Cold‑Data Engine
Baidu Intelligent Cloud Tech Hub
Baidu Intelligent Cloud Tech Hub
Aug 29, 2023 · Cloud Computing

How Baidu CFS Scales to Billions of Files with a Lock‑Free Metadata Service

This article explains Baidu's CFS architecture for building a billion‑file‑scale distributed file system, covering basic file system concepts, POSIX limitations, metadata service modeling, performance metrics, evolution of metadata architectures, and CFS's lock‑free design that achieves high scalability, low latency, and balanced load in cloud storage.

Distributed File SystemScalabilitycloud storage
0 likes · 32 min read
How Baidu CFS Scales to Billions of Files with a Lock‑Free Metadata Service
MaGe Linux Operations
MaGe Linux Operations
Aug 4, 2023 · Information Security

How to Detect and Exploit Cloud Access Key (AK/SK) Leaks

This article explains why cloud platforms rely on access keys, describes common scenarios where AK/SK credentials are exposed, provides practical examples such as heapdump and JavaScript leaks, and shows how attackers can hijack storage buckets or execute commands on compromised cloud hosts.

AK/SKaccess key leakagecloud security
0 likes · 7 min read
How to Detect and Exploit Cloud Access Key (AK/SK) Leaks
Alibaba Cloud Developer
Alibaba Cloud Developer
Jul 10, 2023 · Cloud Computing

Choosing the Right Cloud Storage: NAS, Block, and Object Explained

This article explains common storage types—file (NAS), block (EBS), and object (OSS)—detailing their physical foundations, performance metrics, underlying Alibaba Cloud architectures, protocol differences, and ideal application scenarios to help readers select the most suitable storage solution for any workload.

Alibaba CloudNASProtocols
0 likes · 15 min read
Choosing the Right Cloud Storage: NAS, Block, and Object Explained
Architects' Tech Alliance
Architects' Tech Alliance
Jul 4, 2023 · Cloud Computing

Challenges and Emerging Trends in Cloud and Internet Storage Architecture

The article analyzes the mismatched data‑preservation cycles, performance‑reliability trade‑offs, server‑centred data‑center tax, and the need for new storage‑compute disaggregation, then outlines hardware trends such as Ethernet‑based flash, CXL, DPUs, and proposes a novel storage‑compute separation architecture to improve resource utilization, reliability, and efficiency for cloud and internet workloads.

CXLDPUHardware trends
0 likes · 12 min read
Challenges and Emerging Trends in Cloud and Internet Storage Architecture
DaTaobao Tech
DaTaobao Tech
Jun 21, 2023 · Databases

Data Space Architecture and Metadata Models

The article outlines a data‑space architecture that employs a wide‑table design with dynamic columns and dedicated metadata tables, a metadata execution engine for business‑logic mapping, upgraded SQL parsing via Druid, MySQL‑proxy protocol handling, and distributed flow control using Redis and Zookeeper to enable scalable, multi‑tenant, low‑code and cloud‑native data management.

Data SpaceDatabase designOpen Platform
0 likes · 16 min read
Data Space Architecture and Metadata Models
Open Source Linux
Open Source Linux
Jun 19, 2023 · Fundamentals

Why Dual‑RAID Beats Triple‑Replication in Distributed Storage

This article compares triple‑replication and dual‑RAID architectures for distributed storage, outlining the performance, reliability and operational drawbacks of triple‑replication and demonstrating how dual‑RAID’s local RAID plus two‑copy strategy delivers better bandwidth usage, fault isolation, and near‑all‑flash performance.

Cephcloud storagedistributed storage
0 likes · 6 min read
Why Dual‑RAID Beats Triple‑Replication in Distributed Storage
Architects' Tech Alliance
Architects' Tech Alliance
Jun 18, 2023 · Cloud Computing

Comparison of Three‑Replica and Dual‑RAID Distributed Storage Architectures

The article compares three‑replica and dual‑RAID distributed storage designs, detailing their architectures, bandwidth and latency impacts, failure isolation, “slow‑disk” issues, and overall performance, stability, and reliability advantages of dual‑RAID for modern cloud and hyper‑converged environments.

Data ProtectionRAIDcloud storage
0 likes · 7 min read
Comparison of Three‑Replica and Dual‑RAID Distributed Storage Architectures
Bilibili Tech
Bilibili Tech
Jun 16, 2023 · Mobile Development

Optimizing Video Upload Experience for Bilibili Overseas Creators: Product and Technical Improvements

Bilibili’s Q4 2022 initiative overhauled the overseas video upload flow by defining new metrics, adding progress indicators, shrinking chunk size, enabling multi‑threaded and resumable uploads with background support, which boosted submission conversion by up to 10 % and cut failure rates from 4 % to 1 %.

BilibiliVideo Uploadcloud storage
0 likes · 13 min read
Optimizing Video Upload Experience for Bilibili Overseas Creators: Product and Technical Improvements
政采云技术
政采云技术
May 31, 2023 · Frontend Development

Frontend Single File Upload to Cloud Service Storage

This article explains the common approaches for uploading a single file from a web front‑end to cloud storage, compares backend‑mediated and direct browser uploads, provides SDK and temporary‑URL examples with code, and discusses encryption, request formats, and best‑practice recommendations.

SDKcloud storagefile upload
0 likes · 16 min read
Frontend Single File Upload to Cloud Service Storage
Baidu Intelligent Cloud Tech Hub
Baidu Intelligent Cloud Tech Hub
May 25, 2023 · Cloud Native

How Baidu’s CFS Achieved Billion‑File Scale with a Lock‑Free Metadata Service

This article explains the design and evolution of Baidu Cloud File System's (CFS) metadata service, detailing how a novel lock‑free architecture and strategic data layout enable POSIX‑compatible, highly scalable storage that can handle billions of files while maintaining high performance and consistency.

Distributed File SystemScalabilitycloud storage
0 likes · 42 min read
How Baidu’s CFS Achieved Billion‑File Scale with a Lock‑Free Metadata Service
Tencent Cloud Developer
Tencent Cloud Developer
Dec 14, 2022 · Operations

Optimizing Release Efficiency and Quality for Tencent Cloud Object Storage (COS)

Tencent Cloud Object Storage (COS) boosted release efficiency and quality by evolving its architecture to YottaStore’s cluster‑percentage changes, introducing Management Zone‑based parallel upgrades, automated gray‑testing and rollbacks, streamlined concurrency controls, visualized flow‑time reductions, standardized quality checks, and a five‑level maturity model aimed at fully automated, reliable deployments.

Operational EfficiencyTencent COScloud storage
0 likes · 16 min read
Optimizing Release Efficiency and Quality for Tencent Cloud Object Storage (COS)
Architects Research Society
Architects Research Society
Dec 5, 2022 · Fundamentals

Differences Between File, Block, and Object Storage and Why Object Storage Suits the Data Explosion

As global data volumes surge toward 163 zettabytes by 2025, traditional file and block storage struggle to scale, making object storage—offering unique IDs, metadata binding, and effortless scalability—the preferred solution for handling massive, mostly unstructured data in modern cloud environments.

Data GrowthStorage Fundamentalsblock storage
0 likes · 9 min read
Differences Between File, Block, and Object Storage and Why Object Storage Suits the Data Explosion
Baidu Intelligent Cloud Tech Hub
Baidu Intelligent Cloud Tech Hub
Nov 28, 2022 · Cloud Computing

How Baidu’s ARIES Powers Exabyte-Scale Cloud Storage for Baidu Netdisk

This article presents a comprehensive overview of Baidu’s ARIES storage platform, detailing its design philosophy, architecture, key concepts, and engineering challenges, and explains how it underpins Baidu Netdisk’s massive data‑plane storage with high availability, cost‑performance trade‑offs, and robust monitoring.

Distributed SystemsResource Managementcloud storage
0 likes · 36 min read
How Baidu’s ARIES Powers Exabyte-Scale Cloud Storage for Baidu Netdisk
Baidu Geek Talk
Baidu Geek Talk
Sep 29, 2022 · Databases

Design and Challenges of TafDB: A Scalable Metadata Storage Engine for Cloud Data Lakes

TafDB, Baidu’s Spanner‑like distributed transaction database built on RocksDB and Multi‑Raft, provides a virtually unlimited metadata layer for cloud data lakes by unifying hierarchical and flat namespaces, minimizing cross‑shard transaction overhead, handling garbage collection, and employing a distributed clock, thus delivering trillion‑scale metadata capacity and tens of millions of QPS with low latency.

NamespaceScalabilityTafDB
0 likes · 21 min read
Design and Challenges of TafDB: A Scalable Metadata Storage Engine for Cloud Data Lakes
Tencent Cloud Developer
Tencent Cloud Developer
Sep 27, 2022 · Big Data

GooseFS: Accelerating Cloud Storage for Big Data and Data Lake Platforms

GooseFS, Tencent Cloud’s Hadoop‑compatible storage accelerator, adds a local NVMe‑SSD cache layer to cloud‑native data lakes, letting users boost query speeds by up to 46 % and cut backend bandwidth by 200 Gbps without code changes, as demonstrated by a music‑industry customer’s 200‑node deployment caching ten million files.

Cost reductionData LakeGooseFS
0 likes · 16 min read
GooseFS: Accelerating Cloud Storage for Big Data and Data Lake Platforms
Baidu Intelligent Cloud Tech Hub
Baidu Intelligent Cloud Tech Hub
Sep 23, 2022 · Databases

How Baidu’s TafDB Achieves Trillion‑Scale Metadata Storage with Near‑Zero Latency

This article explores the design and engineering of Baidu’s TafDB, a distributed metadata database that powers cloud object and file storage, detailing its architecture, namespace evolution, transaction optimizations, garbage collection strategies, and clock mechanisms that enable trillion‑scale metadata and millions of QPS.

Scalabilitycloud storagemetadata
0 likes · 19 min read
How Baidu’s TafDB Achieves Trillion‑Scale Metadata Storage with Near‑Zero Latency
DaTaobao Tech
DaTaobao Tech
Sep 2, 2022 · Backend Development

Debugging Cloud Storage Middleware Migration Issues in Backend Development

The team resolved export failures caused by a deprecated cloud‑storage middleware by adding a new method that generates user‑specific sub‑paths and saves files, but recognized security, configurability, and maintainability issues, recommending configurable buckets, thorough permission reviews, and reusable design when migrating third‑party dependencies.

BackendCode reviewcloud storage
0 likes · 7 min read
Debugging Cloud Storage Middleware Migration Issues in Backend Development
Programmer DD
Programmer DD
Aug 23, 2022 · Cloud Computing

How Switching S3 Compression to Zstd Cut Storage Costs by 30%

AWS switched S3 log compression from gzip to Zstandard, saving roughly 30% of storage space—equivalent to exabytes—by changing the internal storage method, a move confirmed by employee comments and a coinciding 31% price drop in late 2021.

AWSCost OptimizationS3
0 likes · 4 min read
How Switching S3 Compression to Zstd Cut Storage Costs by 30%
Baidu Geek Talk
Baidu Geek Talk
Aug 17, 2022 · Industry Insights

How Baidu Cloud Storage Solves the Four Big Challenges of the ABC Era

This article examines the massive data, cost, stability, and diversity challenges of the AI‑driven, big‑data, cloud‑first "ABC" era and explains how Baidu's Canghai storage portfolio—including BOS, CDS, CFS, PFS, RapidFS, CloudFlow, and storage gateways—addresses each issue through scalable architecture, tiered lifecycle policies, multi‑AZ disaster recovery, and integrated hybrid‑cloud solutions.

BaiduData Migrationcloud storage
0 likes · 16 min read
How Baidu Cloud Storage Solves the Four Big Challenges of the ABC Era
DataFunTalk
DataFunTalk
Aug 17, 2022 · Cloud Computing

High‑Performance Computing Storage Challenges and Baidu Canghai Storage Solutions

This article explains the storage problems faced by traditional HPC, AI‑driven HPC and high‑performance data analysis, describes Baidu's internal high‑performance storage practices, and introduces the Baidu Canghai solution—including object storage BOS, parallel file system PFS, RapidFS, data‑flow mechanisms and a customer case—demonstrating how these technologies meet the demanding throughput, latency and cost requirements of modern high‑performance workloads.

AIBaiduHigh‑performance computing
0 likes · 29 min read
High‑Performance Computing Storage Challenges and Baidu Canghai Storage Solutions
Baidu Intelligent Cloud Tech Hub
Baidu Intelligent Cloud Tech Hub
Aug 15, 2022 · Cloud Computing

How Baidu’s Canghai Storage Tackles Massive Data Challenges in the Cloud

This article outlines the four major storage challenges of the ABC era—massive scale, cost efficiency, stability, and diversity—and explains how Baidu’s Canghai storage suite, including BOS, CDS, CFS, PFS, RapidFS, CloudFlow, and storage gateways, addresses each through multi‑cloud migration, tiered lifecycle management, and robust disaster‑recovery solutions.

AIBig DataData Migration
0 likes · 15 min read
How Baidu’s Canghai Storage Tackles Massive Data Challenges in the Cloud
Baidu Geek Talk
Baidu Geek Talk
Aug 5, 2022 · Big Data

How Baidu Cloud Accelerates Data Lakes with Compute‑Storage Separation

This article analyzes Baidu Intelligent Cloud's data‑lake acceleration strategy, covering the evolution of big‑data architectures, the advantages and challenges of compute‑storage separation, the native hierarchical namespace and RapidFS cache solutions, performance test results, and recommended deployment patterns.

BOSCompute-Storage SeparationData Lake
0 likes · 17 min read
How Baidu Cloud Accelerates Data Lakes with Compute‑Storage Separation
Infra Learning Club
Infra Learning Club
Jul 29, 2022 · Fundamentals

Understanding File, Block, and Object Storage: Key Differences and Use Cases

The article explains the core concepts, advantages, and limitations of file storage, block storage, and object storage, illustrating how each format organizes data, interacts with operating systems, and fits various application scenarios such as databases, cloud services, and unstructured data handling.

block storagecloud storagedata storage fundamentals
0 likes · 6 min read
Understanding File, Block, and Object Storage: Key Differences and Use Cases
Baidu Intelligent Cloud Tech Hub
Baidu Intelligent Cloud Tech Hub
Jul 28, 2022 · Big Data

How Baidu Cloud Accelerates Data Lakes with Compute‑Storage Separation

This article explains Baidu Intelligent Cloud’s data lake acceleration solution, covering the evolution of big‑data technologies, the benefits and challenges of compute‑storage separation, the architecture of BOS object storage, and the native hierarchical namespace and RapidFS cache mechanisms that boost performance and reduce costs.

BOSBig DataCompute-Storage Separation
0 likes · 18 min read
How Baidu Cloud Accelerates Data Lakes with Compute‑Storage Separation
Baidu Geek Talk
Baidu Geek Talk
Jul 26, 2022 · Industry Insights

How Baidu’s Canghai Storage Powers High‑Performance Computing: Challenges and Solutions

This article analyzes the storage challenges of high‑performance computing—including traditional HPC, AI‑driven HPC, and high‑performance data analysis—examines Baidu’s internal practices, and presents the Canghai storage platform with its object storage, parallel file system (PFS) and RapidFS solutions that address throughput, latency, and scalability requirements.

AI trainingHigh‑performance computingcloud storage
0 likes · 31 min read
How Baidu’s Canghai Storage Powers High‑Performance Computing: Challenges and Solutions
Baidu Intelligent Cloud Tech Hub
Baidu Intelligent Cloud Tech Hub
Jul 21, 2022 · Cloud Computing

How Baidu’s Cloud Storage Powers High‑Performance Computing and AI Workloads

This article explains the storage challenges of high‑performance computing—including traditional HPC, AI‑driven HPC, and HPDA—then details Baidu’s unified storage platform, object storage BOS, and runtime solutions PFS and RapidFS, illustrating their architecture, features, and a real‑world autonomous‑driving customer case.

AI trainingData Lakecloud storage
0 likes · 29 min read
How Baidu’s Cloud Storage Powers High‑Performance Computing and AI Workloads
Baidu Intelligent Cloud Tech Hub
Baidu Intelligent Cloud Tech Hub
Jun 30, 2022 · Big Data

Why Data Lakes Need Data Warehouses: Evolution of Modern Data Platforms

This article traces the evolution of enterprise data platforms—from early data warehouses to modern data lakes and the emerging lakehouse—detailing key technologies, challenges, and best practices for storage, compute engines, metadata, and integration, while highlighting how cloud-native object storage reshapes scalability and cost.

Big DataData LakeData Warehouse
0 likes · 27 min read
Why Data Lakes Need Data Warehouses: Evolution of Modern Data Platforms
NetEase Cloud Music Tech Team
NetEase Cloud Music Tech Team
Jun 23, 2022 · Cloud Computing

Adoption of Curve Block Storage for NetEase Cloud Music: Background, Challenges, and Benefits

Facing Ceph’s high latency, jitter, and upgrade constraints, NetEase Cloud Music adopted Curve block storage, which delivers over twice the IOPS, sub‑2‑second upgrade pauses, and stable, low‑latency I/O across 40 GB‑4 TB volumes, enabling the platform to meet its 99.99% availability SLA for billions of users while supporting future cloud‑native services.

CephCurveOpenStack
0 likes · 10 min read
Adoption of Curve Block Storage for NetEase Cloud Music: Background, Challenges, and Benefits
Big Data Technology Architecture
Big Data Technology Architecture
Jun 5, 2022 · Big Data

Introduction to Data Lake Concepts, Capabilities, and Applications

This article explains the origin and definition of data lakes, describes their ability to store structured, semi‑structured and unstructured data at any scale on‑premises or in the cloud, outlines essential lake capabilities such as unified storage, raw‑data preservation, scalable compute, metadata and security management, and compares data lakes with data warehouses and lakehouse architectures through real‑world cloud‑native examples.

cloud storagemetadata management
0 likes · 16 min read
Introduction to Data Lake Concepts, Capabilities, and Applications
Python Programming Learning Circle
Python Programming Learning Circle
May 24, 2022 · Operations

Automating Legal Evidence Management: How a Programmer Works Only 10 Minutes a Day

A law‑firm IT specialist shared on Reddit how he built a PowerShell script that scans, hashes, uploads, and re‑hashes electronic evidence to the cloud, allowing him to complete his full workload in just ten minutes each day while earning nearly $90,000 annually, sparking ethical debate about automation in the workplace.

AutomationEthicsLegal tech
0 likes · 8 min read
Automating Legal Evidence Management: How a Programmer Works Only 10 Minutes a Day
Wukong Talks Architecture
Wukong Talks Architecture
May 18, 2022 · Frontend Development

How I Restored My Website After a jsDelivr CDN Outage

The article recounts a real‑world CDN failure that broke the passjava.cn site, explains what a CDN is, and details step‑by‑step recovery actions—including downloading assets, using alternative storage, and refreshing a new CDN—to get the site back online.

CDNJsDelivrcloud storage
0 likes · 5 min read
How I Restored My Website After a jsDelivr CDN Outage
Programmer DD
Programmer DD
Apr 13, 2022 · Cloud Computing

How to Deploy and Use Cloudreve: An Open‑Source Multi‑Cloud File System

This article introduces Cloudreve, an open‑source public cloud file system supporting multiple storage backends, outlines its key features such as direct client uploads, Aria2 integration, WebDAV, and provides step‑by‑step deployment and build instructions for Linux and Windows environments.

CloudreveDeploymentcloud storage
0 likes · 4 min read
How to Deploy and Use Cloudreve: An Open‑Source Multi‑Cloud File System
Alibaba Cloud Developer
Alibaba Cloud Developer
Apr 6, 2022 · Cloud Computing

Build a Private, Unlimited-Speed Cloud Drive with KodBox on Serverless

Learn how to create a personal, unlimited‑speed cloud drive that mimics a Windows experience by deploying the open‑source KodBox file manager on Alibaba Cloud Function Compute, using NAS for persistent storage and OSS for object storage, with detailed cost analysis and step‑by‑step deployment guide.

Alibaba CloudKodBoxNAS
0 likes · 9 min read
Build a Private, Unlimited-Speed Cloud Drive with KodBox on Serverless
Architects' Tech Alliance
Architects' Tech Alliance
Mar 26, 2022 · Fundamentals

A Comprehensive Overview of Storage Development History, Types, and Models

This article provides a detailed overview of the evolution of storage systems—from single‑node to cloud‑native—covering fundamental concepts, data classifications, storage types such as block, file, and object, and examples like Rook and MinIO, aimed at beginners seeking a macro understanding of storage architecture.

Data ModelsDistributed Systemsblock storage
0 likes · 8 min read
A Comprehensive Overview of Storage Development History, Types, and Models
NetEase Cloud Music Tech Team
NetEase Cloud Music Tech Team
Mar 16, 2022 · Databases

RDB: Cloud Music's Customized Algorithm Feature KV Storage System Based on RocksDB

To meet Cloud Music’s massive algorithm‑feature KV storage needs, the team built RDB—a RocksDB‑based engine within Tair—adding bulk‑load, dual‑version imports, KV‑separation, in‑place sequence appends and protobuf field updates, cutting storage cost, write amplification and latency while scaling to billions of records and millions of QPS.

Algorithm FeaturesKV SeparationKV storage
0 likes · 16 min read
RDB: Cloud Music's Customized Algorithm Feature KV Storage System Based on RocksDB
Baidu Geek Talk
Baidu Geek Talk
Feb 9, 2022 · Backend Development

Business Storage Resource Optimization for Baidu Mobile Assistant: Process, Analysis, and Lessons

The author describes how Baidu Mobile Assistant’s massive storage costs were slashed by analyzing billions of objects, identifying low‑value incremental patch files, and applying a three‑step “elephant‑in‑the‑fridge” cleanup that reduced petabytes to a few hundred terabytes while highlighting the need for ROI‑driven update policies and ongoing documentation.

BOScloud storagedata analysis
0 likes · 9 min read
Business Storage Resource Optimization for Baidu Mobile Assistant: Process, Analysis, and Lessons
Tencent Tech
Tencent Tech
Jan 25, 2022 · Cloud Native

How Tencent Built YottaStore: A Cloud‑Native Storage Revolution

This article recounts Tencent's journey from its legacy TFS storage to the cloud‑native YottaStore system, detailing the technical challenges, strategic decisions, team dynamics, and the impressive scalability, reliability, and cost benefits that now power major Tencent services and external customers.

ScalabilityTencentYottaStore
0 likes · 14 min read
How Tencent Built YottaStore: A Cloud‑Native Storage Revolution
21CTO
21CTO
Jan 6, 2022 · Backend Development

Mastering Web Application Architecture: Key Components and Best Practices

This article explains the fundamentals of web application architecture, detailing components such as user agents, DNS, load balancers, virtual machines, web servers, databases, caching services, CDNs, external and cloud storage, as well as best‑practice guidelines for building efficient, scalable, and secure backend systems.

Backend Developmentcachingcloud storage
0 likes · 10 min read
Mastering Web Application Architecture: Key Components and Best Practices
21CTO
21CTO
Oct 18, 2021 · Big Data

Why Unstructured Big Data Storage Must Be Central to Your IT Strategy

The article explains how unstructured big data, often overlooked, should become a strategic focus, highlighting its massive unused value, the challenges of cataloging and moving petabyte‑scale datasets, and why agile cloud‑based storage is essential for modern enterprises.

Data Managementcloud storagestorage strategy
0 likes · 7 min read
Why Unstructured Big Data Storage Must Be Central to Your IT Strategy
Programmer DD
Programmer DD
Oct 11, 2021 · Information Security

How to Secure Your Files with Cryptomator: A Step‑by‑Step Guide

This article introduces the open‑source encryption tool Cryptomator, explains its security‑first design, and provides a detailed, illustrated walkthrough for creating a vault, adding files, and locking the vault so that sensitive data stays hidden on local disks and cloud drives.

Cryptomatorcloud storagedata security
0 likes · 4 min read
How to Secure Your Files with Cryptomator: A Step‑by‑Step Guide
Efficient Ops
Efficient Ops
Aug 31, 2021 · Cloud Computing

Why Object Storage Is the New Backbone of Cloud Data Management

This article explains how object storage emerged as a cloud-native solution that surpasses traditional DAS, SAN, and NAS architectures by offering virtually unlimited capacity, robust metadata handling, and simple RESTful APIs for modern applications and large‑scale data workloads.

Data ArchitectureScalabilitycloud storage
0 likes · 11 min read
Why Object Storage Is the New Backbone of Cloud Data Management
Qingyun Technology Community
Qingyun Technology Community
Aug 3, 2021 · Cloud Computing

How QingStor’s Object Storage Architecture Powers Massive Data Scalability

This article explains QingStor's object storage concepts, core advantages, global data model, subsystem design, massive small‑file optimizations, key features like lifecycle management and cross‑region replication, and showcases a traffic‑industry use case, highlighting its scalability, reliability, and ease of integration.

Data ArchitectureQingStorScalability
0 likes · 20 min read
How QingStor’s Object Storage Architecture Powers Massive Data Scalability
Architects' Tech Alliance
Architects' Tech Alliance
Jun 30, 2021 · Cloud Computing

Ceph Distributed Storage System: Architecture, CRUSH Algorithm, and Backend Evolution

This article provides a comprehensive overview of Ceph, covering its origins, cluster architecture, CRUSH data placement algorithm, unified access interfaces, the transition from FileStore to BlueStore, and emerging storage back‑ends such as SeaStore and PFStore, highlighting performance characteristics and design trade‑offs.

BlueStoreCRUSHCeph
0 likes · 12 min read
Ceph Distributed Storage System: Architecture, CRUSH Algorithm, and Backend Evolution
Huolala Tech
Huolala Tech
May 14, 2021 · Cloud Native

Mount S3 as a Filesystem in Kubernetes with s3fs-fuse and DaemonSet

This article explains how to use FUSE‑based s3fs to mount an Amazon S3 bucket as a regular filesystem inside Kubernetes pods via a DaemonSet, covering background, FUSE principles, implementation steps, Docker image creation, ConfigMap and DaemonSet configuration, and performance trade‑offs.

ContainerDaemonSetFUSE
0 likes · 14 min read
Mount S3 as a Filesystem in Kubernetes with s3fs-fuse and DaemonSet
Liangxu Linux
Liangxu Linux
Feb 16, 2021 · Fundamentals

How to Enable Automatic Image Upload in Typora with Plugins and Built‑in Support

This guide explains how to add image‑upload capabilities to the Typora markdown editor by using a third‑party Windows plugin, configuring the built‑in upload feature, and setting up various image hosting services such as GitHub, self‑hosted servers, Tencent OSS, Aliyun OSS, Qiniu, Gitee, and SM.MS, complete with installation steps and code snippets.

Typoracloud storageimage upload
0 likes · 11 min read
How to Enable Automatic Image Upload in Typora with Plugins and Built‑in Support
21CTO
21CTO
Feb 4, 2021 · Cloud Computing

Top 10 Free Cloud Storage Services for Developers: Pros, Cons, and Security

Explore ten free cloud storage options, each detailed with advantages, drawbacks, and security considerations, helping developers choose the most suitable platform without spending money, while comparing bandwidth limits, file size restrictions, and cross‑device compatibility to find the best fit for personal or project needs.

cloud storagedata backupdeveloper tools
0 likes · 9 min read
Top 10 Free Cloud Storage Services for Developers: Pros, Cons, and Security
Big Data Technology & Architecture
Big Data Technology & Architecture
Jan 28, 2021 · Big Data

Understanding Data Lakes: Definitions, Benefits, Architectures, and Technology Choices

Data lakes, emerging since 2020, are centralized repositories that store structured and unstructured data at any scale, offering flexible analytics, but require robust management to avoid becoming data swamps; this article explains definitions, advantages, typical architectures, and compares cloud and open‑source solutions such as AWS Lake Formation, Alibaba Cloud, Delta, Iceberg, and Hudi.

AnalyticsBig Datacloud storage
0 likes · 13 min read
Understanding Data Lakes: Definitions, Benefits, Architectures, and Technology Choices
JD Cloud Developers
JD Cloud Developers
Dec 8, 2020 · Cloud Computing

How JD Cloud’s Self‑Developed Cloud Disk Powers Ultra‑Stable Mega‑Sale Events

This article reveals how JD Intelligent Cloud’s in‑house cloud disk delivers ultra‑low latency, high‑availability block storage for massive sales events, detailing its Raft‑based architecture, three‑replica consistency, NVMe caching, incremental snapshots, and smart‑NIC acceleration that together ensure 99.99999% data reliability across diverse workloads.

NVMe cachingRaft Protocolcloud storage
0 likes · 7 min read
How JD Cloud’s Self‑Developed Cloud Disk Powers Ultra‑Stable Mega‑Sale Events
Efficient Ops
Efficient Ops
Nov 23, 2020 · Information Security

When Revenge Becomes a Crime: A Programmer’s OBS Sabotage Case

A Chinese programmer, angry over an unpaid bonus, altered Huawei Cloud OBS storage code, crippling a consumer platform for hours, leading to a conviction for destroying a computer information system and sparking a broader discussion on developer respect and security safeguards.

cloud storagecode sabotagedeveloper misconduct
0 likes · 7 min read
When Revenge Becomes a Crime: A Programmer’s OBS Sabotage Case
Tencent Tech
Tencent Tech
Nov 19, 2020 · Cloud Computing

How Tencent Built a Massive Cloud Storage System to Power QQ Album and Beyond

This article chronicles Tencent's journey from the early development of the TFS distributed storage platform to large‑scale data migrations, flexible bandwidth strategies, and the creation of the cloud‑native YottaStore, illustrating how a small architecture team solved massive storage challenges for billions of users.

Big DataData MigrationYottaStore
0 likes · 15 min read
How Tencent Built a Massive Cloud Storage System to Power QQ Album and Beyond
Architects' Tech Alliance
Architects' Tech Alliance
Nov 9, 2020 · Cloud Computing

Ceph Storage Architecture: Overview, Cluster Design, Client Interfaces, and Encryption

This article provides a comprehensive technical overview of Red Hat Ceph, covering its distributed storage architecture, cluster components, storage pool types, authentication, placement algorithms, I/O paths, replication and erasure‑coding strategies, internal management operations, high‑availability mechanisms, client libraries, data striping, and encryption details.

CRUSHCephData Striping
0 likes · 39 min read
Ceph Storage Architecture: Overview, Cluster Design, Client Interfaces, and Encryption
JD Cloud Developers
JD Cloud Developers
Nov 2, 2020 · Cloud Computing

Why Edge Storage Beats Central Cloud Storage for Real‑Time IoT Data

As IoT and 5G drive massive data generation, edge storage emerges as a flexible solution that reduces latency, lowers bandwidth costs, and enhances data security compared to traditional central cloud storage, especially for video surveillance and other high‑throughput scenarios.

Edge StorageIoTbandwidth cost
0 likes · 7 min read
Why Edge Storage Beats Central Cloud Storage for Real‑Time IoT Data
Alibaba Cloud Developer
Alibaba Cloud Developer
Oct 11, 2020 · Operations

How Alibaba’s SLS Powers a Unified Observability Platform for Massive Data

Alibaba Cloud’s Log Service (SLS) has evolved into a unified observability middle‑platform that handles tens of petabytes daily, offering integrated storage, processing, and AI‑driven analysis for logs, metrics, and traces, while addressing challenges of data ingestion, performance, and scalability across diverse Ops scenarios.

Big DataLog AnalyticsObservability
0 likes · 16 min read
How Alibaba’s SLS Powers a Unified Observability Platform for Massive Data