Tagged articles
2 articles
Page 1 of 1
DataFunSummit
DataFunSummit
Sep 23, 2025 · Artificial Intelligence

How PCache Supercharges Large‑Scale AI Training Storage Performance

This talk explores large‑scale AI training storage challenges and presents PCache, a high‑performance, cloud‑native caching system that optimizes metadata, read/write paths, deployment, and high‑availability, delivering significant throughput gains and cost savings for massive model training workloads.

AI trainingPCacheStorage Optimization
0 likes · 25 min read
How PCache Supercharges Large‑Scale AI Training Storage Performance
AntData
AntData
Mar 7, 2025 · Artificial Intelligence

Design and Implementation of a Cloud‑Native AI Storage Acceleration System (PCache) for Large‑Scale Model Training

This article examines the challenges of AI storage for massive models, describes Ant Group's multi‑cloud, high‑availability PCache architecture, and details its GPU‑mixed deployment, metadata services, data‑link optimizations, and performance results that enable petabyte‑scale training with low cost and high stability.

AI storagePCachePerformance Optimization
0 likes · 19 min read
Design and Implementation of a Cloud‑Native AI Storage Acceleration System (PCache) for Large‑Scale Model Training