AI Explorer
AI Explorer
Mar 16, 2026 · Artificial Intelligence

HyperOffload: A New Storage Paradigm Aiming to Break the AI Memory Wall

HyperOffload, a joint effort by Shanghai Jiao Tong University and Huawei’s MindSpore team, proposes a dynamic tensor offloading system that moves data between GPU memory, CPU RAM, and SSDs, aiming to overcome the “memory wall” that limits trillion‑parameter AI model training and deployment.

AI infrastructureAI memory wallGPU memory management
0 likes · 6 min read
HyperOffload: A New Storage Paradigm Aiming to Break the AI Memory Wall
Baobao Algorithm Notes
Baobao Algorithm Notes
Jun 3, 2025 · Artificial Intelligence

How to Train a 671B‑Scale Model with RL: Insights from a verl Internship

This article shares a detailed, first‑hand analysis of the technical challenges, framework choices, memory management, weight conversion, precision alignment, and efficiency optimizations encountered while building reinforcement‑learning pipelines for a 671‑billion‑parameter model using the verl ecosystem.

GPU memory managementMegatronModel Parallelism
0 likes · 16 min read
How to Train a 671B‑Scale Model with RL: Insights from a verl Internship