Alibaba Cloud Big Data AI Platform
Jan 17, 2025 · Artificial Intelligence
How BladeDISC++ Cuts Memory Peaks for Dynamic‑Shape Deep Learning Models
This article explains the challenges of dynamic‑shape deep learning workloads and introduces BladeDISC++, an AI compiler that uses symbolic shape graphs, operation scheduling, and just‑in‑time auto‑rematerialization to dramatically reduce GPU memory peaks while maintaining training throughput.
AI compilerBladeDISC++LLM training
0 likes · 16 min read
