Tagged articles
1 articles
Page 1 of 1
Volcano Engine Developer Services
Volcano Engine Developer Services
Mar 16, 2022 · Artificial Intelligence

How veGiantModel Boosts Large Language Model Training Up to 6.9× Faster

The article introduces Volcano Engine's veGiantModel, a high‑performance large‑model training framework built on PyTorch, Megatron and DeepSpeed, details its distributed parallel strategies, hardware setups, benchmark results showing up to 6.9× speedup over Megatron and DeepSpeed, and provides open‑source links for further use.

ByteCCLDistributed Traininglarge language models
0 likes · 6 min read
How veGiantModel Boosts Large Language Model Training Up to 6.9× Faster