Tagged articles
1 articles
Page 1 of 1
Alibaba Cloud Developer
Alibaba Cloud Developer
Dec 9, 2017 · Artificial Intelligence

How to Train Deeper TensorFlow Models by Optimizing GPU Memory

This article summarizes an NIPS 2017 paper that introduces GPU memory‑optimization techniques—swap‑out/in and a memory‑efficient attention layer—integrated into TensorFlow, enabling significantly larger batch sizes and deeper models without sacrificing accuracy.

Deep LearningGPU memory optimizationNIPS 2017
0 likes · 8 min read
How to Train Deeper TensorFlow Models by Optimizing GPU Memory