What’s New in PyTorch 1.8? Key Features, APIs, and Performance Boosts

PyTorch 1.8, released by the PyTorch team, bundles over 3,000 commits since 1.7, introducing AMD ROCm support, enhanced Python function conversion, stable FFT and linear‑algebra APIs, complex‑tensor autograd, distributed‑training improvements, new mobile tutorials, performance tools, and several prototype features.

MaGe Linux Operations
MaGe Linux Operations
MaGe Linux Operations
What’s New in PyTorch 1.8? Key Features, APIs, and Performance Boosts

The PyTorch team announced PyTorch 1.8, integrating more than 3,000 commits since the 1.7 release and delivering major updates in compilation, code optimization, and scientific‑computing front‑end APIs, with new support for AMD ROCm.

Key highlights include:

Support for Python function conversion.

Stabilized APIs for FFT (torch.fft) and linear‑algebra functions (torch.linalg).

Complex‑tensor autograd support.

Performance improvements for Hessian and Jacobian matrix calculations.

Enhanced distributed training: more reliable NCCL, pipeline parallelism, RPC profiling, and gradient‑compression communication hooks.

Since PyTorch 1.6, features are classified as Stable, Beta, or Prototype.

Major updates were also made to auxiliary libraries such as TorchCSPRNG, TorchVision, TorchText, and TorchAudio.

New and Updated APIs

torch.fft – NumPy‑compatible FFT operations

The FFT feature, introduced as Beta in 1.7, becomes stable in 1.8, offering hardware‑accelerated, autograd‑compatible FFT functionality comparable to NumPy’s np.fft.

torch.linalg – NumPy‑style linear algebra functions

torch.linalg mirrors NumPy’s np.linalg module, providing operations such as Cholesky decomposition, determinant, and eigenvalue calculations.

Python code conversion with torch.fx

This Beta feature enables Python‑level code transformation, allowing developers to perform Conv/BN fusion, graph‑mode quantization, vmap implementation, and other custom graph transformations using torch.fx.

Distributed Training Enhancements

Pipeline parallelism

A new Beta API makes it easy to incorporate pipeline parallelism into the training loop.

DDP communication hooks

DDP communication hooks provide a generic interface for controlling gradient communication between workers.

Additional prototype features include:

ZeroRedundancyOptimizer – reduces per‑thread memory usage.

Process Group NCCL Send/Recv – enables collective operations at the Python level.

CUDA‑support in RPC using TensorPipe – speeds up RPC on multi‑GPU machines.

Remote Module – allows remote workers to be manipulated like local modules.

PyTorch Mobile Support

New mobile tutorials demonstrate image‑segmentation (DeepLabV3) on iOS and Android, and the release includes demo apps for image segmentation, object detection, neural machine translation, question answering, and vision transformers.

The Mobile Lite Interpreter is also introduced, reducing the runtime binary size.

Performance Optimizations

Benchmark utilities are added, and a new automatic quantization API – FX Graph Mode Quantization – is made available.

Hardware Support

Two Beta features expand the PyTorch dispatcher for new C++ backends and add AMD ROCm support, which is currently limited to Linux systems.

Reference links:

https://pytorch.org/blog/pytorch-1.8-released/

https://github.com/pytorch/pytorch

https://pytorch.org/

https://twitter.com/cHHillee/status/1367621538791317504

Original Source

Signed-in readers can open the original source through BestHub's protected redirect.

Sign in to view source
Republication Notice

This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactadmin@besthub.devand we will review it promptly.

MobileDeep LearningGPUPyTorchVersion Release
MaGe Linux Operations
Written by

MaGe Linux Operations

Founded in 2009, MaGe Education is a top Chinese high‑end IT training brand. Its graduates earn 12K+ RMB salaries, and the school has trained tens of thousands of students. It offers high‑pay courses in Linux cloud operations, Python full‑stack, automation, data analysis, AI, and Go high‑concurrency architecture. Thanks to quality courses and a solid reputation, it has talent partnerships with numerous internet firms.

0 followers
Reader feedback

How this landed with the community

Sign in to like

Rate this article

Was this worth your time?

Sign in to rate
Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.