Tagged articles
5 articles
Page 1 of 1
Baobao Algorithm Notes
Baobao Algorithm Notes
May 12, 2025 · Artificial Intelligence

Why Dropout Is Dropped in Large‑Scale Model Training: Effects, Efficiency, Stability

Training massive AI models now commonly omits dropout because its original scaling trick fails to match training and inference distributions, leading to poorer performance, higher computational cost, and instability, while alternative regularization like normalization remains useful, as illustrated by practical observations and historical tricks.

AI stabilityDropoutRegularization
0 likes · 6 min read
Why Dropout Is Dropped in Large‑Scale Model Training: Effects, Efficiency, Stability
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Jul 26, 2023 · Artificial Intelligence

Building and Training a Fully Connected Neural Network for Fashion-MNIST Classification with PyTorch

This tutorial demonstrates how to download the Fashion‑MNIST dataset, build a four‑layer fully connected neural network with PyTorch, and train it using loss functions, Adam optimizer, learning‑rate strategies, and Dropout to achieve high‑accuracy multi‑class image classification.

AdamDeep LearningDropout
0 likes · 17 min read
Building and Training a Fully Connected Neural Network for Fashion-MNIST Classification with PyTorch
Baobao Algorithm Notes
Baobao Algorithm Notes
Dec 16, 2021 · Artificial Intelligence

Boost Model Robustness with 5 Lines of R‑Drop Contrastive Learning

This article introduces a simple five‑line implementation of R‑Drop, a contrastive self‑supervised learning technique that leverages dropout‑induced perturbations to improve model robustness, explains the underlying principle, provides the exact PyTorch code, and compares it with the ConSERT method.

DropoutPyTorchcontrastive learning
0 likes · 5 min read
Boost Model Robustness with 5 Lines of R‑Drop Contrastive Learning
Code DAO
Code DAO
Dec 5, 2021 · Artificial Intelligence

Why DropBlock Outperforms Dropout as an Image Regularizer

This article demonstrates how to implement DropBlock in PyTorch, explains why Dropout fails on image data, details the gamma calculation and mask generation, and shows visual comparisons that illustrate the superiority of contiguous region dropping over random pixel dropout.

Computer VisionDeep LearningDropBlock
0 likes · 11 min read
Why DropBlock Outperforms Dropout as an Image Regularizer