Tagged articles
197 articles
Page 2 of 2
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Aug 24, 2023 · Artificial Intelligence

Neural Style Transfer with PyTorch: Theory and Implementation

This article introduces neural style transfer, explains its underlying principles using VGG19 feature extraction, content and style loss definitions, and provides a complete PyTorch implementation with code for loading images, extracting features, computing Gram matrices, and optimizing the output image.

Computer VisionDeep LearningPyTorch
0 likes · 14 min read
Neural Style Transfer with PyTorch: Theory and Implementation
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Aug 13, 2023 · Artificial Intelligence

Image Mosaic Removal Using Autoencoder and UNet in PyTorch

This article explains the principle behind using deep‑learning autoencoders and UNet architectures to reconstruct mosaicked images, provides a complete PyTorch implementation with dataset preparation, network definition, training, and inference, and demonstrates the restored results.

AutoencoderImage ProcessingMosaic Removal
0 likes · 13 min read
Image Mosaic Removal Using Autoencoder and UNet in PyTorch
iQIYI Technical Product Team
iQIYI Technical Product Team
Aug 11, 2023 · Artificial Intelligence

Debugging Random OOM Issues in PyTorch Distributed Training on A100 Clusters

The iQIYI backend team traced random OOM crashes in PyTorch Distributed Data Parallel on an A100 cluster to a malformed DDP message injected by a security scan, which forced a near‑terabyte allocation; using jemalloc for diagnostics, they mitigated the issue by adjusting scan policies and collaborating with PyTorch to harden the protocol.

Distributed TrainingMemory DebuggingOOM
0 likes · 9 min read
Debugging Random OOM Issues in PyTorch Distributed Training on A100 Clusters
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Jul 26, 2023 · Artificial Intelligence

Building and Training a Fully Connected Neural Network for Fashion-MNIST Classification with PyTorch

This tutorial demonstrates how to download the Fashion‑MNIST dataset, build a four‑layer fully connected neural network with PyTorch, and train it using loss functions, Adam optimizer, learning‑rate strategies, and Dropout to achieve high‑accuracy multi‑class image classification.

AdamDeep LearningDropout
0 likes · 17 min read
Building and Training a Fully Connected Neural Network for Fashion-MNIST Classification with PyTorch
Nightwalker Tech
Nightwalker Tech
Jul 19, 2023 · Artificial Intelligence

Step‑by‑Step Implementation of Transformer Blocks, Attention, Normalization, Feed‑Forward, Encoder and Decoder in PyTorch

This article provides a comprehensive tutorial on building the core components of a Transformer model—including multi‑head attention, layer normalization, feed‑forward networks, encoder and decoder layers—and assembles them into a complete PyTorch implementation, supplemented with explanatory diagrams and runnable code.

DecoderDeep LearningEncoder
0 likes · 13 min read
Step‑by‑Step Implementation of Transformer Blocks, Attention, Normalization, Feed‑Forward, Encoder and Decoder in PyTorch
Nightwalker Tech
Nightwalker Tech
Jul 18, 2023 · Artificial Intelligence

Implementing the Input Processing Layer of a Transformer Model: Tokenization, Embedding, and Positional Encoding

This article explains how to build the input processing stage of a Transformer—including tokenization with Hugging Face tokenizers, token‑to‑embedding conversion using BERT models, custom BPE tokenizers, and positional encoding—providing complete Python code examples and test results.

BPEEmbeddingPositional Encoding
0 likes · 14 min read
Implementing the Input Processing Layer of a Transformer Model: Tokenization, Embedding, and Positional Encoding
Ctrip Technology
Ctrip Technology
May 18, 2023 · Artificial Intelligence

LSTM‑Based Advertising Inventory Forecasting with Embedding and Incremental Training at Ctrip

This article presents Ctrip's end‑to‑end solution for precise ad‑inventory forecasting using an LSTM model combined with entity embedding, covering data preprocessing, K‑means clustering, model architecture, offline‑online incremental training, early‑stop mechanisms, evaluation metrics, and Python service deployment.

EmbeddingLSTMPyTorch
0 likes · 19 min read
LSTM‑Based Advertising Inventory Forecasting with Embedding and Incremental Training at Ctrip
Baidu Geek Talk
Baidu Geek Talk
Apr 19, 2023 · Artificial Intelligence

Why Does Recompute Crash Distributed Training? A Deep Dive into Checkpoint Issues and Fixes

When training large‑batch deep learning models, developers often use recompute to trade computation for memory, but in dynamic graph frameworks this can trigger synchronization errors in distributed data parallel training; the article explains the underlying DDP mechanics, illustrates the error, and offers a practical no_sync workaround with code examples.

CheckpointDistributed TrainingPyTorch
0 likes · 14 min read
Why Does Recompute Crash Distributed Training? A Deep Dive into Checkpoint Issues and Fixes
DataFunTalk
DataFunTalk
Apr 3, 2023 · Artificial Intelligence

Implementing RNN, LSTM, and GRU with PyTorch

This article introduces the basic architectures of recurrent neural networks (RNN), LSTM, and GRU, explains PyTorch APIs such as nn.RNN, nn.LSTM, nn.GRU, details their parameters, demonstrates code examples for building and testing these models, and provides practical insights for deep learning practitioners.

Deep LearningGRULSTM
0 likes · 9 min read
Implementing RNN, LSTM, and GRU with PyTorch
DataFunTalk
DataFunTalk
Apr 3, 2023 · Artificial Intelligence

Large‑Scale Recommendation System Training with TorchRec and Dynamic Embedding

This article explains how Tencent’s AI team leverages the PyTorch‑based TorchRec library and a custom dynamic embedding solution to train billion‑scale recommendation models efficiently, detailing the benefits of TorchRec, GPU embedding, optimized kernels, embedding partition strategies, experimental results, and practical deployment guidance.

GPU EmbeddingLarge-Scale TrainingPyTorch
0 likes · 15 min read
Large‑Scale Recommendation System Training with TorchRec and Dynamic Embedding
Python Programming Learning Circle
Python Programming Learning Circle
Mar 22, 2023 · Artificial Intelligence

Overview of PyTorch 2.0 Features and New APIs

The article provides a detailed overview of PyTorch 2.0, highlighting its stable and beta features such as torch.compile, accelerated transformers, MPS backend, new quantization support, and prototype parallelism tools, while emphasizing performance improvements for dynamic shapes, distributed training, and CPU/GPU inference.

AIAccelerated TransformersDeep Learning
0 likes · 6 min read
Overview of PyTorch 2.0 Features and New APIs
58 Tech
58 Tech
Jan 12, 2023 · Artificial Intelligence

Efficient Conformer for End‑to‑End Speech Recognition: Model, Implementation, Streaming Inference, and Experimental Results

This article presents a comprehensive overview of the Efficient Conformer model for large‑scale end‑to‑end speech recognition, detailing its architectural improvements such as progressive downsampling and grouped multi‑head self‑attention, the PyTorch implementation in WeNet, streaming inference handling, experimental CER gains on AISHELL‑1 and production data, and future development plans.

ASREfficient ConformerModel Optimization
0 likes · 16 min read
Efficient Conformer for End‑to‑End Speech Recognition: Model, Implementation, Streaming Inference, and Experimental Results
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Dec 9, 2022 · Artificial Intelligence

What’s New in BladeDISC 0.3.0? Boosting PyTorch 2.0, GPU/CPU Optimizations, and Quantization

BladeDISC 0.3.0 introduces full PyTorch 2.0 compilation support, new TorchDynamo optimizations, extensive GPU memory‑intensive compute enhancements, Shape Constraint IR, experimental quantization across multiple hardware platforms, and a suite of compiler‑level improvements for training and inference acceleration.

BladeDISCGPU OptimizationMLIR
0 likes · 11 min read
What’s New in BladeDISC 0.3.0? Boosting PyTorch 2.0, GPU/CPU Optimizations, and Quantization
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Nov 16, 2022 · Artificial Intelligence

Deep Learning Semantic Segmentation: FCN Source Code Analysis

This tutorial walks through the complete FCN pipeline for semantic segmentation, covering VOC dataset loading, data augmentation, collate functions, model construction, training loops, loss computation with cross‑entropy (including ignore‑index handling), and inference, while providing full PyTorch code snippets for each step.

FCNPyTorchVOC dataset
0 likes · 19 min read
Deep Learning Semantic Segmentation: FCN Source Code Analysis
Python Programming Learning Circle
Python Programming Learning Circle
Oct 20, 2022 · Artificial Intelligence

Overview of Common Python AI Libraries with Code Examples

This article provides a concise introduction to a wide range of popular Python libraries for artificial intelligence and data science, such as NumPy, OpenCV, scikit-image, Pillow, Scikit-learn, TensorFlow, PyTorch, and many others, accompanied by practical code snippets and performance comparisons.

NumPyOpenCVPyTorch
0 likes · 33 min read
Overview of Common Python AI Libraries with Code Examples
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Oct 18, 2022 · Artificial Intelligence

Practical Implementation of Vision Transformer (ViT) for Image Classification in PyTorch

This article walks readers through building, training, and evaluating a Vision Transformer (ViT) model for a five‑class flower classification task, providing detailed code snippets, model architecture explanations, training script adjustments, and experimental results that highlight the importance of pre‑trained weights.

Deep LearningImage ClassificationPyTorch
0 likes · 13 min read
Practical Implementation of Vision Transformer (ViT) for Image Classification in PyTorch
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Oct 11, 2022 · Artificial Intelligence

GANomaly: Theory and Source Code Analysis

This article explains the GANomaly model for semi‑supervised anomaly detection, detailing its generator‑encoder‑discriminator architecture, loss functions, testing phase scoring, and provides annotated PyTorch source code to help readers implement and understand the approach.

Deep LearningEncoder-DecoderGAN
0 likes · 15 min read
GANomaly: Theory and Source Code Analysis
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Oct 10, 2022 · Artificial Intelligence

A Beginner’s Journey into Vision Transformers (ViT) for Computer Vision Engineers

This article introduces the fundamentals of Vision Transformers (ViT) for computer‑vision developers, starting with an overview of the transformer architecture, detailed explanation of self‑attention and multi‑head attention, and step‑by‑step PyTorch code examples that illustrate query, key, value computation and attention scoring.

PyTorchSelf-AttentionTransformer
0 likes · 12 min read
A Beginner’s Journey into Vision Transformers (ViT) for Computer Vision Engineers
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Oct 8, 2022 · Artificial Intelligence

Wasserstein GAN (WGAN): Theory and Hands‑On Implementation

This article explains why traditional GANs suffer from training instability, introduces the Wasserstein (Earth‑Mover) distance as a smoother alternative, derives the WGAN objective, discusses Lipschitz constraints, and provides practical PyTorch code modifications to convert a vanilla GAN into a stable WGAN.

Deep LearningGANPyTorch
0 likes · 21 min read
Wasserstein GAN (WGAN): Theory and Hands‑On Implementation
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Sep 28, 2022 · Artificial Intelligence

EGBAD: Efficient GAN‑Based Anomaly Detection – Theory and Practical Implementation

This article introduces the EGBAD model, an efficient GAN‑based anomaly detection method that replaces AnoGAN's costly latent variable search with an encoder, provides detailed PyTorch code for data loading, model construction, training, and inference, and compares its testing speed with AnoGAN.

DiscriminatorEGBADEncoder
0 likes · 18 min read
EGBAD: Efficient GAN‑Based Anomaly Detection – Theory and Practical Implementation
DataFunTalk
DataFunTalk
Sep 20, 2022 · Artificial Intelligence

Graph4NLP: An Open‑Source Graph Neural Network Library for Natural Language Processing

Graph4NLP is a PyTorch‑ and DGL‑based open‑source library that provides a full pipeline—from static and dynamic graph construction to embedding, learning, prediction, and inference—for applying graph neural networks to a wide range of NLP tasks, with extensive documentation, demos, and future scalability plans.

DGLGraph4NLPNLP
0 likes · 13 min read
Graph4NLP: An Open‑Source Graph Neural Network Library for Natural Language Processing
Python Programming Learning Circle
Python Programming Learning Circle
Jul 4, 2022 · Artificial Intelligence

Building an Advertising Recommendation Model with Python and PyTorch

This article walks through the development of a simple advertising recommendation system using Python, covering data collection, preprocessing with label encoding, text embedding via Torch, constructing an MLP model, and initiating training, while reflecting on the challenges faced by Python developers in the big‑data era.

EmbeddingMLPPyTorch
0 likes · 5 min read
Building an Advertising Recommendation Model with Python and PyTorch
DataFunTalk
DataFunTalk
Jun 9, 2022 · Artificial Intelligence

Understanding and Reproducing MAE (Masked AutoEncoder) for Self‑Supervised Vision Learning with EasyCV

This article introduces the MAE (Masked AutoEncoder) self‑supervised learning method, explains its asymmetric encoder‑decoder design and high masking ratio, evaluates its performance, and provides a step‑by‑step guide to reproduce MAE using Alibaba’s EasyCV framework, including code snippets, training tips, and troubleshooting.

EasyCVMAEPyTorch
0 likes · 15 min read
Understanding and Reproducing MAE (Masked AutoEncoder) for Self‑Supervised Vision Learning with EasyCV
Code DAO
Code DAO
Jun 7, 2022 · Artificial Intelligence

How to Implement SRCNN for Image Super‑Resolution in PyTorch

This article walks through a complete PyTorch implementation of the SRCNN model for image super‑resolution, covering dataset preparation, patch extraction, model architecture, training on a GTX 770 GPU for 2500 epochs, PSNR evaluation, and visual comparisons with bicubic up‑sampling.

PSNRPatchifyPyTorch
0 likes · 22 min read
How to Implement SRCNN for Image Super‑Resolution in PyTorch
Code DAO
Code DAO
Jun 3, 2022 · Artificial Intelligence

Decomposing PointGAN: Teaching a Machine to Generate a Single Point

This article walks through building and analyzing a minimal GAN—PointGAN—that learns to output the single value 1, covering the linear generator, a two‑layer discriminator, training loops, loss visualizations, instability diagnostics, and practical fixes such as loss easing, weighted examples, weight decay, and noisy generator parameters.

DiscriminatorGANNoise Injection
0 likes · 24 min read
Decomposing PointGAN: Teaching a Machine to Generate a Single Point
Code DAO
Code DAO
May 21, 2022 · Artificial Intelligence

How Quantization and Fusion Accelerate CNN Inference on Edge Devices

The article explains CNN inference optimization by applying PyTorch quantization and module‑fusion techniques, compares model size and latency before and after quantization, shows code for building, quantizing, and fusing a simple CNN, and presents benchmark results on CPU, highlighting a four‑fold size reduction and up to 1.7× speed‑up.

CNNPyTorchedge inference
0 likes · 11 min read
How Quantization and Fusion Accelerate CNN Inference on Edge Devices
Code DAO
Code DAO
May 18, 2022 · Artificial Intelligence

A Practical Guide to PyTorch Visualization Tools for Deep Learning

This article walks through the core PyTorch visualization utilities—making image grids, drawing bounding boxes, segmentation masks, and keypoints—explaining why they are needed, how to set up the pipeline, and providing complete code examples for each computer‑vision task.

Bounding BoxesComputer VisionKeypoints
0 likes · 18 min read
A Practical Guide to PyTorch Visualization Tools for Deep Learning
Tencent Cloud Developer
Tencent Cloud Developer
May 12, 2022 · Backend Development

Practical Guide to PyTorch Distributed Training: DP, DDP, Groups, and IO Considerations

This guide explains PyTorch’s distributed training, contrasting single‑node DataParallel with multi‑node DistributedDataParallel, detailing essential parameters, group communication setup, proper use of DistributedSampler for data loading, handling IO bottlenecks, and avoiding common pitfalls such as memory imbalance, unsynchronized buffers, and unused‑parameter errors.

DDPDataParallelDistributed Training
0 likes · 15 min read
Practical Guide to PyTorch Distributed Training: DP, DDP, Groups, and IO Considerations
Code DAO
Code DAO
May 8, 2022 · Artificial Intelligence

Solving Differential Equations with Physics‑Informed Neural Networks in PyTorch

This article explains how to build a Physics‑Informed Neural Network (PINN) in PyTorch to solve a simple logistic ordinary differential equation, covering the underlying theory, loss formulation with equation residuals and boundary conditions, network architecture, automatic differentiation, and training results.

PINNPhysics‑Informed Neural NetworksPyTorch
0 likes · 11 min read
Solving Differential Equations with Physics‑Informed Neural Networks in PyTorch
Code DAO
Code DAO
May 3, 2022 · Artificial Intelligence

How to Build Your Own NeRF Model in PyTorch – Step‑by‑Step Guide

This tutorial walks through the theory and implementation of Neural Radiance Fields (NeRF) in PyTorch, covering positional encoding, the MLP architecture, differentiable volume rendering, hierarchical sampling, training tricks, and references to the original research.

Hierarchical SamplingNeRFNeural Radiance Fields
0 likes · 23 min read
How to Build Your Own NeRF Model in PyTorch – Step‑by‑Step Guide
DataFunSummit
DataFunSummit
Apr 16, 2022 · Big Data

Angel Graph: A Scalable Graph Computing Platform – Architecture, Optimizations, and Applications

The article introduces Angel Graph, a large‑scale graph computing platform built on Angel's parameter‑server architecture and Spark, detailing its evolution, framework components (including Spark‑on‑Angel and PyTorch‑on‑Angel), data and model partitioning strategies, communication and computation optimizations, stability mechanisms, usability features, and real‑world applications across recommendation, risk control, social and gaming domains.

Parameter ServerPyTorchSpark
0 likes · 15 min read
Angel Graph: A Scalable Graph Computing Platform – Architecture, Optimizations, and Applications
Python Programming Learning Circle
Python Programming Learning Circle
Apr 9, 2022 · Artificial Intelligence

Image Resizing with OpenCV and PyTorch

This article explains how to resize images using OpenCV's cv2.resize function and how to scale multi‑dimensional tensors in PyTorch with torch.nn.functional.interpolate, providing detailed parameter descriptions and practical code examples for both single images and batch processing.

Computer VisionImage ProcessingPyTorch
0 likes · 6 min read
Image Resizing with OpenCV and PyTorch
Top Architect
Top Architect
Feb 21, 2022 · Databases

Key New Features in Elasticsearch 8.0

Elasticsearch 8.0 introduces major updates including 7.x REST API compatibility headers, default-enabled security with registration tokens, known issues on ARM/macOS, a preview KNN search API using dense_vector, storage reductions for keyword and text fields, faster geo indexing, PyTorch model support, and numerous other enhancements across aggregations, allocation, analysis, authentication, and core infrastructure.

ElasticsearchPyTorchSecurity
0 likes · 10 min read
Key New Features in Elasticsearch 8.0
MaGe Linux Operations
MaGe Linux Operations
Jan 30, 2022 · Artificial Intelligence

PyTorch vs TensorFlow in 2022: Which Framework Wins for Your Needs?

This article compares PyTorch and TensorFlow in 2022 across model availability, deployment ease, and ecosystem support, using data from HuggingFace, research papers, and industry tools, and offers tailored recommendations for industry engineers, researchers, educators, career changers, hobbyists, and beginners.

AIDeep LearningModel Deployment
0 likes · 20 min read
PyTorch vs TensorFlow in 2022: Which Framework Wins for Your Needs?
Baobao Algorithm Notes
Baobao Algorithm Notes
Jan 28, 2022 · Artificial Intelligence

How Masked Autoencoders Revolutionize Vision Pre‑Training: A Deep Dive

This article provides a detailed technical walkthrough of Masked Autoencoders (MAE) for computer vision, covering its BERT‑inspired masking strategy, asymmetric encoder‑decoder design, implementation specifics, experimental findings on mask ratios and decoder depth, and the resulting performance gains over supervised ViT models.

Computer VisionMAEMasked Modeling
0 likes · 11 min read
How Masked Autoencoders Revolutionize Vision Pre‑Training: A Deep Dive
Code DAO
Code DAO
Jan 15, 2022 · Artificial Intelligence

How Intel BF16 with IPEX and oneDNN Boosts PyTorch Performance

This article explains how Intel and Facebook's BF16 support, combined with the Intel Extension for PyTorch (IPEX) and oneDNN, automates type and layout conversions and adds graph‑fusion optimizations, delivering 1.4×‑4.3× inference and up to 2.4× training speedups on Xeon CPUs for models such as DLRM, BERT‑Large, and ResNext‑101‑32x4d.

BF16CPU accelerationDeep Learning
0 likes · 13 min read
How Intel BF16 with IPEX and oneDNN Boosts PyTorch Performance
Python Programming Learning Circle
Python Programming Learning Circle
Jan 11, 2022 · Artificial Intelligence

Dynamic Learning Rate Adjustment in PyTorch: Optimizer Basics and Scheduler Usage

This article explains how to configure and use PyTorch optimizers, their attributes and methods, and demonstrates various learning‑rate scheduling techniques—including manual updates and built‑in schedulers such as LambdaLR, StepLR, MultiStepLR, ExponentialLR, CosineAnnealingLR, and ReduceLROnPlateau—through clear code examples.

Deep LearningPyTorchScheduler
0 likes · 14 min read
Dynamic Learning Rate Adjustment in PyTorch: Optimizer Basics and Scheduler Usage
Code DAO
Code DAO
Dec 31, 2021 · Cloud Computing

How to Run Distributed PyTorch Training on AzureML with CLI v2

This article walks through the complete workflow for building, testing, and launching a distributed PyTorch training job on AzureML using the CLI v2, covering local script preparation, Accelerate configuration, Docker environment setup, dataset registration, compute target definition, job YAML creation, and job submission with monitoring.

CLIDistributed TrainingDocker
0 likes · 15 min read
How to Run Distributed PyTorch Training on AzureML with CLI v2
Python Programming Learning Circle
Python Programming Learning Circle
Dec 27, 2021 · Artificial Intelligence

PyTorch vs TensorFlow in 2022: Which Framework to Choose?

An in‑depth 2022 comparison of PyTorch and TensorFlow evaluates model availability, deployment ease, and ecosystem support, showing PyTorch dominates research while TensorFlow excels in deployment, and offers tailored recommendations for industry professionals, researchers, educators, career changers, hobbyists, and beginners.

AIDeep LearningPyTorch
0 likes · 20 min read
PyTorch vs TensorFlow in 2022: Which Framework to Choose?
Code DAO
Code DAO
Dec 25, 2021 · Artificial Intelligence

Understanding Graph Neural Networks: Nodes, Edges, and Message Passing

This article explains the fundamentals of graph neural networks, covering graph concepts, node classification via neighborhood aggregation, message‑passing mechanics, mathematical notation, a full DGL‑PyTorch implementation on the Reddit dataset, and training results showing accuracy improvements up to 91 %.

DGLGCNGNN
0 likes · 9 min read
Understanding Graph Neural Networks: Nodes, Edges, and Message Passing
Baobao Algorithm Notes
Baobao Algorithm Notes
Dec 21, 2021 · Artificial Intelligence

Boost Time Series Forecasting with Autocorrelated Error Adjustment – A 5‑Line PyTorch Trick

This article explains a NeurIPS 2021 paper that introduces a learnable autocorrelation correction for neural network time‑series models, shows the underlying theory, provides concise PyTorch code implementing the adjustment, reports a ~17% average performance gain across datasets, and lists additional practical tricks for time‑series forecasting.

Neural NetworksPyTorchTime Series
0 likes · 6 min read
Boost Time Series Forecasting with Autocorrelated Error Adjustment – A 5‑Line PyTorch Trick
Code DAO
Code DAO
Dec 17, 2021 · Artificial Intelligence

Applying UNETR Transformer for 3D Medical Image Segmentation

This article walks through using the UNETR transformer architecture to segment 3D brain MRI scans from the BRATS dataset, detailing environment setup, data preprocessing with MONAI, model construction, training with DiceCE loss, validation metrics, and visualizing the best‑performing model outputs.

3D segmentationBRATSMONAI
0 likes · 16 min read
Applying UNETR Transformer for 3D Medical Image Segmentation
Baobao Algorithm Notes
Baobao Algorithm Notes
Dec 16, 2021 · Artificial Intelligence

Boost Model Robustness with 5 Lines of R‑Drop Contrastive Learning

This article introduces a simple five‑line implementation of R‑Drop, a contrastive self‑supervised learning technique that leverages dropout‑induced perturbations to improve model robustness, explains the underlying principle, provides the exact PyTorch code, and compares it with the ConSERT method.

DropoutPyTorchcontrastive learning
0 likes · 5 min read
Boost Model Robustness with 5 Lines of R‑Drop Contrastive Learning
Code DAO
Code DAO
Dec 14, 2021 · Artificial Intelligence

Building a Chess AI from Scratch: Combining AlphaZero and Transformers (Part 2)

This article walks through constructing a learnable chess AI by integrating AlphaZero‑style Monte Carlo Tree Search with a decoder‑only Transformer, detailing the game tree logic, model architecture, input and output encodings, self‑play training loop, and code implementation in PyTorch.

AlphaZeroMonteCarloTreeSearchPyTorch
0 likes · 23 min read
Building a Chess AI from Scratch: Combining AlphaZero and Transformers (Part 2)
Code DAO
Code DAO
Dec 11, 2021 · Artificial Intelligence

Using DCGAN to Generate Synthetic Marine Plastic Images

This article explains how to apply a Deep Convolutional GAN in PyTorch to create realistic synthetic images of marine plastic, addressing dataset scarcity, detailing the network architecture, training procedure, and showing loss curves and generated samples.

DCGANGANMarine Plastic
0 likes · 13 min read
Using DCGAN to Generate Synthetic Marine Plastic Images
Code DAO
Code DAO
Dec 8, 2021 · Artificial Intelligence

Understanding Compact Transformers: Build and Train Vision & NLP Models on a Personal PC

This article walks through the design of Compact Transformers, explaining scaled dot‑product self‑attention, positional embeddings, multi‑head attention, and Vision Transformer architecture, and provides full PyTorch code so readers can train lightweight CV and NLP classifiers on a single PC.

Compact TransformersPatch EmbeddingPositional Embedding
0 likes · 19 min read
Understanding Compact Transformers: Build and Train Vision & NLP Models on a Personal PC
Code DAO
Code DAO
Dec 8, 2021 · Artificial Intelligence

Optimizers and Schedulers in Neural Network Architecture: A Detailed Guide

This article explains how optimizers and learning‑rate schedulers work, how to configure their hyperparameters and parameter groups, and how to apply differential learning rates and adaptive schedules in PyTorch and Keras to improve model training and transfer‑learning performance.

KerasPyTorchhyperparameter tuning
0 likes · 10 min read
Optimizers and Schedulers in Neural Network Architecture: A Detailed Guide
Code DAO
Code DAO
Dec 5, 2021 · Artificial Intelligence

Why DropBlock Outperforms Dropout as an Image Regularizer

This article demonstrates how to implement DropBlock in PyTorch, explains why Dropout fails on image data, details the gamma calculation and mask generation, and shows visual comparisons that illustrate the superiority of contiguous region dropping over random pixel dropout.

Computer VisionDeep LearningDropBlock
0 likes · 11 min read
Why DropBlock Outperforms Dropout as an Image Regularizer
Code DAO
Code DAO
Dec 2, 2021 · Artificial Intelligence

Transfer Learning with ShuffleNetV2 for Flower Classification

This article walks through building a PyTorch ShuffleNetV2 model, preparing the Kaggle Flowers dataset, training with transfer learning on a GPU, visualizing loss and accuracy, and performing inference on five test images, achieving nearly 90% validation accuracy after 95 epochs.

CNNPyTorchShuffleNetV2
0 likes · 19 min read
Transfer Learning with ShuffleNetV2 for Flower Classification
Code DAO
Code DAO
Dec 1, 2021 · Artificial Intelligence

Building a Satellite Image Classifier with PyTorch ResNet34

This article walks through creating a satellite image classification pipeline using PyTorch and a pretrained ResNet34 model, covering dataset preparation, project structure, data loading, model definition, training, validation, loss/accuracy plotting, and inference on new images with detailed code examples and results.

Deep LearningImage ClassificationPyTorch
0 likes · 17 min read
Building a Satellite Image Classifier with PyTorch ResNet34
Code DAO
Code DAO
Nov 30, 2021 · Artificial Intelligence

How to Train a Custom Object Detector with PyTorch Faster R‑CNN

This article provides a step‑by‑step guide to building, training, and evaluating a custom object detection model using PyTorch Faster R‑CNN on a microcontroller dataset, covering data preparation, configuration, model modification, training loops, loss visualization, and inference on new images.

Faster R-CNNPyTorchPython
0 likes · 23 min read
How to Train a Custom Object Detector with PyTorch Faster R‑CNN
Code DAO
Code DAO
Nov 28, 2021 · Artificial Intelligence

Adapting Soft Actor‑Critic for Discrete Action Spaces in Deep Reinforcement Learning

This article explains how to modify the Soft Actor‑Critic (SAC) algorithm—originally designed for continuous actions—to work with discrete action environments, presents the required changes to the actor and critic loss functions, provides a full PyTorch implementation, and evaluates the method on the CartPole‑v1 benchmark.

CartPoleDiscrete ActionsEntropy Regularization
0 likes · 20 min read
Adapting Soft Actor‑Critic for Discrete Action Spaces in Deep Reinforcement Learning
Python Programming Learning Circle
Python Programming Learning Circle
Nov 8, 2021 · Artificial Intelligence

YOLOv5 Tutorial: From YOLOv3 to YOLOv5, Code Walkthrough, Model Export (JIT & ONNX) and Usage

This article provides a comprehensive guide on YOLOv5, covering its background from YOLOv3, detailed code analysis of the model architecture, step‑by‑step instructions for running detect.py, configuring yolov5s.yaml, exporting the model to TorchScript JIT and ONNX formats, and practical inference examples using PyTorch and ONNX Runtime.

JITONNXPyTorch
0 likes · 16 min read
YOLOv5 Tutorial: From YOLOv3 to YOLOv5, Code Walkthrough, Model Export (JIT & ONNX) and Usage
Python Programming Learning Circle
Python Programming Learning Circle
Aug 24, 2021 · Artificial Intelligence

Top 10 Python Libraries for Machine Learning

An overview of ten widely used Python machine‑learning libraries—including TensorFlow, Scikit‑Learn, NumPy, Keras, PyTorch, LightGBM, Eli5, SciPy, Theano, and Pandas—detailing their core features, typical applications, and why they are essential tools for data scientists and AI developers.

KerasNumPyPyTorch
0 likes · 15 min read
Top 10 Python Libraries for Machine Learning
Python Programming Learning Circle
Python Programming Learning Circle
Aug 23, 2021 · Artificial Intelligence

Efficient PyTorch Training Pipeline: Tips, Profiling, and Multi‑GPU Strategies

This article presents practical strategies for building high‑performance PyTorch training pipelines, covering bottleneck identification, efficient data loading, RAM‑based datasets, profiling tools, multi‑GPU training with DataParallel and DistributedDataParallel, custom loss implementation, and hardware‑vs‑software trade‑offs to accelerate deep‑learning workloads.

Custom LossDataLoaderDeep Learning
0 likes · 13 min read
Efficient PyTorch Training Pipeline: Tips, Profiling, and Multi‑GPU Strategies
Python Programming Learning Circle
Python Programming Learning Circle
Jul 23, 2021 · Artificial Intelligence

Understanding PyTorch's Backward Propagation Engine (BP Engine)

This article explains how PyTorch's BP Engine dynamically builds the computation graph for back‑propagation, detailing its C++ class structure, thread management, task queues, and key functions such as start_threads, compute_dependencies, execute, and evaluate_function, with illustrative code examples.

BackpropagationCEngine
0 likes · 17 min read
Understanding PyTorch's Backward Propagation Engine (BP Engine)
Kuaishou Tech
Kuaishou Tech
Jul 16, 2021 · Artificial Intelligence

Bagua: An Open‑Source Distributed Training Framework for Deep Learning

Bagua is a distributed training framework co‑developed by Kuaishou and ETH Zürich that combines algorithmic and system‑level optimizations—such as decentralized, asynchronous, and compressed communication—to achieve up to 60% higher performance than existing frameworks like PyTorch‑DDP, Horovod, and BytePS across various AI workloads.

BaguaDeep LearningDistributed Training
0 likes · 15 min read
Bagua: An Open‑Source Distributed Training Framework for Deep Learning
MaGe Linux Operations
MaGe Linux Operations
Jul 8, 2021 · Artificial Intelligence

TensorFlow vs PyTorch 2.x: Which AI Framework Wins in 2021?

An in‑depth comparison of TensorFlow 2.x and PyTorch 1.8 highlights new features, deployment options like TensorFlow Lite and PyTorch Mobile, coding style differences, and practical guidance on choosing the right deep‑learning library for various projects and skill levels.

Deep LearningPyTorchTensorFlow
0 likes · 6 min read
TensorFlow vs PyTorch 2.x: Which AI Framework Wins in 2021?
Python Programming Learning Circle
Python Programming Learning Circle
Jul 6, 2021 · Artificial Intelligence

Understanding ResNet and Building It from Scratch with PyTorch

This article explains the motivation behind residual networks, describes the architecture of ResNet including residual blocks and skip connections, lists available Keras implementations, and provides a step‑by‑step PyTorch tutorial with complete code to construct and test ResNet‑50/101/152 models.

CNNDeep LearningPyTorch
0 likes · 10 min read
Understanding ResNet and Building It from Scratch with PyTorch
Python Programming Learning Circle
Python Programming Learning Circle
May 29, 2021 · Artificial Intelligence

Comparing PyTorch 1.8 and TensorFlow 2.5: New Features, Use Cases, and Choosing the Right Framework

This article reviews the latest releases of PyTorch 1.8 and TensorFlow 2.5, outlining their new functionalities, ecosystem tools such as TensorFlow.js, Lite, and TFX, as well as PyTorch Mobile and Lightning, and provides guidance on selecting the most suitable framework for different deep‑learning projects.

Deep LearningPyTorchTensorFlow
0 likes · 7 min read
Comparing PyTorch 1.8 and TensorFlow 2.5: New Features, Use Cases, and Choosing the Right Framework
MaGe Linux Operations
MaGe Linux Operations
Mar 29, 2021 · Artificial Intelligence

Mastering PyTorch Visualization: TensorBoard and Visdom Guide

This tutorial explains how to install, launch, and use TensorBoard and Visdom with PyTorch, providing step‑by‑step commands, code examples for logging training metrics, and visualizing images and plots to monitor deep‑learning experiments.

Deep LearningPyTorchPython
0 likes · 6 min read
Mastering PyTorch Visualization: TensorBoard and Visdom Guide
Bitu Technology
Bitu Technology
Mar 26, 2021 · Artificial Intelligence

Applying Machine Learning to Advertising‑Based Video‑On‑Demand (AVOD) at Tubi

This article explains how Tubi leverages machine learning—particularly PyTorch, Databricks, and cloud services—to improve content understanding, advertising technology, and recommendation systems within its advertising‑based video‑on‑demand platform, outlining the three AVOD pillars, technical stack, and future research directions.

AVODDatabricksPyTorch
0 likes · 13 min read
Applying Machine Learning to Advertising‑Based Video‑On‑Demand (AVOD) at Tubi
360 Quality & Efficiency
360 Quality & Efficiency
Mar 26, 2021 · Operations

Deploying a Code Clone Detection Model with TorchServe

This article explains how to build a code clone detection service using a CodeBERT classification model, create a custom TorchServe handler, package the model with torch-model-archiver, launch the service, and test it with example code pairs to demonstrate clone and non‑clone predictions.

HandlerModel DeploymentPyTorch
0 likes · 8 min read
Deploying a Code Clone Detection Model with TorchServe
JD Cloud Developers
JD Cloud Developers
Mar 15, 2021 · Artificial Intelligence

Top Tech Weekly: AI Earthquake Monitor, PyTorch 1.8, Language Rankings & More

This developer community weekly roundup highlights CCTV's new big‑data governance platform, RedMonk's programming language rankings, Chromium‑based browsers adopting a four‑week release cycle, PyTorch 1.8 with AMD support, the world’s first AI‑driven earthquake monitoring system, Red Hat OpenShift 4.7, a deep meta‑learning model for city sales prediction, and a CVPR breakthrough in controllable human image generation.

Big DataCloud NativePyTorch
0 likes · 9 min read
Top Tech Weekly: AI Earthquake Monitor, PyTorch 1.8, Language Rankings & More
MaGe Linux Operations
MaGe Linux Operations
Mar 11, 2021 · Artificial Intelligence

What’s New in PyTorch 1.8? Key Features, APIs, and Performance Boosts

PyTorch 1.8, released by the PyTorch team, bundles over 3,000 commits since 1.7, introducing AMD ROCm support, enhanced Python function conversion, stable FFT and linear‑algebra APIs, complex‑tensor autograd, distributed‑training improvements, new mobile tutorials, performance tools, and several prototype features.

Deep LearningGPUMobile
0 likes · 6 min read
What’s New in PyTorch 1.8? Key Features, APIs, and Performance Boosts
21CTO
21CTO
Jan 31, 2021 · Artificial Intelligence

How Facebook’s Open‑Source Surge is Shaping AI and Development in 2020

Facebook’s 2020 open‑source report reveals a rapid expansion to over 700 repositories and 1.3 million GitHub followers, highlights AI breakthroughs like the M2M‑100 multilingual model and ReBel algorithm, and showcases frontend tools such as React Native and the newly released Recoil library.

FacebookM2M-100PyTorch
0 likes · 4 min read
How Facebook’s Open‑Source Surge is Shaping AI and Development in 2020
Sohu Tech Products
Sohu Tech Products
Nov 11, 2020 · Artificial Intelligence

Illustrated Transformer: Comprehensive Explanation and Code Implementation

This article provides a step‑by‑step illustrated guide to the Transformer architecture, covering its macro structure, detailed self‑attention mechanisms, multi‑head attention, positional encoding, residual connections, decoder operation, training process, loss functions, and includes complete PyTorch and custom Python code examples.

NLPPyTorchSelf-Attention
0 likes · 33 min read
Illustrated Transformer: Comprehensive Explanation and Code Implementation
Python Programming Learning Circle
Python Programming Learning Circle
Apr 26, 2020 · Artificial Intelligence

Understanding PyTorch Autograd: Tensors, Gradients, and Backpropagation

This article explains PyTorch's autograd system, covering tensor creation, the requires_grad flag, detaching tensors, disabling gradient tracking with no_grad, the Function class and computational graph, and demonstrates forward and backward passes with code examples illustrating gradient computation and Jacobian‑vector products.

AutogradBackpropagationPyTorch
0 likes · 6 min read
Understanding PyTorch Autograd: Tensors, Gradients, and Backpropagation
58 Tech
58 Tech
Mar 27, 2020 · Artificial Intelligence

dl_inference: Open‑Source General Deep Learning Inference Service

dl_inference is an open‑source inference platform that simplifies deployment of TensorFlow and PyTorch models in production, offering unified gRPC access, load‑balanced multi‑node serving, GPU/CPU options, customizable pre‑ and post‑processing, and extensible architecture for future AI workloads.

AI inferenceDeep LearningModel Serving
0 likes · 11 min read
dl_inference: Open‑Source General Deep Learning Inference Service
UCloud Tech
UCloud Tech
Mar 24, 2020 · Artificial Intelligence

Why Does PyTorch Struggle with UFS Storage? Insights and Optimizations

A detailed case study reveals why PyTorch training on UFS file storage suffers severe I/O bottlenecks, compares it with local SSD and SSHFS, and presents practical optimizations such as using cv2.imdecode, caching DataLoader handles, and converting small‑file datasets into large UFS files to close the performance gap.

AI trainingPyTorchUFS
0 likes · 14 min read
Why Does PyTorch Struggle with UFS Storage? Insights and Optimizations
58 Tech
58 Tech
Dec 20, 2019 · Artificial Intelligence

Deep Learning Platform on Kubernetes: Architecture, Resource Management, Offline Training and Online Inference

The article presents a comprehensive overview of 58.com’s AI platform built on Kubernetes, detailing its layered architecture, resource scheduling, offline training pipelines, debugging environment, distributed TensorFlow/PyTorch training, performance benchmarks, and online inference services, highlighting how the system empowers various business units with scalable AI capabilities.

Distributed TrainingKubernetesPyTorch
0 likes · 11 min read
Deep Learning Platform on Kubernetes: Architecture, Resource Management, Offline Training and Online Inference
MaGe Linux Operations
MaGe Linux Operations
Sep 27, 2019 · Artificial Intelligence

Top 10 Python Libraries Every AI Developer Should Master

This article introduces ten essential Python libraries—TensorFlow, Scikit‑Learn, NumPy, Keras, PyTorch, LightGBM, Eli5, SciPy, Theano, and Pandas—detailing their features, typical use cases, and adoption in machine‑learning and data‑science projects, while highlighting each library's performance advantages, community support, and integration capabilities to help developers choose the right tool for their AI workflows.

KerasNumPyPyTorch
0 likes · 15 min read
Top 10 Python Libraries Every AI Developer Should Master
Qunar Tech Salon
Qunar Tech Salon
Sep 11, 2018 · Artificial Intelligence

Overview of Deep Learning Object Detection Methods and Detailed Implementation of Faster R‑CNN

This article reviews major deep‑learning object detection approaches—including one‑stage YOLO and SSD and two‑stage RCNN, Fast RCNN, and Faster RCNN—then provides a step‑by‑step explanation of Faster RCNN’s architecture, region‑proposal network, RoI pooling, loss functions, and sample PyTorch code.

Computer VisionFaster R-CNNPyTorch
0 likes · 20 min read
Overview of Deep Learning Object Detection Methods and Detailed Implementation of Faster R‑CNN
MaGe Linux Operations
MaGe Linux Operations
Mar 3, 2017 · Artificial Intelligence

Top 5 Python Libraries to Supercharge Your Machine Learning Projects

This article introduces five highly rated Python libraries—PyWren, Tfdeploy, Luigi, Kubelib, and PyTorch—that streamline data handling, cloud execution, workflow orchestration, and GPU acceleration, helping machine‑learning engineers boost productivity and tackle complex projects more efficiently.

AWS LambdaKubernetesPyTorch
0 likes · 6 min read
Top 5 Python Libraries to Supercharge Your Machine Learning Projects