Tagged articles

ResNet

18 articles · Page 1 of 1
PaperAgent
PaperAgent
Jun 7, 2026 · Artificial Intelligence

CVPR 2026 Awards Spotlight: D4RT, ResNet, and the Rise of 4D Vision AI

The CVPR 2026 award ceremony, with 16,092 submissions and a 25.3% acceptance rate, highlights a shift in computer vision from static image understanding to dynamic 4D reconstruction, single‑image 3D generation, game‑agent modeling, and real‑time image editing, while honoring foundational works like ResNet and YOLO.

4D reconstructionCVPR 2026D4RT
0 likes · 7 min read
CVPR 2026 Awards Spotlight: D4RT, ResNet, and the Rise of 4D Vision AI
Machine Heart
Machine Heart
Jun 5, 2026 · Industry Insights

ResNet and YOLO Win Time-Tested Awards at CVPR 2026 – Full Award Breakdown

CVPR 2026 received 16,092 submissions with a 25.3% acceptance rate, announced a record‑high paper count, and presented detailed award analyses—including the Longuet‑Higgins Prize for ResNet and YOLO, best paper breakthroughs in dynamic 4D reconstruction, 3D object generation, and generalist gaming agents, as well as student and young researcher honors.

Award AnalysisCVPR 2026Longuet-Higgins Prize
0 likes · 12 min read
ResNet and YOLO Win Time-Tested Awards at CVPR 2026 – Full Award Breakdown
DeepHub IMBA
DeepHub IMBA
Mar 23, 2026 · Artificial Intelligence

How KgCoOp Uses Knowledge‑Guided Context Optimization to Prevent Prompt Tuning Forgetting

The article analyzes why standard prompt tuning (CoOp) causes catastrophic forgetting in visual‑language models, introduces the KgCoOp framework that adds a knowledge‑guided loss to regularize prompts, and shows through extensive experiments on 11 benchmarks that KgCoOp improves unseen‑class accuracy, harmonic mean, and efficiency while discussing trade‑offs and limitations.

Catastrophic ForgettingKnowledge-guided OptimizationPrompt Tuning
0 likes · 11 min read
How KgCoOp Uses Knowledge‑Guided Context Optimization to Prevent Prompt Tuning Forgetting
xkx's Tech General Store
xkx's Tech General Store
Jan 12, 2026 · Artificial Intelligence

How Traditional Programmers Can Thrive in the AI Era: Understanding YOLOv2 Architecture and Implementation

This article walks through YOLOv2’s eight core upgrades over YOLOv1, explains the design rationale behind each change, provides detailed PyTorch code for the backbone, neck, head and prediction layers, demonstrates training on COCO, and outlines further optimization directions for real‑world object detection.

PyTorchResNetYOLOv2
0 likes · 16 min read
How Traditional Programmers Can Thrive in the AI Era: Understanding YOLOv2 Architecture and Implementation
AI Insight Log
AI Insight Log
Jan 1, 2026 · Artificial Intelligence

Can DeepSeek’s mHC Architecture Break ResNet’s Decade-Long Dominance?

DeepSeek’s new paper “mHC: Manifold‑Constrained Hyper‑Connections” proposes a novel architecture that replaces traditional residual connections with mathematically constrained hyper‑connections, showing on a 27B model a modest 6.7 % training‑time increase but significant stability gains and superior performance on BBH, DROP and GSM8K benchmarks.

DeepSeekLLM trainingResNet
0 likes · 8 min read
Can DeepSeek’s mHC Architecture Break ResNet’s Decade-Long Dominance?
xkx's Tech General Store
xkx's Tech General Store
Dec 30, 2025 · Artificial Intelligence

From Theory to Practice: Reproducing YOLOv1 – A Step‑by‑Step Guide for Traditional Programmers

This article provides a comprehensive, hands‑on walkthrough of YOLOv1—from its single‑stage detection principles and core architectural questions to a full PyTorch implementation, training pipeline, common pitfalls, and a live camera demo—targeted at developers transitioning into AI.

Deep LearningPyTorchResNet
0 likes · 10 min read
From Theory to Practice: Reproducing YOLOv1 – A Step‑by‑Step Guide for Traditional Programmers
IT Services Circle
IT Services Circle
May 2, 2025 · Artificial Intelligence

Understanding Gradient Vanishing in Deep Neural Networks and How to Mitigate It

The article explains why deep networks suffer from gradient vanishing—especially when using sigmoid or tanh activations—covers the underlying mathematics, compares activation functions, and presents practical techniques such as proper weight initialization, batch normalization, residual connections, and code examples to visualize the phenomenon.

Batch NormalizationDeep LearningResNet
0 likes · 7 min read
Understanding Gradient Vanishing in Deep Neural Networks and How to Mitigate It
Bilibili Tech
Bilibili Tech
Aug 27, 2024 · Artificial Intelligence

Multimodal Video Scene Classification for Adaptive Video Processing

The paper presents a multimodal video scene classification system that leverages CLIP‑generated pseudo‑labels and a fine‑tuned image encoder to automatically identify nature, animation/game, and document scenes, enabling more effective adaptive transcoding, intelligent restoration, and quality assessment for user‑generated content on platforms such as Bilibili.

Bilibili multimediaCLIPDeep Learning
0 likes · 17 min read
Multimodal Video Scene Classification for Adaptive Video Processing
Baobao Algorithm Notes
Baobao Algorithm Notes
Apr 11, 2022 · Artificial Intelligence

Can ResNet Still Beat Transformers? A Deep Dive into Modern Training Tricks

This article reviews recent research and official PyTorch blog updates that modify ResNet architectures and training tricks, compares their performance against EfficientNet, ConvNeXt, and Vision Transformers using extensive ImageNet benchmarks, and provides both literature‑based and local evaluation results to assess whether classic CNNs remain competitive.

CNNResNetVision Transformer
0 likes · 13 min read
Can ResNet Still Beat Transformers? A Deep Dive into Modern Training Tricks
Code DAO
Code DAO
Dec 29, 2021 · Artificial Intelligence

Understanding Stand-Alone Axial-Attention for Panoptic Segmentation

The paper proposes a stand‑alone axial‑attention mechanism that converts 2‑D attention into 1‑D to lower computational cost while preserving global context, introduces position‑sensitive self‑attention, integrates it into Axial‑ResNet and Axial‑DeepLab, and demonstrates strong results on four large segmentation datasets.

Axial AttentionDeepLabPanoptic Segmentation
0 likes · 7 min read
Understanding Stand-Alone Axial-Attention for Panoptic Segmentation
Code DAO
Code DAO
Dec 22, 2021 · Artificial Intelligence

Understanding SimCLR: A Simple Contrastive Learning Framework for Visual Representations

This article explains SimCLR, the 2020 Google Research framework that advances self‑supervised visual pre‑training by using extensive data augmentations, a ResNet encoder, a projection‑head MLP, and the NT‑Xent loss to learn robust image representations that outperform many prior methods on ImageNet and other benchmarks.

NT-Xent lossResNetSimCLR
0 likes · 7 min read
Understanding SimCLR: A Simple Contrastive Learning Framework for Visual Representations
Python Programming Learning Circle
Python Programming Learning Circle
Jul 6, 2021 · Artificial Intelligence

Understanding ResNet and Building It from Scratch with PyTorch

This article explains the motivation behind residual networks, describes the architecture of ResNet including residual blocks and skip connections, lists available Keras implementations, and provides a step‑by‑step PyTorch tutorial with complete code to construct and test ResNet‑50/101/152 models.

CNNDeep LearningPyTorch
0 likes · 10 min read
Understanding ResNet and Building It from Scratch with PyTorch
Hulu Beijing
Hulu Beijing
Mar 7, 2019 · Artificial Intelligence

From AlexNet to ResNeXt: Key Milestones in CNN Evolution

This article traces the evolution of convolutional neural networks from the pioneering AlexNet through VGG, Inception, ResNet, Inception‑v4, Inception‑ResNet and ResNeXt, highlighting architectural innovations, performance gains, and the underlying biological inspirations that shaped modern deep learning models.

AlexNetCNNDeep Learning
0 likes · 13 min read
From AlexNet to ResNeXt: Key Milestones in CNN Evolution
Alibaba Cloud Developer
Alibaba Cloud Developer
Sep 6, 2018 · Artificial Intelligence

How Wide‑ResNet with Batch Norm Boosts 1688’s ‘You May Like’

This article introduces the Wide&Deep, PNN, DeepFM, and a novel Wide‑ResNet model applied to Alibaba’s 1688 “You May Like” recommendation, describes the system architecture, training data, experimental results showing AUC improvements with batch normalization, and shares practical tuning insights.

Batch NormalizationResNetdeepfm
0 likes · 12 min read
How Wide‑ResNet with Batch Norm Boosts 1688’s ‘You May Like’
Qunar Tech Salon
Qunar Tech Salon
Aug 19, 2016 · Artificial Intelligence

Deep Learning Anti‑Scam Guide: A Non‑Technical Overview of Neural Networks, Training, and Practical Tips

This article provides a humorous yet informative, non‑mathematical guide to deep learning, covering neural network basics, layer addition, training methods, back‑propagation, unsupervised pre‑training, regularization, ResNet shortcuts, GPU computation, framework choices, and practical advice for applying deep learning to industrial data.

AIDeep LearningGPU
0 likes · 26 min read
Deep Learning Anti‑Scam Guide: A Non‑Technical Overview of Neural Networks, Training, and Practical Tips