Tagged articles
23 articles
Page 1 of 1
AIWalker
AIWalker
Mar 20, 2026 · Artificial Intelligence

Plug‑and‑Play reAR Boosts Visual AR to SOTA Quality with Only 177M Parameters

The paper introduces reAR, a plug‑and‑play regularization framework that aligns generator and tokenizer representations in visual autoregressive models, dramatically improving image quality and matching large diffusion models while using far fewer parameters, and validates the approach with extensive experiments, ablations, and scalability analysis.

AI researchRegularizationimage generation
0 likes · 20 min read
Plug‑and‑Play reAR Boosts Visual AR to SOTA Quality with Only 177M Parameters
Tencent Cloud Developer
Tencent Cloud Developer
Nov 4, 2025 · Artificial Intelligence

From Functions to Transformers: Mastering Neural Networks Step by Step

This article walks you through the evolution from basic mathematical functions to modern large‑scale models, explaining activation functions, forward and backward propagation, loss calculation, gradient descent, regularization, dropout, word embeddings, RNNs, and the core mechanics of the Transformer architecture.

Attention MechanismDeep LearningNeural Networks
0 likes · 15 min read
From Functions to Transformers: Mastering Neural Networks Step by Step
IT Services Circle
IT Services Circle
Sep 5, 2025 · Artificial Intelligence

10 Must‑Know Tencent AI Interview Topics: Overfitting, Dropout, Transformers & Beyond

This article compiles the ten core questions from a Tencent algorithm interview, covering overfitting, regularization, generalization error, dropout, residual connections, attention, embeddings, BART vs BERT, instruction‑tuning data, LLM hallucination, and why GANs collapse more than diffusion models, with concise explanations and interview‑ready tips.

GANLLMRegularization
0 likes · 22 min read
10 Must‑Know Tencent AI Interview Topics: Overfitting, Dropout, Transformers & Beyond
Data Party THU
Data Party THU
Aug 18, 2025 · Artificial Intelligence

Unlock XGBoost Performance: Master the Core Parameters

This article provides a detailed, visual guide to XGBoost's most important hyper‑parameters—such as max_depth, min_child_weight, learning_rate, gamma, subsample, colsample_bytree, scale_pos_weight, alpha, and lambda—explaining how each influences tree complexity, regularization, and model generalization, and offering practical examples for effective tuning.

Model OptimizationRegularizationXGBoost
0 likes · 12 min read
Unlock XGBoost Performance: Master the Core Parameters
AI Frontier Lectures
AI Frontier Lectures
Jul 10, 2025 · Artificial Intelligence

Can Dispersive Loss Supercharge Diffusion Models Without Extra Pre‑training?

Dispersive Loss is a plug‑and‑play regularization technique that enhances diffusion‑based generative models by encouraging dispersed internal representations, requiring no additional pre‑training, parameters, or data, and consistently improves performance across various model sizes and configurations, as demonstrated through extensive experiments.

Dispersive LossModel EvaluationRegularization
0 likes · 18 min read
Can Dispersive Loss Supercharge Diffusion Models Without Extra Pre‑training?
Baobao Algorithm Notes
Baobao Algorithm Notes
May 12, 2025 · Artificial Intelligence

Why Dropout Is Dropped in Large‑Scale Model Training: Effects, Efficiency, Stability

Training massive AI models now commonly omits dropout because its original scaling trick fails to match training and inference distributions, leading to poorer performance, higher computational cost, and instability, while alternative regularization like normalization remains useful, as illustrated by practical observations and historical tricks.

AI stabilityDropoutRegularization
0 likes · 6 min read
Why Dropout Is Dropped in Large‑Scale Model Training: Effects, Efficiency, Stability
Ops Development & AI Practice
Ops Development & AI Practice
Jul 8, 2024 · Artificial Intelligence

Essential Denoising Techniques for Training Large AI Models

This article outlines key denoising methods—including data cleaning, augmentation, regularization, adversarial training, and self‑supervised learning—that improve the performance, generalization, and robustness of large neural network and transformer models.

DenoisingRegularizationadversarial training
0 likes · 5 min read
Essential Denoising Techniques for Training Large AI Models
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
May 5, 2024 · Artificial Intelligence

Comprehensive Guide to Neural Network Algorithms: Definitions, Structure, Implementation, and Training

This article provides an in‑depth tutorial on neural network algorithms, covering their biological inspiration, significance, advantages and drawbacks, detailed architecture, data preparation, one‑hot encoding, weight initialization, forward and backward propagation, cost functions, regularization, gradient checking, and complete Python code examples.

AIBackpropagationNeural Networks
0 likes · 37 min read
Comprehensive Guide to Neural Network Algorithms: Definitions, Structure, Implementation, and Training
DataFunSummit
DataFunSummit
Feb 14, 2023 · Artificial Intelligence

Deep Learning Hyperparameter Tuning and Training Tips: Insights from Zhihu Experts

This article compiles practical deep learning training and hyperparameter tuning advice from Zhihu contributors, covering model debugging, learning‑rate strategies, optimizer choices, data preprocessing, regularization techniques, initialization methods, common pitfalls, recommended research papers, and ensemble approaches.

Deep LearningRegularizationgradient clipping
0 likes · 13 min read
Deep Learning Hyperparameter Tuning and Training Tips: Insights from Zhihu Experts
Model Perspective
Model Perspective
Jan 15, 2023 · Artificial Intelligence

Mastering Model Evaluation: Key Metrics, Validation Techniques, and Diagnostics

This guide explains essential evaluation metrics for classification and regression models—including confusion matrix, ROC/AUC, R², and main performance indicators—covers model selection strategies such as train‑validation‑test splits, k‑fold cross‑validation, and regularization techniques, and discusses bias‑variance trade‑offs and diagnostic tools.

Evaluation MetricsModel SelectionRegularization
0 likes · 6 min read
Mastering Model Evaluation: Key Metrics, Validation Techniques, and Diagnostics
Model Perspective
Model Perspective
Jul 21, 2022 · Artificial Intelligence

Tackling Multicollinearity: Ridge and LASSO Regression Explained with Python

This article explains how multicollinearity undermines ordinary least squares estimates, introduces ridge and LASSO regularization as remedies, and demonstrates their application on a 1966 French economic dataset using Python’s statsmodels, complete with code and interpretation of results.

LASSOPythonRegularization
0 likes · 7 min read
Tackling Multicollinearity: Ridge and LASSO Regression Explained with Python
Code DAO
Code DAO
Dec 18, 2021 · Artificial Intelligence

Essential Feature Selection Techniques for Machine Learning

This article explains why feature selection is crucial for building robust machine‑learning models and walks through popular filter, wrapper, and embedded methods—including information gain, chi‑square, LASSO, random‑forest importance, and PCA—providing code examples and practical guidance.

PCARegularizationembedded methods
0 likes · 18 min read
Essential Feature Selection Techniques for Machine Learning
Code DAO
Code DAO
Dec 5, 2021 · Artificial Intelligence

Why DropBlock Outperforms Dropout as an Image Regularizer

This article demonstrates how to implement DropBlock in PyTorch, explains why Dropout fails on image data, details the gamma calculation and mask generation, and shows visual comparisons that illustrate the superiority of contiguous region dropping over random pixel dropout.

Computer VisionDeep LearningDropBlock
0 likes · 11 min read
Why DropBlock Outperforms Dropout as an Image Regularizer
DataFunTalk
DataFunTalk
Dec 4, 2021 · Artificial Intelligence

Practical Deep Learning Training Tricks: Cyclic LR, Flooding, Warmup, RAdam, Adversarial Training, Focal Loss, Dropout, Normalization and More

This article compiles essential deep learning training techniques—including cyclic learning rates, flooding, warmup, RAdam optimizer, adversarial training, focal loss, dropout, batch/group/weight normalization, label smoothing, Wasserstein GAN, skip connections, and weight initialization—providing concise explanations and code snippets for each method.

Deep LearningNeural NetworksRegularization
0 likes · 11 min read
Practical Deep Learning Training Tricks: Cyclic LR, Flooding, Warmup, RAdam, Adversarial Training, Focal Loss, Dropout, Normalization and More
DataFunTalk
DataFunTalk
Aug 10, 2021 · Artificial Intelligence

Practical Deep Learning Tricks: Cyclic LR, Flooding, Warmup, RAdam, Adversarial Training, Focal Loss, Dropout, Normalization, ReLU, Group Normalization, Label Smoothing, Wasserstein GAN, Skip Connections, Weight Initialization

This article presents a concise collection of practical deep‑learning techniques—including cyclic learning‑rate, flooding, warmup, RAdam, adversarial training, focal loss, dropout, various normalization methods, ReLU, group normalization, label smoothing, Wasserstein GAN, skip connections, and weight initialization—along with code snippets and references for implementation.

Deep LearningGANRegularization
0 likes · 8 min read
Practical Deep Learning Tricks: Cyclic LR, Flooding, Warmup, RAdam, Adversarial Training, Focal Loss, Dropout, Normalization, ReLU, Group Normalization, Label Smoothing, Wasserstein GAN, Skip Connections, Weight Initialization
DataFunTalk
DataFunTalk
Apr 5, 2021 · Artificial Intelligence

Summary of Methods and Findings from the NLP Chinese Pre‑training Model Generalization Challenge

The article reviews the Chinese NLP pre‑training model generalization competition, detailing data preprocessing, augmentation, external data usage, model scaling and architecture tweaks, loss functions, learning‑rate and adversarial training strategies, regularization techniques, post‑processing optimizations, and ineffective methods, highlighting their impact on performance metrics.

Loss FunctionsModel OptimizationNLP
0 likes · 15 min read
Summary of Methods and Findings from the NLP Chinese Pre‑training Model Generalization Challenge
Architects' Tech Alliance
Architects' Tech Alliance
Sep 3, 2020 · Artificial Intelligence

Deep Learning Specialization Infographic Overview

This article presents a comprehensive English summary of the deep learning specialization infographics originally shared by Andrew Ng, covering fundamentals, logistic regression, shallow and deep neural networks, regularization, optimization, hyperparameters, convolutional and recurrent networks, and practical advice for model building and evaluation.

CNNDeep LearningNeural Networks
0 likes · 21 min read
Deep Learning Specialization Infographic Overview
Beike Product & Technology
Beike Product & Technology
Mar 21, 2019 · Artificial Intelligence

Optimization Foundations and Applications in Machine Learning and Computer Vision

This article introduces how machine learning problems are formulated as optimization tasks, explains the construction of objective functions with examples such as linear regression, robust fitting, regularization, and demonstrates various applications ranging from K‑means clustering to image inpainting and 3D reconstruction.

Computer VisionRegularizationlinear regression
0 likes · 9 min read
Optimization Foundations and Applications in Machine Learning and Computer Vision
Qunar Tech Salon
Qunar Tech Salon
Oct 10, 2018 · Artificial Intelligence

Introduction to Lasso Regression with scikit-learn

This article provides a comprehensive guide to Lasso regression, covering its theoretical background, scikit-learn API parameters, step‑by‑step Python implementation, cross‑validation for hyper‑parameter tuning, visualization of predictions, and a discussion of its advantages over ridge regression.

Cross‑ValidationData visualizationPython
0 likes · 6 min read
Introduction to Lasso Regression with scikit-learn
Qunar Tech Salon
Qunar Tech Salon
Oct 9, 2018 · Artificial Intelligence

Ridge Regression with scikit-learn: Theory, Implementation, and Example

This article introduces Ridge regression, explains its theory and regularization role, discusses overfitting and bias‑variance trade‑offs, presents scikit‑learn parameters, and provides a complete Python example from data loading to model training, evaluation, and optimal alpha selection.

PythonRegularizationmachine learning
0 likes · 7 min read
Ridge Regression with scikit-learn: Theory, Implementation, and Example