Tagged articles

23 articles

Page 1 of 1

Mar 20, 2026 · Artificial Intelligence

Plug‑and‑Play reAR Boosts Visual AR to SOTA Quality with Only 177M Parameters

The paper introduces reAR, a plug‑and‑play regularization framework that aligns generator and tokenizer representations in visual autoregressive models, dramatically improving image quality and matching large diffusion models while using far fewer parameters, and validates the approach with extensive experiments, ablations, and scalability analysis.

AI researchRegularizationimage generation

0 likes · 20 min read

Plug‑and‑Play reAR Boosts Visual AR to SOTA Quality with Only 177M Parameters

Tencent Cloud Developer

Nov 4, 2025 · Artificial Intelligence

From Functions to Transformers: Mastering Neural Networks Step by Step

This article walks you through the evolution from basic mathematical functions to modern large‑scale models, explaining activation functions, forward and backward propagation, loss calculation, gradient descent, regularization, dropout, word embeddings, RNNs, and the core mechanics of the Transformer architecture.

Attention MechanismDeep LearningNeural Networks

0 likes · 15 min read

From Functions to Transformers: Mastering Neural Networks Step by Step

IT Services Circle

Sep 5, 2025 · Artificial Intelligence

10 Must‑Know Tencent AI Interview Topics: Overfitting, Dropout, Transformers & Beyond

This article compiles the ten core questions from a Tencent algorithm interview, covering overfitting, regularization, generalization error, dropout, residual connections, attention, embeddings, BART vs BERT, instruction‑tuning data, LLM hallucination, and why GANs collapse more than diffusion models, with concise explanations and interview‑ready tips.

GANLLMRegularization

0 likes · 22 min read

10 Must‑Know Tencent AI Interview Topics: Overfitting, Dropout, Transformers & Beyond

Data Party THU

Aug 18, 2025 · Artificial Intelligence

Unlock XGBoost Performance: Master the Core Parameters

This article provides a detailed, visual guide to XGBoost's most important hyper‑parameters—such as max_depth, min_child_weight, learning_rate, gamma, subsample, colsample_bytree, scale_pos_weight, alpha, and lambda—explaining how each influences tree complexity, regularization, and model generalization, and offering practical examples for effective tuning.

Model OptimizationRegularizationXGBoost

0 likes · 12 min read

Unlock XGBoost Performance: Master the Core Parameters

AI Frontier Lectures

Jul 10, 2025 · Artificial Intelligence

Can Dispersive Loss Supercharge Diffusion Models Without Extra Pre‑training?

Dispersive Loss is a plug‑and‑play regularization technique that enhances diffusion‑based generative models by encouraging dispersed internal representations, requiring no additional pre‑training, parameters, or data, and consistently improves performance across various model sizes and configurations, as demonstrated through extensive experiments.

Dispersive LossModel EvaluationRegularization

0 likes · 18 min read

Can Dispersive Loss Supercharge Diffusion Models Without Extra Pre‑training?

AI Large Model Application Practice

Jul 7, 2025 · Artificial Intelligence

Understanding Dropout: Preventing Overfitting in Neural Networks

This article explains what overfitting is, introduces dropout as a regularization technique, describes how dropout randomly deactivates neurons during training and rescales outputs during inference, discusses its limitations, and outlines why large language models may use alternative strategies.

AIDropoutNeural Networks

0 likes · 11 min read

Understanding Dropout: Preventing Overfitting in Neural Networks

Baobao Algorithm Notes

May 12, 2025 · Artificial Intelligence

Why Dropout Is Dropped in Large‑Scale Model Training: Effects, Efficiency, Stability

Training massive AI models now commonly omits dropout because its original scaling trick fails to match training and inference distributions, leading to poorer performance, higher computational cost, and instability, while alternative regularization like normalization remains useful, as illustrated by practical observations and historical tricks.

AI stabilityDropoutRegularization

0 likes · 6 min read

Why Dropout Is Dropped in Large‑Scale Model Training: Effects, Efficiency, Stability

AntTech

May 8, 2025 · Artificial Intelligence

How Recommendation Models Amplify Popularity Bias: A Spectral Perspective and the ReSN Correction Method

The award‑winning WSDM 2025 paper reveals how recommendation models magnify popularity bias through spectral alignment of the rating matrix’s leading singular vector with item popularity, and proposes the ReSN regularization technique to mitigate this bias efficiently.

FairnessRecommendation SystemsRegularization

0 likes · 8 min read

How Recommendation Models Amplify Popularity Bias: A Spectral Perspective and the ReSN Correction Method

Ops Development & AI Practice

Jul 8, 2024 · Artificial Intelligence

Essential Denoising Techniques for Training Large AI Models

This article outlines key denoising methods—including data cleaning, augmentation, regularization, adversarial training, and self‑supervised learning—that improve the performance, generalization, and robustness of large neural network and transformer models.

DenoisingRegularizationadversarial training

0 likes · 5 min read

Essential Denoising Techniques for Training Large AI Models

Rare Earth Juejin Tech Community

May 5, 2024 · Artificial Intelligence

Comprehensive Guide to Neural Network Algorithms: Definitions, Structure, Implementation, and Training

This article provides an in‑depth tutorial on neural network algorithms, covering their biological inspiration, significance, advantages and drawbacks, detailed architecture, data preparation, one‑hot encoding, weight initialization, forward and backward propagation, cost functions, regularization, gradient checking, and complete Python code examples.

AIBackpropagationNeural Networks

0 likes · 37 min read

Comprehensive Guide to Neural Network Algorithms: Definitions, Structure, Implementation, and Training

DataFunSummit

Feb 14, 2023 · Artificial Intelligence

Deep Learning Hyperparameter Tuning and Training Tips: Insights from Zhihu Experts

This article compiles practical deep learning training and hyperparameter tuning advice from Zhihu contributors, covering model debugging, learning‑rate strategies, optimizer choices, data preprocessing, regularization techniques, initialization methods, common pitfalls, recommended research papers, and ensemble approaches.

Deep LearningRegularizationgradient clipping

0 likes · 13 min read

Deep Learning Hyperparameter Tuning and Training Tips: Insights from Zhihu Experts

Model Perspective

Jan 15, 2023 · Artificial Intelligence

Mastering Model Evaluation: Key Metrics, Validation Techniques, and Diagnostics

This guide explains essential evaluation metrics for classification and regression models—including confusion matrix, ROC/AUC, R², and main performance indicators—covers model selection strategies such as train‑validation‑test splits, k‑fold cross‑validation, and regularization techniques, and discusses bias‑variance trade‑offs and diagnostic tools.

Evaluation MetricsModel SelectionRegularization

0 likes · 6 min read

Mastering Model Evaluation: Key Metrics, Validation Techniques, and Diagnostics

Model Perspective

Jul 22, 2022 · Artificial Intelligence

Understanding Ridge Regression: Definitions, Properties, and Parameter Selection

This article explains ridge regression by defining the estimator, outlining its key properties, discussing methods for choosing the ridge parameter, and demonstrating its application to economic data with Python code and visualizations.

PythonRegularizationlinear models

0 likes · 6 min read

Understanding Ridge Regression: Definitions, Properties, and Parameter Selection

Model Perspective

Jul 21, 2022 · Artificial Intelligence

Tackling Multicollinearity: Ridge and LASSO Regression Explained with Python

This article explains how multicollinearity undermines ordinary least squares estimates, introduces ridge and LASSO regularization as remedies, and demonstrates their application on a 1966 French economic dataset using Python’s statsmodels, complete with code and interpretation of results.

LASSOPythonRegularization

0 likes · 7 min read

Tackling Multicollinearity: Ridge and LASSO Regression Explained with Python

Code DAO

Dec 18, 2021 · Artificial Intelligence

Essential Feature Selection Techniques for Machine Learning

This article explains why feature selection is crucial for building robust machine‑learning models and walks through popular filter, wrapper, and embedded methods—including information gain, chi‑square, LASSO, random‑forest importance, and PCA—providing code examples and practical guidance.

PCARegularizationembedded methods

0 likes · 18 min read

Essential Feature Selection Techniques for Machine Learning

Code DAO

Dec 5, 2021 · Artificial Intelligence

Why DropBlock Outperforms Dropout as an Image Regularizer

This article demonstrates how to implement DropBlock in PyTorch, explains why Dropout fails on image data, details the gamma calculation and mask generation, and shows visual comparisons that illustrate the superiority of contiguous region dropping over random pixel dropout.

Computer VisionDeep LearningDropBlock

0 likes · 11 min read

Why DropBlock Outperforms Dropout as an Image Regularizer

DataFunTalk

Dec 4, 2021 · Artificial Intelligence

Practical Deep Learning Training Tricks: Cyclic LR, Flooding, Warmup, RAdam, Adversarial Training, Focal Loss, Dropout, Normalization and More

This article compiles essential deep learning training techniques—including cyclic learning rates, flooding, warmup, RAdam optimizer, adversarial training, focal loss, dropout, batch/group/weight normalization, label smoothing, Wasserstein GAN, skip connections, and weight initialization—providing concise explanations and code snippets for each method.

Deep LearningNeural NetworksRegularization

0 likes · 11 min read

Practical Deep Learning Training Tricks: Cyclic LR, Flooding, Warmup, RAdam, Adversarial Training, Focal Loss, Dropout, Normalization and More

DataFunTalk

Aug 10, 2021 · Artificial Intelligence

Practical Deep Learning Tricks: Cyclic LR, Flooding, Warmup, RAdam, Adversarial Training, Focal Loss, Dropout, Normalization, ReLU, Group Normalization, Label Smoothing, Wasserstein GAN, Skip Connections, Weight Initialization

This article presents a concise collection of practical deep‑learning techniques—including cyclic learning‑rate, flooding, warmup, RAdam, adversarial training, focal loss, dropout, various normalization methods, ReLU, group normalization, label smoothing, Wasserstein GAN, skip connections, and weight initialization—along with code snippets and references for implementation.

Deep LearningGANRegularization

0 likes · 8 min read

Practical Deep Learning Tricks: Cyclic LR, Flooding, Warmup, RAdam, Adversarial Training, Focal Loss, Dropout, Normalization, ReLU, Group Normalization, Label Smoothing, Wasserstein GAN, Skip Connections, Weight Initialization

DataFunTalk

Apr 5, 2021 · Artificial Intelligence

Summary of Methods and Findings from the NLP Chinese Pre‑training Model Generalization Challenge

The article reviews the Chinese NLP pre‑training model generalization competition, detailing data preprocessing, augmentation, external data usage, model scaling and architecture tweaks, loss functions, learning‑rate and adversarial training strategies, regularization techniques, post‑processing optimizations, and ineffective methods, highlighting their impact on performance metrics.

Loss FunctionsModel OptimizationNLP

0 likes · 15 min read

Summary of Methods and Findings from the NLP Chinese Pre‑training Model Generalization Challenge

Architects' Tech Alliance

Sep 3, 2020 · Artificial Intelligence

Deep Learning Specialization Infographic Overview

This article presents a comprehensive English summary of the deep learning specialization infographics originally shared by Andrew Ng, covering fundamentals, logistic regression, shallow and deep neural networks, regularization, optimization, hyperparameters, convolutional and recurrent networks, and practical advice for model building and evaluation.

CNNDeep LearningNeural Networks

0 likes · 21 min read

Deep Learning Specialization Infographic Overview

Beike Product & Technology

Mar 21, 2019 · Artificial Intelligence

Optimization Foundations and Applications in Machine Learning and Computer Vision

This article introduces how machine learning problems are formulated as optimization tasks, explains the construction of objective functions with examples such as linear regression, robust fitting, regularization, and demonstrates various applications ranging from K‑means clustering to image inpainting and 3D reconstruction.

Computer VisionRegularizationlinear regression

0 likes · 9 min read

Optimization Foundations and Applications in Machine Learning and Computer Vision

Qunar Tech Salon

Oct 10, 2018 · Artificial Intelligence

Introduction to Lasso Regression with scikit-learn

This article provides a comprehensive guide to Lasso regression, covering its theoretical background, scikit-learn API parameters, step‑by‑step Python implementation, cross‑validation for hyper‑parameter tuning, visualization of predictions, and a discussion of its advantages over ridge regression.

Cross‑ValidationData visualizationPython

0 likes · 6 min read

Introduction to Lasso Regression with scikit-learn

Qunar Tech Salon

Oct 9, 2018 · Artificial Intelligence

Ridge Regression with scikit-learn: Theory, Implementation, and Example

This article introduces Ridge regression, explains its theory and regularization role, discusses overfitting and bias‑variance trade‑offs, presents scikit‑learn parameters, and provides a complete Python example from data loading to model training, evaluation, and optimal alpha selection.

PythonRegularizationmachine learning

0 likes · 7 min read

Ridge Regression with scikit-learn: Theory, Implementation, and Example