Tagged articles
8 articles
Page 1 of 1
Baobao Algorithm Notes
Baobao Algorithm Notes
Oct 28, 2025 · Artificial Intelligence

Why Entropy Collapse Limits LLM Reinforcement Learning and How to Fix It

The article explains how information entropy, cross‑entropy, and KL‑divergence shape reinforcement learning for large language models, describes the phenomenon of entropy collapse, compares token‑level and policy‑level entropy, and reviews recent methods like Clip‑Cov and KL‑Cov that mitigate this issue.

cross entropyentropypolicy entropy
0 likes · 11 min read
Why Entropy Collapse Limits LLM Reinforcement Learning and How to Fix It
Data STUDIO
Data STUDIO
Sep 15, 2025 · Artificial Intelligence

Understanding Linear and Logistic Regression: From MSE to Cross‑Entropy

The article explains linear regression and logistic regression fundamentals, covering loss functions such as mean‑squared error and cross‑entropy, analytic solutions, feature expansion for non‑linear separability, and provides Python code examples to illustrate the concepts.

Pythoncross entropylinear regression
0 likes · 7 min read
Understanding Linear and Logistic Regression: From MSE to Cross‑Entropy
Model Perspective
Model Perspective
Sep 10, 2024 · Artificial Intelligence

Why Cross-Entropy Is the Key Loss Function for Classification Models

This article explains how loss functions evaluate model performance, contrasts regression’s mean squared error with classification’s cross‑entropy, describes one‑hot encoding and softmax outputs, and shows why higher predicted probabilities for the correct class yield lower loss, highlighting applications in image, language, and speech tasks.

Softmaxclassificationcross entropy
0 likes · 5 min read
Why Cross-Entropy Is the Key Loss Function for Classification Models
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Nov 16, 2022 · Artificial Intelligence

Deep Learning Semantic Segmentation: FCN Source Code Analysis

This tutorial walks through the complete FCN pipeline for semantic segmentation, covering VOC dataset loading, data augmentation, collate functions, model construction, training loops, loss computation with cross‑entropy (including ignore‑index handling), and inference, while providing full PyTorch code snippets for each step.

FCNPyTorchVOC dataset
0 likes · 19 min read
Deep Learning Semantic Segmentation: FCN Source Code Analysis
Code DAO
Code DAO
May 16, 2022 · Artificial Intelligence

How to Build a Simple Neural Network from Scratch with NumPy

This article walks through implementing a basic multi‑layer neural network using only NumPy, covering terminology, network architecture, forward and backward propagation, activation functions, loss calculation, parameter updates with SGD, and compares the custom model with a Keras implementation.

BackpropagationNeural NetworkNumPy
0 likes · 17 min read
How to Build a Simple Neural Network from Scratch with NumPy
Code DAO
Code DAO
May 6, 2022 · Fundamentals

Information Theory Foundations for Machine Learning and Deep Learning

The article explains Shannon information content, entropy, cross‑entropy, KL‑divergence, conditional entropy and mutual information, illustrating each concept with coin‑flip and dice examples, visual formulas, and discusses their roles as loss functions and evaluation metrics in machine‑learning models.

KL divergencecross entropyentropy
0 likes · 8 min read
Information Theory Foundations for Machine Learning and Deep Learning
21CTO
21CTO
Feb 7, 2018 · Artificial Intelligence

Demystifying Entropy: From Basic Concepts to Cross‑Entropy and KL Divergence

This article explains entropy, joint entropy, conditional entropy, and related measures such as KL divergence and cross‑entropy, using intuitive coin‑flip examples and mathematical formulas to show how they quantify uncertainty and information in probability distributions.

KL divergencecross entropyentropy
0 likes · 14 min read
Demystifying Entropy: From Basic Concepts to Cross‑Entropy and KL Divergence