Tagged articles
AI Algorithm Path
Feb 19, 2025 · Artificial Intelligence

How Temperature Shapes Output in Large Language Models

The article explains the temperature hyper‑parameter in large language models, shows how it rescales the logits before the softmax, provides a Python visualisation script, and demonstrates through experiments that higher values flatten the distribution and make outputs more varied, while lower values sharpen it and make outputs more deterministic.
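
As a rough illustration of the effect described in this summary, here is a minimal sketch of a temperature-scaled softmax. It is not the article's own script; the function name `softmax_with_temperature` and the example logits are assumptions made for this sketch.

```python
import math

def softmax_with_temperature(logits, temperature=1.0):
    """Divide the logits by the temperature, then apply a softmax."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [x / temperature for x in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]

logits = [2.0, 1.0, 0.1]  # hypothetical token scores, for illustration only
for t in (0.5, 1.0, 2.0):
    probs = softmax_with_temperature(logits, t)
    print(t, [round(p, 3) for p in probs])
```

With these inputs, the probability of the top-scoring token shrinks as the temperature rises, so sampling from the result becomes less predictable.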

Large Language Models, Python, Sampling
5 min read
Model Perspective
Sep 10, 2024 · Artificial Intelligence

Why Cross-Entropy Is the Key Loss Function for Classification Models

This article explains how loss functions evaluate model performance, contrasts regression’s mean squared error with classification’s cross‑entropy, describes one‑hot encoding and softmax outputs, and shows why higher predicted probabilities for the correct class yield lower loss, highlighting applications in image, language, and speech tasks.
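
The claim that a higher predicted probability for the correct class gives a lower loss can be checked with a small sketch. The helper `cross_entropy` and the two probability vectors below are assumptions chosen for illustration, not taken from the article.

```python
import math

def cross_entropy(one_hot, probs, eps=1e-12):
    """Cross-entropy between a one-hot label and predicted probabilities."""
    # eps guards against log(0) if a predicted probability is exactly zero
    return -sum(t * math.log(max(p, eps)) for t, p in zip(one_hot, probs))

label = [0, 1, 0]              # the correct class is index 1
confident = [0.1, 0.8, 0.1]    # hypothetical softmax output, mostly right
unsure = [0.4, 0.3, 0.3]       # hypothetical softmax output, mostly wrong

print(cross_entropy(label, confident))
print(cross_entropy(label, unsure))
```

Because the label is one-hot, only the term for the correct class contributes, so the loss reduces to the negative log of the probability assigned to that class.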

Softmax, classification, cross entropy
5 min read
dbaplus Community
Nov 10, 2016 · Artificial Intelligence

Demystifying Recurrent Neural Networks: Theory, Training, and Implementation

This article covers the fundamentals of recurrent neural networks (RNNs), their role in language modeling, RNN architectures such as bidirectional and deep RNNs, the back‑propagation through time (BPTT) training algorithm, gradient challenges, and vectorization techniques, and it closes with a step‑by‑step code implementation.

BPTT, Deep Learning, Language Model
21 min read