May 30, 2025 · Artificial Intelligence
Why Layer Normalization Stabilizes Transformers: A Deep Dive
This article explains the mathematical foundation of layer normalization: why deep networks like Transformers need it, how the learnable scale (γ) and shift (β) parameters restore the representational capacity that normalization removes, and where to place normalization layers for stable training.
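As a quick orientation before the deep dive, here is a minimal sketch of layer normalization in NumPy (an illustrative implementation, not the article's own code): each input row is normalized to zero mean and unit variance over its feature dimension, then the learnable γ and β parameters rescale and shift the result.

```python
import numpy as np

def layer_norm(x, gamma, beta, eps=1e-5):
    # Normalize over the feature dimension (last axis).
    mean = x.mean(axis=-1, keepdims=True)
    var = x.var(axis=-1, keepdims=True)
    x_hat = (x - mean) / np.sqrt(var + eps)
    # gamma (scale) and beta (shift) let the network restore
    # signal variations that normalization would otherwise erase.
    return gamma * x_hat + beta

x = np.array([[1.0, 2.0, 3.0],
              [4.0, 6.0, 8.0]])
gamma = np.ones(3)   # initialized to 1: identity scaling
beta = np.zeros(3)   # initialized to 0: no shift
out = layer_norm(x, gamma, beta)
print(out.mean(axis=-1))  # per-row means are approximately 0
```

With γ = 1 and β = 0, the output is purely the normalized signal; during training these parameters are learned, so the layer can recover any useful mean and scale.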
Tags: Bias · Layer Normalization · Scaling
