Tagged articles
7 articles
Page 1 of 1
AI Algorithm Path
AI Algorithm Path
Mar 4, 2025 · Artificial Intelligence

How to Control LLM Output Using Temperature, Top‑K, and Top‑P

The article explains how sampling parameters—Temperature, Top‑k, and Top‑p—shape the output of large language models, comparing greedy and beam search, illustrating probability changes with concrete examples, and offering practical guidance on adjusting these settings for different tasks.

Beam SearchGreedy SearchLLM
0 likes · 9 min read
How to Control LLM Output Using Temperature, Top‑K, and Top‑P
Baidu Geek Talk
Baidu Geek Talk
Aug 21, 2023 · Artificial Intelligence

Decoding Strategies for Generative Models: Top‑k, Top‑p, Contrastive Search, Beam Search, and Sampling

The article explains how generative models use deterministic methods like greedy and beam search and stochastic techniques such as top‑k, top‑p, contrastive search and sampling, describing their mechanisms, temperature control, repetition penalties, and practical trade‑offs for balancing fluency, diversity and coherence.

AIBeam SearchSampling
0 likes · 9 min read
Decoding Strategies for Generative Models: Top‑k, Top‑p, Contrastive Search, Beam Search, and Sampling
DataFunTalk
DataFunTalk
Sep 30, 2019 · Artificial Intelligence

Reinforcement Learning for Recommender Systems: Challenges, Solutions, and Key Papers

This article reviews recent advances in applying reinforcement learning to recommendation systems, explains the fundamental RL concepts, discusses the specific challenges such as large action spaces, bias, and long‑term reward modeling, and summarizes two influential YouTube papers along with practical insights and future directions.

Top‑Klong-term rewardoff‑policy
0 likes · 13 min read
Reinforcement Learning for Recommender Systems: Challenges, Solutions, and Key Papers