Tagged articles
3 articles
Page 1 of 1
Data Party THU
Data Party THU
Nov 4, 2025 · Artificial Intelligence

Why Evolution Strategies Beat Reinforcement Learning for Large‑Model Fine‑Tuning

This article reviews the paper “Evolution Strategies at Scale: LLM Fine‑Tuning Beyond Reinforcement Learning”, explaining how parameter‑space exploration via ES provides more stable, sample‑efficient, and reproducible fine‑tuning for billion‑parameter LLMs such as Qwen‑2.5 and LLaMA‑3, and detailing the algorithmic and engineering innovations that make full‑parameter ES practical.

Evolution StrategiesParameter Space OptimizationScalable Training
0 likes · 15 min read
Why Evolution Strategies Beat Reinforcement Learning for Large‑Model Fine‑Tuning
DataFunSummit
DataFunSummit
Nov 21, 2023 · Artificial Intelligence

Automatic Hyperparameter Tuning in Tencent Recommendation System (TRS): Techniques, Evolution, and Practice

This article presents an in‑depth overview of Tencent's TRS automatic hyperparameter tuning, covering background, challenges, the evolution from Bayesian optimization to evolution strategies and reinforcement learning, a systematic platform solution, real‑world deployment results, and a Q&A session.

Bayesian OptimizationEvolution StrategiesOnline Learning
0 likes · 20 min read
Automatic Hyperparameter Tuning in Tencent Recommendation System (TRS): Techniques, Evolution, and Practice
GuanYuan Data Tech Team
GuanYuan Data Tech Team
Jul 28, 2022 · Artificial Intelligence

Unlocking Reinforcement Learning: Core Concepts, Algorithms, and Real‑World Applications

This article introduces reinforcement learning by defining agents, environments, rewards, and policies, explains key concepts such as Markov Decision Processes and Bellman equations, and surveys major algorithms—including dynamic programming, Monte‑Carlo, TD learning, policy gradients, Q‑learning, DQN, and evolution strategies—while highlighting practical challenges and notable case studies like AlphaGo Zero.

Deep LearningEvolution StrategiesMDP
0 likes · 27 min read
Unlocking Reinforcement Learning: Core Concepts, Algorithms, and Real‑World Applications