Tagged articles
7 articles
Page 1 of 1
Machine Heart
Machine Heart
May 1, 2026 · Artificial Intelligence

From PPO to MaxRL: The Evolution of Reinforcement Learning for LLM Inference

This article surveys the rapid evolution of reinforcement‑learning algorithms for large‑language‑model inference from early REINFORCE and PPO to newer approaches such as GRPO, RLOO, DAPO, CISPO, DPPO, ScaleRL and MaxRL, highlighting their design motivations, mathematical formulations, empirical trade‑offs and open research challenges.

GRPOLLMMaxRL
0 likes · 27 min read
From PPO to MaxRL: The Evolution of Reinforcement Learning for LLM Inference
Fun with Large Models
Fun with Large Models
Sep 24, 2025 · Artificial Intelligence

Interview Guide: Core Differences Between PPO and GRPO Algorithms for Large Model Fine‑Tuning

The article explains the fundamental principles of PPO and GRPO reinforcement‑learning algorithms, compares their architectures and training workflows, highlights why GRPO is gaining traction in large‑model fine‑tuning, discusses associated risks, and offers practical guidance on group size selection for engineers preparing for interviews.

GRPOLarge Language ModelsPPO
0 likes · 9 min read
Interview Guide: Core Differences Between PPO and GRPO Algorithms for Large Model Fine‑Tuning
Model Perspective
Model Perspective
Nov 1, 2023 · Artificial Intelligence

Master Differential Evolution: Principles, Code Example, and GA Comparison

This article introduces the Differential Evolution (DE) algorithm, detailing its core concepts of mutation, crossover, and selection, outlines the step-by-step procedure, provides a Python implementation, compares DE with Genetic Algorithms, and highlights practical applications across engineering, machine learning, and image processing.

Differential EvolutionEvolutionary AlgorithmsPython
0 likes · 11 min read
Master Differential Evolution: Principles, Code Example, and GA Comparison
AsiaInfo Technology: New Tech Exploration
AsiaInfo Technology: New Tech Exploration
Oct 31, 2022 · Industry Insights

How Do Process Mining Algorithms Compare? A Deep Dive into Control‑Flow Techniques

This article examines the rapid growth of process‑mining techniques, explains the core concepts and modeling languages, evaluates model quality criteria, and provides a detailed comparison of direct, heuristic, genetic, and log‑classification algorithms from the perspective of control‑flow analysis.

Control Flowalgorithm comparisonprocess discovery
0 likes · 31 min read
How Do Process Mining Algorithms Compare? A Deep Dive into Control‑Flow Techniques
Baobao Algorithm Notes
Baobao Algorithm Notes
Jan 5, 2022 · Artificial Intelligence

When to Use Logistic Regression, SVM, Decision Trees, and More? A Practical Frequency Guide

This article analyzes how often common machine‑learning algorithms such as k‑NN, Naïve Bayes, decision trees, SVM, logistic regression, and neural networks are used in industry, explains their typical scenarios, highlights strengths and weaknesses, and shows how non‑linearity and feature engineering affect their suitability.

algorithm comparisondecision treefeature engineering
0 likes · 12 min read
When to Use Logistic Regression, SVM, Decision Trees, and More? A Practical Frequency Guide
Tencent Cloud Developer
Tencent Cloud Developer
Jun 16, 2021 · Backend Development

Comparison of Four Consistent Hashing Algorithms: Ketama, Rendezvous, Jump Consistent Hash, and Maglev

The article compares four consistent‑hashing algorithms—Ketama’s ring with virtual nodes, Rendezvous’s highest‑random‑weight method, Google’s Jump Consistent Hash, and Maglev’s lookup‑table approach—evaluating their balance, monotonicity, stability, scalability, and time complexity, and concludes that Ketama and Jump offer the best overall trade‑off.

Distributed Systemsalgorithm comparisonconsistent hashing
0 likes · 23 min read
Comparison of Four Consistent Hashing Algorithms: Ketama, Rendezvous, Jump Consistent Hash, and Maglev
Alibaba Cloud Developer
Alibaba Cloud Developer
Jul 24, 2020 · Fundamentals

Why Raft Beats Paxos and EPaxos: A Deep Dive into Distributed Consensus

This article explores the evolution of distributed consensus—from Paxos to Multi‑Paxos, Raft, and EPaxos—examining their mechanisms, understandability, efficiency, availability, and suitable scenarios, while providing comparative analysis and thought‑provoking questions for practitioners in modern cloud systems.

EPaxosPaxosRaft
0 likes · 14 min read
Why Raft Beats Paxos and EPaxos: A Deep Dive into Distributed Consensus