Agentic RL: Transforming LLMs into Autonomous Decision‑Making Agents

This survey formalizes the shift from preference‑based reinforcement fine‑tuning to Agentic Reinforcement Learning, defines Agentic RL via MDP/POMDP abstractions, proposes a dual taxonomy of capabilities and task domains, compiles over 500 recent works, and outlines open challenges for scalable, robust AI agents.

AI agentsLLMPOMDP

0 likes · 12 min read

Agentic RL: Transforming LLMs into Autonomous Decision‑Making Agents

Bighead's Algorithm Notes

Sep 14, 2025 · Artificial Intelligence

How MM‑DREX Uses Multimodal LLMs for Dynamic Expert Routing in Financial Trading

The article reviews the MM‑DREX framework, which tackles the non‑stationarity of financial markets by modeling trading as a POMDP, employing a vision‑language model‑driven dynamic router to allocate four heterogeneous experts, and demonstrates superior returns, Sharpe ratios, and drawdown control across stocks, futures, and crypto compared with 15 strong baselines.

LLMPOMDPReinforcement Learning

0 likes · 13 min read

How MM‑DREX Uses Multimodal LLMs for Dynamic Expert Routing in Financial Trading

Code DAO

Apr 28, 2022 · Artificial Intelligence

Model-Based Reinforcement Learning from Raw Video: A Detailed Walkthrough

The article explains how to train robots to learn tasks directly from raw video using model-based reinforcement learning, covering POMDP formulation, CNN auto‑encoders, latent‑space representations, iLQR optimization, and a step‑by‑step pipeline with concrete examples and references.

CNN autoencoderPOMDPReinforcement Learning

0 likes · 11 min read

Model-Based Reinforcement Learning from Raw Video: A Detailed Walkthrough

DataFunTalk

Feb 27, 2020 · Artificial Intelligence

Technical Challenges in Planning and Control for Autonomous Heavy Trucks

The article reviews the complex system model of autonomous heavy trucks, outlines traditional and modern planning and control methods—including rule‑based FSM, POMDP, learning‑based and optimization techniques—highlights safety, efficiency, fuel‑economy, and dynamic modeling challenges specific to heavy‑truck and trailer configurations, and shares practical attempts such as lane‑changing, merging, and trailer‑aware trajectory planning.

POMDPPlanningautonomous driving

0 likes · 13 min read

Technical Challenges in Planning and Control for Autonomous Heavy Trucks