Tagged articles
4 articles
Page 1 of 1
Data Party THU
Data Party THU
Sep 15, 2025 · Artificial Intelligence

Agentic RL: Transforming LLMs into Autonomous Decision‑Making Agents

This survey formalizes the shift from preference‑based reinforcement fine‑tuning to Agentic Reinforcement Learning, defines Agentic RL via MDP/POMDP abstractions, proposes a dual taxonomy of capabilities and task domains, compiles over 500 recent works, and outlines open challenges for scalable, robust AI agents.

AI agentsLLMPOMDP
0 likes · 12 min read
Agentic RL: Transforming LLMs into Autonomous Decision‑Making Agents
Bighead's Algorithm Notes
Bighead's Algorithm Notes
Sep 14, 2025 · Artificial Intelligence

How MM‑DREX Uses Multimodal LLMs for Dynamic Expert Routing in Financial Trading

The article reviews the MM‑DREX framework, which tackles the non‑stationarity of financial markets by modeling trading as a POMDP, employing a vision‑language model‑driven dynamic router to allocate four heterogeneous experts, and demonstrates superior returns, Sharpe ratios, and drawdown control across stocks, futures, and crypto compared with 15 strong baselines.

LLMPOMDPReinforcement Learning
0 likes · 13 min read
How MM‑DREX Uses Multimodal LLMs for Dynamic Expert Routing in Financial Trading
Code DAO
Code DAO
Apr 28, 2022 · Artificial Intelligence

Model-Based Reinforcement Learning from Raw Video: A Detailed Walkthrough

The article explains how to train robots to learn tasks directly from raw video using model-based reinforcement learning, covering POMDP formulation, CNN auto‑encoders, latent‑space representations, iLQR optimization, and a step‑by‑step pipeline with concrete examples and references.

CNN autoencoderPOMDPReinforcement Learning
0 likes · 11 min read
Model-Based Reinforcement Learning from Raw Video: A Detailed Walkthrough
DataFunTalk
DataFunTalk
Feb 27, 2020 · Artificial Intelligence

Technical Challenges in Planning and Control for Autonomous Heavy Trucks

The article reviews the complex system model of autonomous heavy trucks, outlines traditional and modern planning and control methods—including rule‑based FSM, POMDP, learning‑based and optimization techniques—highlights safety, efficiency, fuel‑economy, and dynamic modeling challenges specific to heavy‑truck and trailer configurations, and shares practical attempts such as lane‑changing, merging, and trailer‑aware trajectory planning.

POMDPPlanningautonomous driving
0 likes · 13 min read
Technical Challenges in Planning and Control for Autonomous Heavy Trucks