Tagged articles

imitation learning

12 articles · Page 1 of 1

May 10, 2026 · Artificial Intelligence

Embodied AI Unveiled: Ted Xiao Revisits Three Eras of Robot Learning from Google RT‑1/2 to SayCan

In a detailed interview, Ted Xiao, a former Google DeepMind researcher, walks through the existence‑proof, foundation‑model, and scaling eras of embodied robot learning, explaining the technical challenges, pivotal decisions, and the evolving role of large language and vision models in robotics.

Embodied AI · Foundation Models · imitation learning

0 likes · 19 min read

Machine Heart

Apr 5, 2026 · Artificial Intelligence

How Imitation Learning Powers Dexterous Manipulation: A 2021‑2025 Technical Roadmap

This survey maps the 2021‑2025 progress of imitation learning for dexterous manipulation, detailing theoretical foundations, datasets, algorithms, hardware platforms, and evaluation protocols, and highlights challenges such as data quality, hardware dependence, and the need for standardized benchmarks to advance embodied AI.

Evaluation · algorithms · datasets

0 likes · 11 min read

Bighead's Algorithm Notes

Mar 24, 2026 · Artificial Intelligence

How an Interactive Imitation‑Learning Agent Framework Trains Robust Trading Strategies

The article analyzes the simulation‑reality gap in algorithmic trading and proposes an interactive market simulator that combines a pool of imitation‑learning agents, an action‑synthesis network, and a DDPG‑based reinforcement‑learning trader, showing superior robustness and downside protection on QQQ data.

Agent-based Modeling · DDPG · financial AI

0 likes · 16 min read

Alibaba Cloud Big Data AI Platform

Nov 10, 2025 · Artificial Intelligence

How to Boost Robot Imitation Learning with Cosmos World Model Data Augmentation

This guide demonstrates an end‑to‑end workflow on Alibaba Cloud PAI that uses the Cosmos world model to replace Isaac simulation for robot action data augmentation, including minimal human demonstrations, prompt‑driven data expansion, rejection sampling, IDM inverse‑kinematics extraction, imitation‑learning fine‑tuning, and model evaluation.

AI · Cosmos · data augmentation

0 likes · 17 min read

Alibaba Cloud Big Data AI Platform

Nov 3, 2025 · Artificial Intelligence

Build Physical AI with Isaac Lab: Data Augmentation, Imitation Learning & Evaluation

This article walks through an end‑to‑end Physical AI workflow on Alibaba Cloud PAI, covering robot teleoperation data collection, Isaac Lab‑based data augmentation, imitation‑learning model training, distributed DLC execution, and systematic evaluation across varied visual conditions.

Simulation · data augmentation · imitation learning

0 likes · 17 min read

Amap Tech

Oct 5, 2025 · Artificial Intelligence

Can One Navigation Brain Power All Robots? Inside CE-Nav’s Cross‑Embodiment Breakthrough

CE-Nav introduces a two‑stage imitation‑then‑reinforcement framework that decouples generic geometric planning from robot‑specific dynamics, enabling low‑cost, high‑performance navigation across quadrupeds, humanoids, and drones while requiring only brief online fine‑tuning.

Simulation · VelFlow · cross‑embodiment

0 likes · 11 min read

Data Party THU

Aug 9, 2025 · Artificial Intelligence

Demystifying MaxEnt Inverse Reinforcement Learning: Theory, Algorithms, and Practical Implementation

This article provides a comprehensive, step‑by‑step exploration of MaxEnt Inverse Reinforcement Learning, covering its statistical foundations, feature‑expectation matching, algorithmic details, deep extensions, and practical engineering considerations for complex decision‑making tasks.

Deep IRL · Feature Matching · imitation learning

0 likes · 21 min read

AI Frontier Lectures

May 31, 2025 · Artificial Intelligence

Why Embodied Intelligence Is Exploding and What It Means for the Future

The article analyzes the recent surge in embodied intelligence, examines why physical agents matter despite advances in large language models, outlines common failure modes, discusses key research decisions such as 2D versus 3D perception and tactile sensing, and explores the roles of imitation learning, VLA, and reinforcement learning in shaping the field.

VLA · Vision · imitation learning

0 likes · 24 min read

Didi Tech

Jun 13, 2023 · Operations

Supply-Demand Dynamics and Regulation Techniques in Didi’s Ride-Hailing Platform

Didi balances ride‑hailing supply and demand by forecasting regional needs with time‑series and deep‑learning models, then optimally repositioning drivers through integer programming and refining policies via imitation and offline reinforcement learning, ultimately enhancing passenger experience and platform efficiency.

Didi · Ride Hailing · forecasting

0 likes · 16 min read

GuanYuan Data Tech Team

Sep 8, 2022 · Artificial Intelligence

How AI Reinforcement Learning Transforms Smart Replenishment in Retail

This article examines the technical challenges of intelligent replenishment—model stability, complexity, generalization, and interpretability—and explains how a few‑shot imitation learning and inverse reinforcement learning framework can overcome these issues to deliver reliable, low‑cost AI‑driven supply‑chain decisions.

AI · imitation learning · model stability

0 likes · 22 min read

JD Cloud Developers

Mar 3, 2022 · Artificial Intelligence

How JD Explore’s Silver‑Bullet‑3D Dominated the SAPIEN ManiSkill Challenge

JD Explore Research Institute’s Visual and Multimedia Lab team “Silver‑Bullet‑3D” secured top positions in the 2021 SAPIEN ManiSkill Challenge by excelling in both imitation‑learning and rule‑based tracks, showcasing cutting‑edge computer‑vision and robotic‑arm control technologies that earned them international recognition.

AI competition · computer vision · imitation learning

0 likes · 5 min read

Tencent Cloud Developer

Mar 15, 2018 · Artificial Intelligence

Learning Long-Horizon Surgical Robot Tasks via Transition State Clustering, SWIRL, and DDCO

The article surveys three recent approaches—Transition State Clustering, Sequential Windowed Inverse Reinforcement Learning, and Deep Discovery of Continuous Options—that automatically segment long‑horizon surgical‑robot demonstrations into sub‑tasks, learn hierarchical policies from limited data, and achieve markedly higher success rates on da Vinci cutting, tension, and needle‑picking tasks.

hierarchical learning · imitation learning · reinforcement learning

0 likes · 18 min read
