Meituan Technology Team
Meituan Technology Team
Apr 23, 2026 · Artificial Intelligence

LARYBench Introduces an ImageNet‑Style Benchmark for Embodied Action Representations Learned from Human Video

LARYBench (Latent Action Representation Yielding Benchmark) provides the first systematic, ImageNet‑scale evaluation for implicit action representations derived from large‑scale human video, decoupling representation quality from downstream control, and shows that general‑purpose vision models outperform specialized embodied models in both action generalization and control precision across diverse robot morphologies and environments.

RoboticsVision-Language-Actionaction representation
0 likes · 13 min read
LARYBench Introduces an ImageNet‑Style Benchmark for Embodied Action Representations Learned from Human Video
Machine Learning Algorithms & Natural Language Processing
Machine Learning Algorithms & Natural Language Processing
Apr 17, 2026 · Artificial Intelligence

LARYBench: An ImageNet‑Scale Benchmark Unlocks Embodied AI Generalization

Researchers introduce LARYBench, the first large‑scale benchmark for evaluating implicit action representations in embodied AI, providing over 1.2 million annotated video clips, a unified metric for motion semantics, and extensive experiments showing that general visual encoders outperform specialized robot models in action understanding and control.

LARYBenchRoboticsaction representation
0 likes · 12 min read
LARYBench: An ImageNet‑Scale Benchmark Unlocks Embodied AI Generalization