Machine Heart
Jul 4, 2026 · Artificial Intelligence
When Swapping Two Images Breaks VLMs: EgoTSR Enables Robots to Judge Real Task Progress
The paper shows that vision-language models often rely on chronological bias, treating whichever frame appears later as evidence of progress. It introduces EgoTSR, a 46-million-sample egocentric dataset and three-stage curriculum that teaches models to assess the actual task state, evaluates them with forward-reverse tests, and reports over 92% accuracy on long-term robotic tasks.
chronological-bias · curriculum-learning · ego-centric reasoning
11 min read
