Tagged articles

world model

41 articles · Page 1 of 1

Machine Learning Algorithms & Natural Language Processing

Jul 2, 2026 · Artificial Intelligence

From Agentic Tools to Agentive Systems: A Review of “Critique of Agent Model”

The paper distinguishes agentic tools that rely on external scaffolding from truly agentive systems whose goals, identity, decision‑making, self‑regulation and learning are internalized, proposes the GIC (Goal‑Identity‑Configurator) architecture, and evaluates its safety, auditability and applicability through a pilot‑training use case.

AI agents · GIC architecture · agency

0 likes · 19 min read

Machine Heart

Jun 29, 2026 · Artificial Intelligence

How MWA™'s Long‑Sequence Bidirectional Physical Causal Chain Sets a New Record in Embodied AI

The article presents MWA™, the first long‑sequence bidirectional physical causal chain latent‑space world model, details its bidirectional dynamics, latent‑action pre‑training, three‑gradient constraints and AnyPhys negative‑sample system, and shows it achieved a 75.2% success rate on the RoboCasa GR1 TableTop benchmark, surpassing leading competitors.

AnyPhys · Embodied AI · RoboCasa benchmark

0 likes · 14 min read

Machine Heart

Jun 26, 2026 · Artificial Intelligence

How RuoYu Technology Secured China’s First Explosion‑Proof Certification and the World’s First Fueling‑Brain Robot

Amid a booming Chinese embodied‑intelligence market, RuoYu Technology’s explosion‑proof robot RuoYu LanYue 01, powered by the self‑developed RuoYu Jiutian brain, achieved the nation’s first explosion‑proof certification and the world’s first fueling‑brain solution, demonstrating end‑to‑end perception‑planning‑execution across fueling stations, oil‑gas fields, and ports.

Embodied AI · H-GAR · explosion-proof robotics

0 likes · 16 min read

Machine Learning Algorithms & Natural Language Processing

Jun 24, 2026 · Industry Insights

Chinese Team Brings World‑Model AI to Mass Production – The Physical‑World Anthropic

The article analyzes how world‑model AI, which predicts the next physical frame instead of the next word, is reshaping autonomous driving, highlights Momenta's three‑stage R7 architecture and massive data loop, compares its path with Anthropic's software‑only strategy, and projects a multi‑trillion‑dollar physical‑AI market by 2030.

AI market · Anthropic · Momenta

0 likes · 10 min read

Machine Heart

Jun 23, 2026 · Industry Insights

Momenta's Physical AI IPO: World Model as the New AI Foundation

Momenta has cleared the Hong Kong Stock Exchange listing hearing, positioning itself as the first "Physical AI" stock with its R7 World Model—a three‑layer architecture that leverages massive real‑world driving data, simulation and reinforcement learning—to challenge Nvidia, Tesla and Anthropic while targeting a multi‑trillion‑dollar market.

AI market · Commercial Loop · Data Scaling

0 likes · 17 min read

Machine Heart

Jun 21, 2026 · Artificial Intelligence

Can World Models Bridge LLMs' Dynamic Reasoning Gaps?

The article analyzes why large language model agents struggle with dynamic tasks, critiques existing CoT‑style optimizations, and shows how recent world‑model approaches such as EvoAgent, WebEvolver, COMAP, RWML and ProPlay quantitatively improve prediction, planning and success rates in evolving environments.

Agent · CoT · EvoAgent

0 likes · 9 min read

Machine Heart

Jun 18, 2026 · Artificial Intelligence

Curr-0: Enabling Humanoid Robots to Perform Continuous Full-Body Dexterous Operations

Current Robotics introduces Curr-0, a single-policy model that unifies locomotion, whole-body posture coordination, and fine hand manipulation for humanoid robots with 70-plus degrees of freedom, trained on 21,000 hours of human behavior data collected via the HumanEx exoskeleton system, and supported by a multi-modal world model for scalable evaluation and deployment.

Humanoid Robot · full-body dexterity · human behavior dataset

0 likes · 6 min read

Machine Learning Algorithms & Natural Language Processing

Jun 15, 2026 · Artificial Intelligence

A Comprehensive Survey of Agentic Time Series Systems: Architecture, Reliability, and Research Frontiers

This survey maps the emerging field of agentic time‑series systems, outlining a five‑layer architecture that integrates perception, reasoning, planning, memory, and world modeling, while emphasizing reliability constraints, benchmark evolution, diverse applications, and six key research frontiers.

LLM · Reliability · agentic time series

0 likes · 27 min read

Machine Heart

Jun 11, 2026 · Artificial Intelligence

MBench: Tsinghua and Tencent Define Long-Term Memory for Video World Models

MBench, a new benchmark from Tsinghua University and Tencent, systematically evaluates the long‑term memory ability of streaming video generation models across entity, environment, and causal consistency, introduces a trigger‑conditioned scoring scheme, and reveals that memory remains a major bottleneck for current SOTA models.

AI · Benchmark · long-term consistency

0 likes · 8 min read

PaperAgent

Jun 5, 2026 · Artificial Intelligence

Tongji’s “Boundless” World Model Wins Open‑Source #1 and Overall #2 in WorldArena

The Tongji University “Boundless” world model achieved the top open‑source score (64.54) and the second‑overall rank (67.87) on WorldArena’s Track‑1, demonstrating high‑quality video generation, stable long‑sequence physics, and embodied interaction across six evaluation dimensions, while using data‑efficient training and a hybrid open/closed‑source strategy.

Boundless · Embodied AI · Open-source

0 likes · 9 min read

SuanNi

May 31, 2026 · Artificial Intelligence

How NVIDIA’s Gamma‑World Turns Single‑Agent Models into Multiplayer Experiences

Gamma‑World introduces a multi‑agent world model that solves identity, interaction, and real‑time inference challenges with parameter‑free geometric encoding, sparse hub attention, and teacher‑student distillation, enabling zero‑shot generalization from two to four agents and achieving 24 FPS interactive video generation.

Gamma-World · Real-time inference · Simplex Rotary Agent Encoding

0 likes · 11 min read

Machine Heart

May 29, 2026 · Artificial Intelligence

ZhiYuan’s GE 2.0 Wins WorldArena World Model Championship – How It Achieved Bare‑Bones Victory

ZhiYuan’s Genie Envisioner‑Sim 2.0 (GE 2.0) captured the overall WorldArena world‑model title without any task‑specific tuning, demonstrating superior long‑sequence stability, multi‑view generation, real‑time inference and a closed‑loop reward feedback loop that outperforms industry baselines across 16 metrics and three real‑world tasks.

Closed-loop Evaluation · Embodied AI · GE 2.0

0 likes · 9 min read

Xiaomi Tech

May 26, 2026 · Artificial Intelligence

Xiaomi Auto Unveils Integrated Reconstruction‑Generation World Model Framework Achieving SOTA on Major Benchmarks

Xiaomi Auto introduces a novel world‑model framework that tightly couples 3D reconstruction and generative prediction, delivering state‑of‑the‑art performance on Waymo and nuScenes benchmarks while enabling high‑fidelity, long‑duration video synthesis for autonomous‑driving scenarios.

3D reconstruction · Benchmark SOTA · Xiaomi Auto

0 likes · 10 min read

Java Tech Enthusiast

May 23, 2026 · Artificial Intelligence

LeCun Slams Hinton’s LLM Enthusiasm and Defends World‑Model Research

In a candid interview, Yann LeCun criticizes Geoffrey Hinton’s sudden endorsement of large language models, argues that LLMs cannot achieve human‑level intelligence, explains his world‑model and JEPA approaches, and details why he left Meta to pursue more ambitious AI research.

AI research · JEPA · LLM

0 likes · 32 min read

Baidu Intelligent Cloud Tech Hub

May 22, 2026 · Artificial Intelligence

How Baidu Baige’s Full‑Stack AI Infra Accelerates Embodied Model Iteration

The article details Baidu Baige’s end‑to‑end AI infrastructure for embodied intelligence, covering VLA and world‑model architectures, scaling challenges for medium‑sized models, cloud‑based motion‑control pipelines, open‑source integration, hardware‑aware training optimizations, and simulation‑engine improvements that together speed up model development and deployment.

AI Infra · Baidu Baige · Embodied AI

0 likes · 13 min read

Machine Heart

May 14, 2026 · Artificial Intelligence

Breaking the 3D Perception Bottleneck: VGGT Series Enables Dynamic High‑Fidelity Reconstruction

The VGGT series from KOKONI 3D and collaborators tackles three core 3D perception limits—unbounded sequence memory, dynamic‑static entanglement, and compute‑precision trade‑offs—by introducing StreamCacheVGGT, progressive decoupling, and HD‑VGGT, achieving O(1) memory streaming, 15%+ accuracy gains on dynamic benchmarks, and record‑high AUC on RealEstate10K.

3D reconstruction · VGGT · computer vision

0 likes · 10 min read

Machine Heart

May 14, 2026 · Artificial Intelligence

How PsiBot Uses 100,000 Hours of Human Data to Power Embodied Intelligence

PsiBot demonstrates that, with a 100,000‑hour human‑operation dataset captured via exoskeleton gloves and ego‑vision, a world‑model (W0) and reinforcement‑learning policy (R2) can bridge the gap to robot control, offering a scalable alternative to costly teleoperation pipelines.

Embodied AI · data collection · human data

0 likes · 12 min read

Machine Heart

Apr 28, 2026 · Artificial Intelligence

Why a 7‑Month‑Old Startup Claims Human‑Like Robots Are Key to General Embodied Intelligence

The article details KAI, a 173 cm, 115‑DOF humanoid robot with tactile skin and a custom battery, and explains how its ultra‑human form, massive first‑person data collection, and three‑stage training pipeline are intended to enable a world‑model‑driven embodied AI system, while also acknowledging the engineering and market challenges ahead.

Embodied AI · Humanoid Robot · data pipeline

0 likes · 13 min read

AI Explorer

Apr 27, 2026 · Artificial Intelligence

Manifold AI’s Worldscape 0.2 Wins WorldArena, Marking a Shift from Seeing to Understanding

Manifold AI’s domestically developed Worldscape 0.2 model clinched first place in the rigorous WorldArena benchmark—demonstrating high‑fidelity dynamic scene generation and embodied control—highlighting a breakthrough in AI world models that move from mere visual perception toward genuine physical‑logic understanding, while noting the technology remains early‑stage.

AI benchmarking · Manifold AI · WorldArena

0 likes · 7 min read

Lao Guo's Learning Space

Apr 26, 2026 · Industry Insights

April 2026 AI Explosion: Sealed Model, Dual Model Showdown, and a 24‑Hour Shift

In April 2026 the AI landscape accelerated dramatically as Anthropic sealed its most powerful model, OpenAI and DeepSeek released competing flagship systems on the same day, Chinese firms unveiled groundbreaking world‑model and full‑duplex voice technologies, and token usage surged to 140 trillion calls per day, signaling a shift toward AI as essential infrastructure.

Anthropic · Claude Mythos · DeepSeek-V4

0 likes · 16 min read

Machine Heart

Apr 22, 2026 · Artificial Intelligence

China’s AlphaBrain Platform Launches First Full‑Stack Open‑Source Brain‑Like VLA

The AlphaBrain Platform, an open‑source embodied‑intelligence suite from China’s AI² Robotics, combines a world‑model stack, the pioneering NeuroVLA brain‑like model with spiking‑neuron actions, low‑cost RL‑Token training, and cross‑architecture continuous learning, all validated on leading robotics benchmarks.

AlphaBrain · Embodied Intelligence · NeuroVLA

0 likes · 11 min read

Code Mala Tang

Apr 22, 2026 · Artificial Intelligence

How LeWorldModel Achieves Stable End‑to‑End World Modeling with Just Two Losses

LeWorldModel, a 2026 JEPA‑based world model introduced by Yann LeCun and collaborators, solves representation collapse with a minimalist two‑loss objective, delivering a 15‑million‑parameter system that trains in hours, runs 48× faster than prior baselines, and reaches near‑SOTA performance on robot control benchmarks.

Deep Learning · Embodied AI · JEPA

0 likes · 6 min read

Lao Guo's Learning Space

Apr 21, 2026 · Artificial Intelligence

HappyOyster: Build an Explorable Interactive World with a Single Prompt

Alibaba’s ATH team unveiled HappyOyster, a real‑time world‑model platform that lets users generate and explore interactive 3D environments from a single sentence or image, offering two modes—Wander for exploration and Direct for creation—while detailing its streaming architecture, multimodal foundation, competitive advantages, use cases, and current limitations.

AI video · Game Development · Generative AI

0 likes · 11 min read

Machine Heart

Apr 21, 2026 · Artificial Intelligence

The Anonymous Model That Dominated Two World‑Model Benchmarks – Who’s Behind MotuBrain?

MotuBrain, a world model released anonymously, topped both the WorldArena and RoboTwin2.0 benchmarks, outperforming established models in motion quality, flow and smoothness, and demonstrating a unified prediction‑and‑action capability that could reshape embodied AI research.

Benchmark · Embodied AI · MotuBrain

0 likes · 9 min read

Machine Heart

Apr 18, 2026 · Artificial Intelligence

Alibaba’s HappyOyster World Model Takes a Third Path Between Google and Fei‑Fei’s Approaches

HappyOyster, Alibaba’s real‑time interactive world‑model product, combines a Wander mode for open‑ended scene generation and a Direct mode for AI‑driven video direction, using a streaming multimodal architecture that distinguishes it from one‑shot text‑to‑video systems like Sora and offers a distinct path from Google’s Genie and Fei‑Fei’s World Labs.

Alibaba AI · Interactive Video · Multimodal AI

0 likes · 10 min read

Machine Heart

Apr 12, 2026 · Artificial Intelligence

CVPR 2026 WorldArena Challenge Launches with Amap’s Open‑Source High‑Performance World Model Baseline

The CVPR 2026 WorldArena Challenge, organized by top academic institutions and Amap, introduces a new evaluation framework that tests video world models for physical realism and functional utility, while Amap releases its high‑performance ABot‑PhysWorld model and benchmark scores that set a new state‑of‑the‑art.

ABot-PhysWorld · Benchmark · CVPR 2026

0 likes · 9 min read

Data Party THU

Apr 5, 2026 · Artificial Intelligence

How Sequential World Models Enable Scalable Multi‑Robot Cooperation

SeqWM introduces a sequential causal decomposition of multi‑robot dynamics, allowing each robot to model its marginal contribution conditioned on preceding agents, which simplifies learning, improves sample efficiency, and yields natural collaborative behaviors both in simulation (Bi‑DexHands, Multi‑Quadruped) and real‑world tests on Unitree Go2‑W, outperforming prior methods.

Simulation · multi-robot · real-robot

0 likes · 7 min read

Fighter's World

Apr 4, 2026 · R&D Management

Building an AI‑Native Organization: From Hierarchy to Intelligent Ops

When AI eliminates execution bottlenecks, the real constraint becomes information flow, prompting a shift from hierarchical information‑routing to AI‑driven world models, intelligence layers and interfaces; the article analyses Block’s four‑layer architecture, its preconditions, challenges for mid‑level managers, and offers a step‑by‑step path for small teams to begin the AI‑native transformation.

AI-native · capabilities · hierarchy

0 likes · 24 min read

Machine Learning Algorithms & Natural Language Processing

Mar 31, 2026 · Artificial Intelligence

GigaWorld-1 Tops WorldArena Benchmark, Surpassing Google and Nvidia

GigaWorld-1, the latest embodied world model from Jiji Vision, took the global #1 spot on the WorldArena benchmark with a comprehensive score over 60, ahead of Google, Nvidia, and Alibaba. It excels in physics adherence (+16%), near‑perfect 3D accuracy, and leading visual quality, and it relies on explicit action modeling, a differentiable physics engine, and massive robot video data. Its open‑source releases have already attracted over 16,000 downloads.

Benchmark · Embodied AI · Open-source

0 likes · 7 min read

Machine Heart

Mar 29, 2026 · Artificial Intelligence

Why AI Can’t Plan: LeCun’s Team Shows Time Is Curved in Latent Space

Yann LeCun’s team argues that current visual models fail at planning because their latent representations form highly curved temporal trajectories, making Euclidean distance unreliable; their new paper introduces a curvature regularizer to straighten these paths, enabling more accurate planning demonstrated on a challenging teleport maze.

Curvature Regularizer · Latent Planning · Temporal Straightening

0 likes · 8 min read

SuanNi

Mar 25, 2026 · Artificial Intelligence

How LeWorldModel Learns Physics from Pixels in Hours – A Deep Dive

LeWorldModel (LeWM) is a compact AI world model that learns real‑world physics directly from raw pixel streams using only two simple mathematical rules, achieving dramatically faster planning and robust physical intuition compared to prior large‑scale models.

AI research · Model Predictive Control · physics learning

0 likes · 14 min read

AI Engineering

Mar 10, 2026 · Artificial Intelligence

Yann LeCun’s New AMI Labs Secures $1.03B to Build a World‑Model Alternative to LLMs

Yann LeCun and Alexandre LeBrun have launched AMI Labs, raising $1.03 billion in Europe’s largest seed round to develop JEPA—a world‑model architecture intended to replace LLMs for high‑risk domains, with all code and papers open‑sourced, a 5‑10‑year horizon, and backing from NVIDIA, Samsung, Bezos’ venture, and others.

AI research · AMI Labs · JEPA

0 likes · 3 min read

AI Engineering

Jan 17, 2026 · Artificial Intelligence

Can Tiny LLMs Compute Accurately? WorldModel‑Qwen Inference‑Time WASM Execution

The article details how the small Qwen‑0.6B model was adapted to generate and run WebAssembly code during inference, achieving deterministic calculations and revealing both the promise and current limitations of integrating world‑model reasoning into tiny LLMs.

LLM · Qwen-0.6B · WASM execution

0 likes · 5 min read

Advanced AI Application Practice

Jan 3, 2026 · Industry Insights

Where AI Is Heading in 2025: Key Trends and Predictions for Next Year

The author reviews optimistic and conservative AI forecasts, argues that enterprise AI adoption will surge, outlines infrastructure bottlenecks, predicts a shift from pure model performance to ecosystem competition, and highlights the rise of world‑model approaches and edge‑side applications for 2025.

AI Infrastructure · AI competition · AI trends

0 likes · 8 min read

21CTO

Oct 20, 2025 · Artificial Intelligence

Real-Time Frame Model (RTFM): Single‑GPU World Model Redefines 3D Generation

World Labs unveiled RTFM, a real‑time frame model that runs on a single H100 GPU, generating persistent, interactive 3D worlds from 2D images without explicit 3D representations, highlighting the growing computational demands of generative world models and their potential to reshape AI-driven spatial intelligence.

3D generation · GPU Acceleration · Generative AI

0 likes · 9 min read

Amap Tech

Oct 6, 2025 · Artificial Intelligence

Breaking VLA Training Limits: World-Env’s Virtual Sandbox for Safe, Data‑Efficient Robotics

World-Env introduces a virtual training sandbox that eliminates physical interaction, dramatically improves data efficiency with just five expert demos per task, and employs a vision‑language model as a semantic judge to dynamically terminate actions, enabling safe, high‑performing VLA post‑training across diverse robotic benchmarks.

Data Efficiency · virtual environment · vision-language-action

0 likes · 9 min read

DataFunTalk

Jun 12, 2025 · Artificial Intelligence

How Meta’s V‑JEPA 2 Is Pushing AI Toward Human‑Like Physical Understanding

Meta’s newly released V‑JEPA 2 introduces a video‑trained world model that can understand, predict, and plan physical actions, enabling zero‑shot robot control and outperforming existing models on benchmarks like IntPhys 2, MVPBench, and CausalVQA, while outlining future directions for hierarchical and multimodal JEPA architectures.

Benchmark · V-JEPA 2 · Video AI

0 likes · 8 min read

Sohu Tech Products

Mar 6, 2024 · Artificial Intelligence

Analysis of OpenAI Sora: Data Engineering, Network Architecture, and World Model Implications

OpenAI’s Sora video model unifies image and video data into latent spacetime patches via a VAE, trains on original resolutions with GPT‑4‑expanded captions, employs a Diffusion Transformer backbone for patch‑wise denoising, and demonstrates 3D‑consistent, long‑term world‑model capabilities that hint at a unified computer‑vision paradigm and steps toward AGI.

AI research · OpenAI Sora · Transformer

0 likes · 9 min read

DataFunTalk

Jan 25, 2024 · Artificial Intelligence

World Models, Reinforcement Learning, and Causal Inference: A Comprehensive Overview

This article presents a detailed overview of world models and their role in reinforcement learning, explains how causal inference can enhance model-based RL, discusses sample efficiency challenges, and shares experimental findings and practical insights from recent research and industry applications.

AI · causal inference · machine learning

0 likes · 22 min read

DataFunTalk

Apr 17, 2023 · Artificial Intelligence

Speculation: GPT-5 May Adopt Model‑Based Deep Reinforcement Learning for Unlimited Self‑Improvement

The article argues that the next generation GPT is likely to employ model‑based deep reinforcement learning, turning the model into both a policy and a world model, which could enable rapid, data‑efficient self‑enhancement but also raise serious safety and societal risks.

AI safety · GPT-5 · deep reinforcement learning

0 likes · 4 min read

Meituan Technology Team

Jun 11, 2020 · Artificial Intelligence

Pedestrian Trajectory Prediction: Methodology and Experience from the ICRA 2020 TrajNet++ Competition

The ICRA 2020 TrajNet++ competition challenged teams to predict 4.8‑second pedestrian paths from 3.6‑second observations, and Meituan’s winning solution used a Seq2Seq world‑model that encodes past trajectories, updates a spatio‑temporal interaction map, and decodes future positions, achieving a 1.24 m final displacement error and demonstrating readiness for real‑world unmanned delivery.

AI · ICRA 2020 · interaction modeling

0 likes · 14 min read
