Tagged articles

world model

41 articles · Page 1 of 1
Machine Learning Algorithms & Natural Language Processing
Machine Learning Algorithms & Natural Language Processing
Jul 2, 2026 · Artificial Intelligence

From Agentic Tools to Agentive Systems: A Review of “Critique of Agent Model”

The paper distinguishes agentic tools that rely on external scaffolding from truly agentive systems whose goals, identity, decision‑making, self‑regulation and learning are internalized, proposes the GIC (Goal‑Identity‑Configurator) architecture, and evaluates its safety, auditability and applicability through a pilot‑training use case.

AI agentsGIC architectureagency
0 likes · 19 min read
From Agentic Tools to Agentive Systems: A Review of “Critique of Agent Model”
Machine Heart
Machine Heart
Jun 29, 2026 · Artificial Intelligence

How MWA™'s Long‑Sequence Bidirectional Physical Causal Chain Sets a New Record in Embodied AI

The article presents MWA™, the first long‑sequence bidirectional physical causal chain hidden‑space world model, details its bidirectional dynamics, latent‑action pre‑training, three‑gradient constraints and AnyPhys negative‑sample system, and shows it achieved a 75.2% success rate on the RoboCasa GR1 TableTop benchmark, surpassing leading competitors.

AnyPhysEmbodied AIRoboCasa benchmark
0 likes · 14 min read
How MWA™'s Long‑Sequence Bidirectional Physical Causal Chain Sets a New Record in Embodied AI
Machine Heart
Machine Heart
Jun 26, 2026 · Artificial Intelligence

How RuoYu Technology Secured China’s First Explosion‑Proof Certification and the World’s First Fueling‑Brain Robot

Amid a booming Chinese embodied‑intelligence market, RuoYu Technology’s explosion‑proof robot RuoYu LanYue 01, powered by the self‑developed RuoYu Jiutian brain, achieved the nation’s first explosion‑proof certification and the world’s first fueling‑brain solution, demonstrating end‑to‑end perception‑planning‑execution across fueling stations, oil‑gas fields, and ports.

Embodied AIH-GARexplosion-proof robotics
0 likes · 16 min read
How RuoYu Technology Secured China’s First Explosion‑Proof Certification and the World’s First Fueling‑Brain Robot

Chinese Team Brings World‑Model AI to Mass Production – The Physical‑World Anthropic

The article analyzes how world‑model AI, which predicts the next physical frame instead of the next word, is reshaping autonomous driving, highlights Momenta's three‑stage R7 architecture and massive data loop, compares its path with Anthropic's software‑only strategy, and projects a multi‑trillion‑dollar physical‑AI market by 2030.

AI marketAnthropicMomenta
0 likes · 10 min read
Chinese Team Brings World‑Model AI to Mass Production – The Physical‑World Anthropic
Machine Heart
Machine Heart
Jun 23, 2026 · Industry Insights

Momenta's Physical AI IPO: World Model as the New AI Foundation

Momenta has cleared the Hong Kong Stock Exchange listing hearing, positioning itself as the first "Physical AI" stock with its R7 World Model—a three‑layer architecture that leverages massive real‑world driving data, simulation and reinforcement learning—to challenge Nvidia, Tesla and Anthropic while targeting a multi‑trillion‑dollar market.

AI marketCommercial LoopData Scaling
0 likes · 17 min read
Momenta's Physical AI IPO: World Model as the New AI Foundation
Machine Heart
Machine Heart
Jun 21, 2026 · Artificial Intelligence

Can World Models Bridge LLMs' Dynamic Reasoning Gaps?

The article analyzes why large language model agents struggle with dynamic tasks, critiques existing CoT‑style optimizations, and shows how recent world‑model approaches such as EvoAgent, WebEvolver, COMAP, RWML and ProPlay quantitatively improve prediction, planning and success rates in evolving environments.

AgentCoTEvoAgent
0 likes · 9 min read
Can World Models Bridge LLMs' Dynamic Reasoning Gaps?
Machine Heart
Machine Heart
Jun 18, 2026 · Artificial Intelligence

Curr-0: Enabling Humanoid Robots to Perform Continuous Full-Body Dexterous Operations

Current Robotics introduces Curr-0, a single-policy model that unifies locomotion, whole-body posture coordination, and fine hand manipulation for 70-plus-degree humanoid robots, trained on 21,000 hours of human behavior data collected via the HumanEx exoskeleton system, and supported by a multi-modal world model for scalable evaluation and deployment.

Humanoid Robotfull-body dexterityhuman behavior dataset
0 likes · 6 min read
Curr-0: Enabling Humanoid Robots to Perform Continuous Full-Body Dexterous Operations
Machine Learning Algorithms & Natural Language Processing
Machine Learning Algorithms & Natural Language Processing
Jun 15, 2026 · Artificial Intelligence

A Comprehensive Survey of Agentic Time Series Systems: Architecture, Reliability, and Research Frontiers

This survey maps the emerging field of agentic time‑series systems, outlining a five‑layer architecture that integrates perception, reasoning, planning, memory, and world modeling, while emphasizing reliability constraints, benchmark evolution, diverse applications, and six key research frontiers.

LLMReliabilityagentic time series
0 likes · 27 min read
A Comprehensive Survey of Agentic Time Series Systems: Architecture, Reliability, and Research Frontiers
Machine Heart
Machine Heart
Jun 11, 2026 · Artificial Intelligence

MBench: Tsinghua and Tencent Define Long-Term Memory for Video World Models

MBench, a new benchmark from Tsinghua University and Tencent, systematically evaluates the long‑term memory ability of streaming video generation models across entity, environment, and causal consistency, introduces a trigger‑conditioned scoring scheme, and reveals that memory remains a major bottleneck for current SOTA models.

AIBenchmarklong-term consistency
0 likes · 8 min read
MBench: Tsinghua and Tencent Define Long-Term Memory for Video World Models
PaperAgent
PaperAgent
Jun 5, 2026 · Artificial Intelligence

Tongji’s “Boundless” World Model Wins Open‑Source #1 and Overall #2 in WorldArena

The Tongji University “Boundless” world model achieved the top open‑source score (64.54) and the second‑overall rank (67.87) on WorldArena’s Track‑1, demonstrating high‑quality video generation, stable long‑sequence physics, and embodied interaction across six evaluation dimensions, while using data‑efficient training and a hybrid open/closed‑source strategy.

BoundlessEmbodied AIOpen-source
0 likes · 9 min read
Tongji’s “Boundless” World Model Wins Open‑Source #1 and Overall #2 in WorldArena
SuanNi
SuanNi
May 31, 2026 · Artificial Intelligence

How NVIDIA’s Gamma‑World Turns Single‑Agent Models into Multiplayer Experiences

Gamma‑World introduces a multi‑agent world model that solves identity, interaction, and real‑time inference challenges with parameter‑free geometric encoding, sparse hub attention, and teacher‑student distillation, enabling zero‑shot generalization from two to four agents and achieving 24 FPS interactive video generation.

Gamma-WorldReal-time inferenceSimplex Rotary Agent Encoding
0 likes · 11 min read
How NVIDIA’s Gamma‑World Turns Single‑Agent Models into Multiplayer Experiences
Machine Heart
Machine Heart
May 29, 2026 · Artificial Intelligence

ZhiYuan’s GE 2.0 Wins WorldArena World Model Championship – How It Achieved Bare‑Bones Victory

ZhiYuan’s Genie Envisioner‑Sim 2.0 (GE 2.0) captured the overall WorldArena world‑model title without any task‑specific tuning, demonstrating superior long‑sequence stability, multi‑view generation, real‑time inference and a closed‑loop reward feedback loop that outperforms industry baselines across 16 metrics and three real‑world tasks.

Closed-loop EvaluationEmbodied AIGE 2.0
0 likes · 9 min read
ZhiYuan’s GE 2.0 Wins WorldArena World Model Championship – How It Achieved Bare‑Bones Victory
Xiaomi Tech
Xiaomi Tech
May 26, 2026 · Artificial Intelligence

Xiaomi Auto Unveils Integrated Reconstruction‑Generation World Model Framework Achieving SOTA on Major Benchmarks

Xiaomi Auto introduces a novel world‑model framework that tightly couples 3D reconstruction and generative prediction, delivering state‑of‑the‑art performance on Waymo and nuScenes benchmarks while enabling high‑fidelity, long‑duration video synthesis for autonomous‑driving scenarios.

3D reconstructionBenchmark SOTAXiaomi Auto
0 likes · 10 min read
Xiaomi Auto Unveils Integrated Reconstruction‑Generation World Model Framework Achieving SOTA on Major Benchmarks
Baidu Intelligent Cloud Tech Hub
Baidu Intelligent Cloud Tech Hub
May 22, 2026 · Artificial Intelligence

How Baidu Baige’s Full‑Stack AI Infra Accelerates Embodied Model Iteration

The article details Baidu Baige’s end‑to‑end AI infrastructure for embodied intelligence, covering VLA and world‑model architectures, scaling challenges for medium‑sized models, cloud‑based motion‑control pipelines, open‑source integration, hardware‑aware training optimizations, and simulation‑engine improvements that together speed up model development and deployment.

AI InfraBaidu BaigeEmbodied AI
0 likes · 13 min read
How Baidu Baige’s Full‑Stack AI Infra Accelerates Embodied Model Iteration
Machine Heart
Machine Heart
May 14, 2026 · Artificial Intelligence

Breaking the 3D Perception Bottleneck: VGGT Series Enables Dynamic High‑Fidelity Reconstruction

The VGGT series from KOKONI 3D and collaborators tackles three core 3D perception limits—unbounded sequence memory, dynamic‑static entanglement, and compute‑precision trade‑offs—by introducing StreamCacheVGGT, progressive decoupling, and HD‑VGGT, achieving O(1) memory streaming, 15%+ accuracy gains on dynamic benchmarks, and record‑high AUC on RealEstate10K.

3D reconstructionVGGTcomputer vision
0 likes · 10 min read
Breaking the 3D Perception Bottleneck: VGGT Series Enables Dynamic High‑Fidelity Reconstruction
Machine Heart
Machine Heart
May 14, 2026 · Artificial Intelligence

How PsiBot Uses 100,000 Hours of Human Data to Power Embodied Intelligence

PsiBot demonstrates that, with a 100,000‑hour human‑operation dataset captured via exoskeleton gloves and ego‑vision, a world‑model (W0) and reinforcement‑learning policy (R2) can bridge the gap to robot control, offering a scalable alternative to costly teleoperation pipelines.

Embodied AIdata collectionhuman data
0 likes · 12 min read
How PsiBot Uses 100,000 Hours of Human Data to Power Embodied Intelligence
Machine Heart
Machine Heart
Apr 28, 2026 · Artificial Intelligence

Why a 7‑Month‑Old Startup Claims Human‑Like Robots Are Key to General Embodied Intelligence

The article details KAI, a 173 cm, 115‑DOF humanoid robot with tactile skin and a custom battery, and explains how its ultra‑human form, massive first‑person data collection, and three‑stage training pipeline are intended to enable a world‑model‑driven embodied AI system, while also acknowledging the engineering and market challenges ahead.

Embodied AIHumanoid Robotdata pipeline
0 likes · 13 min read
Why a 7‑Month‑Old Startup Claims Human‑Like Robots Are Key to General Embodied Intelligence
AI Explorer
AI Explorer
Apr 27, 2026 · Artificial Intelligence

Manifold AI’s Worldscape 0.2 Wins WorldArena, Marking a Shift from Seeing to Understanding

Manifold AI’s domestically developed Worldscape 0.2 model clinched first place in the rigorous WorldArena benchmark—demonstrating high‑fidelity dynamic scene generation and embodied control—highlighting a breakthrough in AI world models that move from mere visual perception toward genuine physical‑logic understanding, while noting the technology remains early‑stage.

AI benchmarkingManifold AIWorldArena
0 likes · 7 min read
Manifold AI’s Worldscape 0.2 Wins WorldArena, Marking a Shift from Seeing to Understanding
Lao Guo's Learning Space
Lao Guo's Learning Space
Apr 26, 2026 · Industry Insights

April 2026 AI Explosion: Sealed Model, Dual Model Showdown, and a 24‑Hour Shift

In April 2026 the AI landscape accelerated dramatically as Anthropic sealed its most powerful model, OpenAI and DeepSeek released competing flagship systems on the same day, Chinese firms unveiled groundbreaking world‑model and full‑duplex voice technologies, and token usage surged to 140 trillion calls per day, signaling a shift toward AI as essential infrastructure.

AnthropicClaude MythosDeepSeek-V4
0 likes · 16 min read
April 2026 AI Explosion: Sealed Model, Dual Model Showdown, and a 24‑Hour Shift
Machine Heart
Machine Heart
Apr 22, 2026 · Artificial Intelligence

China’s AlphaBrain Platform Launches First Full‑Stack Open‑Source Brain‑Like VLA

The AlphaBrain Platform, an open‑source embodied‑intelligence suite from China’s AI² Robotics, combines a world‑model stack, the pioneering NeuroVLA brain‑like model with spiking‑neuron actions, low‑cost RL‑Token training, and cross‑architecture continuous learning, all validated on leading robotics benchmarks.

AlphaBrainEmbodied IntelligenceNeuroVLA
0 likes · 11 min read
China’s AlphaBrain Platform Launches First Full‑Stack Open‑Source Brain‑Like VLA
Code Mala Tang
Code Mala Tang
Apr 22, 2026 · Artificial Intelligence

How LeWorldModel Achieves Stable End‑to‑End World Modeling with Just Two Losses

LeWorldModel, a 2026 JEPA‑based world model introduced by Yann LeCun and collaborators, solves representation collapse with a minimalist two‑loss objective, delivering a 15‑million‑parameter system that trains in hours, runs 48× faster than prior baselines, and reaches near‑SOTA performance on robot control benchmarks.

Deep LearningEmbodied AIJEPA
0 likes · 6 min read
How LeWorldModel Achieves Stable End‑to‑End World Modeling with Just Two Losses
Lao Guo's Learning Space
Lao Guo's Learning Space
Apr 21, 2026 · Artificial Intelligence

HappyOyster: Build an Explorable Interactive World with a Single Prompt

Alibaba’s ATH team unveiled HappyOyster, a real‑time world‑model platform that lets users generate and explore interactive 3D environments from a single sentence or image, offering two modes—Wander for exploration and Direct for creation—while detailing its streaming architecture, multimodal foundation, competitive advantages, use cases, and current limitations.

AI videoGame DevelopmentGenerative AI
0 likes · 11 min read
HappyOyster: Build an Explorable Interactive World with a Single Prompt
Machine Heart
Machine Heart
Apr 18, 2026 · Artificial Intelligence

Alibaba’s HappyOyster World Model Takes a Third Path Between Google and Fei‑Fei’s Approaches

HappyOyster, Alibaba’s real‑time interactive world‑model product, combines a Wander mode for open‑ended scene generation and a Direct mode for AI‑driven video direction, using a streaming multimodal architecture that distinguishes it from one‑shot text‑to‑video systems like Sora and offers a distinct path from Google’s Genie and Fei‑Fei’s World Labs.

Alibaba AIInteractive VideoMultimodal AI
0 likes · 10 min read
Alibaba’s HappyOyster World Model Takes a Third Path Between Google and Fei‑Fei’s Approaches
Machine Heart
Machine Heart
Apr 12, 2026 · Artificial Intelligence

CVPR 2026 WorldArena Challenge Launches with Amap’s Open‑Source High‑Performance World Model Baseline

The CVPR 2026 WorldArena Challenge, organized by top academic institutions and Amap, introduces a new evaluation framework that tests video world models for physical realism and functional utility, while Amap releases its high‑performance ABot‑PhysWorld model and benchmark scores that set a new state‑of‑the‑art.

ABot-PhysWorldBenchmarkCVPR 2026
0 likes · 9 min read
CVPR 2026 WorldArena Challenge Launches with Amap’s Open‑Source High‑Performance World Model Baseline
Data Party THU
Data Party THU
Apr 5, 2026 · Artificial Intelligence

How Sequential World Models Enable Scalable Multi‑Robot Cooperation

SeqWM introduces a sequential causal decomposition of multi‑robot dynamics, allowing each robot to model its marginal contribution conditioned on preceding agents, which simplifies learning, improves sample efficiency, and yields natural collaborative behaviors both in simulation (Bi‑DexHands, Multi‑Quadruped) and real‑world tests on Unitree Go2‑W, outperforming prior methods.

Simulationmulti-robotreal-robot
0 likes · 7 min read
How Sequential World Models Enable Scalable Multi‑Robot Cooperation
Fighter's World
Fighter's World
Apr 4, 2026 · R&D Management

Building an AI‑Native Organization: From Hierarchy to Intelligent Ops

When AI eliminates execution bottlenecks, the real constraint becomes information flow, prompting a shift from hierarchical information‑routing to AI‑driven world models, intelligence layers and interfaces; the article analyses Block’s four‑layer architecture, its preconditions, challenges for mid‑level managers, and offers a step‑by‑step path for small teams to begin the AI‑native transformation.

AI-nativecapabilitieshierarchy
0 likes · 24 min read
Building an AI‑Native Organization: From Hierarchy to Intelligent Ops
Machine Learning Algorithms & Natural Language Processing
Machine Learning Algorithms & Natural Language Processing
Mar 31, 2026 · Artificial Intelligence

GigaWorld-1 Tops WorldArena Benchmark, Surpassing Google and Nvidia

GigaWorld-1, the latest embodied world model from Jiji Vision, clinched the global #1 spot on the WorldArena benchmark—beating Google, Nvidia, and Alibaba—with a comprehensive score over 60, excelling in physics adherence (+16%), near‑perfect 3D accuracy, and leading visual quality, while leveraging explicit action modeling, a differentiable physics engine, massive robot video data, and open‑source releases that have already attracted over 16,000 downloads.

BenchmarkEmbodied AIOpen-source
0 likes · 7 min read
GigaWorld-1 Tops WorldArena Benchmark, Surpassing Google and Nvidia
Machine Heart
Machine Heart
Mar 29, 2026 · Artificial Intelligence

Why AI Can’t Plan: LeCun’s Team Shows Time Is Curved in Latent Space

Yann LeCun’s team argues that current visual models fail at planning because their latent representations form highly curved temporal trajectories, making Euclidean distance unreliable; their new paper introduces a curvature regularizer to straighten these paths, enabling more accurate planning demonstrated on a challenging teleport maze.

Curvature RegularizerLatent PlanningTemporal Straightening
0 likes · 8 min read
Why AI Can’t Plan: LeCun’s Team Shows Time Is Curved in Latent Space
SuanNi
SuanNi
Mar 25, 2026 · Artificial Intelligence

How LeWorldModel Learns Physics from Pixels in Hours – A Deep Dive

LeWorldModel (LeWM) is a compact AI world model that learns real‑world physics directly from raw pixel streams using only two simple mathematical rules, achieving dramatically faster planning and robust physical intuition compared to prior large‑scale models.

AI researchModel Predictive Controlphysics learning
0 likes · 14 min read
How LeWorldModel Learns Physics from Pixels in Hours – A Deep Dive
AI Engineering
AI Engineering
Mar 10, 2026 · Artificial Intelligence

Yann LeCun’s New AMI Labs Secures $1.03B to Build a World‑Model Alternative to LLMs

Yann LeCun and Alexandre LeBrun have launched AMI Labs, raising $1.03 billion in Europe’s largest seed round to develop JEPA—a world‑model architecture intended to replace LLMs for high‑risk domains, with all code and papers open‑sourced, a 5‑10‑year horizon, and backing from NVIDIA, Samsung, Bezos’ venture, and others.

AI researchAMI LabsJEPA
0 likes · 3 min read
Yann LeCun’s New AMI Labs Secures $1.03B to Build a World‑Model Alternative to LLMs
21CTO
21CTO
Oct 20, 2025 · Artificial Intelligence

Real-Time Frame Model (RTFM): Single‑GPU World Model Redefines 3D Generation

World Labs unveiled RTFM, a real‑time frame model that runs on a single H100 GPU, generating persistent, interactive 3D worlds from 2D images without explicit 3D representations, highlighting the growing computational demands of generative world models and their potential to reshape AI-driven spatial intelligence.

3D generationGPU AccelerationGenerative AI
0 likes · 9 min read
Real-Time Frame Model (RTFM): Single‑GPU World Model Redefines 3D Generation
Amap Tech
Amap Tech
Oct 6, 2025 · Artificial Intelligence

Breaking VLA Training Limits: World-Env’s Virtual Sandbox for Safe, Data‑Efficient Robotics

World-Env introduces a virtual training sandbox that eliminates physical interaction, dramatically improves data efficiency with just five expert demos per task, and employs a vision‑language model as a semantic judge to dynamically terminate actions, enabling safe, high‑performing VLA post‑training across diverse robotic benchmarks.

Data Efficiencyvirtual environmentvision-language-action
0 likes · 9 min read
Breaking VLA Training Limits: World-Env’s Virtual Sandbox for Safe, Data‑Efficient Robotics
DataFunTalk
DataFunTalk
Jun 12, 2025 · Artificial Intelligence

How Meta’s V‑JEPA 2 Is Pushing AI Toward Human‑Like Physical Understanding

Meta’s newly released V‑JEPA 2 introduces a video‑trained world model that can understand, predict, and plan physical actions, enabling zero‑shot robot control and outperforming existing models on benchmarks like IntPhys 2, MVPBench, and CausalVQA, while outlining future directions for hierarchical and multimodal JEPA architectures.

BenchmarkV-JEPA 2Video AI
0 likes · 8 min read
How Meta’s V‑JEPA 2 Is Pushing AI Toward Human‑Like Physical Understanding
Sohu Tech Products
Sohu Tech Products
Mar 6, 2024 · Artificial Intelligence

Analysis of OpenAI Sora: Data Engineering, Network Architecture, and World Model Implications

OpenAI’s Sora video model unifies image and video data into latent spacetime patches via a VAE, trains on original resolutions with GPT‑4‑expanded captions, employs a Diffusion Transformer backbone for patch‑wise denoising, and demonstrates 3D‑consistent, long‑term world‑model capabilities that hint at a unified computer‑vision paradigm and steps toward AGI.

AI researchOpenAI SoraTransformer
0 likes · 9 min read
Analysis of OpenAI Sora: Data Engineering, Network Architecture, and World Model Implications
DataFunTalk
DataFunTalk
Jan 25, 2024 · Artificial Intelligence

World Models, Reinforcement Learning, and Causal Inference: A Comprehensive Overview

This article presents a detailed overview of world models and their role in reinforcement learning, explains how causal inference can enhance model-based RL, discusses sample efficiency challenges, and shares experimental findings and practical insights from recent research and industry applications.

AIcausal inferencemachine learning
0 likes · 22 min read
World Models, Reinforcement Learning, and Causal Inference: A Comprehensive Overview
Meituan Technology Team
Meituan Technology Team
Jun 11, 2020 · Artificial Intelligence

Pedestrian Trajectory Prediction: Methodology and Experience from the ICRA 2020 TrajNet++ Competition

The ICRA 2020 TrajNet++ competition challenged teams to predict 4.8‑second pedestrian paths from 3.6‑second observations, and Meituan’s winning solution used a Seq2Seq world‑model that encodes past trajectories, updates a spatio‑temporal interaction map, and decodes future positions, achieving a 1.24 m final displacement error and demonstrating readiness for real‑world unmanned delivery.

AIICRA 2020interaction modeling
0 likes · 14 min read
Pedestrian Trajectory Prediction: Methodology and Experience from the ICRA 2020 TrajNet++ Competition