PAT3D Makes Text-to-3D Scenes Physically Plausible for Simulation and Interaction
PAT3D, a Physics‑Augmented Text‑to‑3D scene generation framework presented at ICLR 2026, extracts object‑space relationships from text‑driven images and uses them to initialize a hierarchical layout, which it then refines with differentiable rigid‑body simulation and a semantic loss. The resulting scenes are physically stable, outperform prior methods on stability metrics, and support downstream editing, animation, and robot simulation.
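
To make the idea of a hierarchical layout concrete, here is a minimal Python sketch of how "A rests on B" relations might be turned into initial vertical positions. It is an illustration only, not PAT3D's implementation: the `Box` class, the `stack_layout` function, the `supported_by` mapping, and the example objects and heights are all hypothetical, and the sketch assumes the support relations are acyclic and ignores horizontal placement and the later simulation-based refinement.

```python
from dataclasses import dataclass


@dataclass
class Box:
    """Hypothetical stand-in for a scene object; only its height matters here."""
    name: str
    height: float  # vertical extent, in arbitrary units


def stack_layout(boxes, supported_by, ground_z=0.0):
    """Return {name: bottom_z} so each object rests on its supporter's top face.

    supported_by maps a child name to the name of the object beneath it.
    Objects without an entry are placed on the ground.
    Assumes the support relations contain no cycles.
    """
    heights = {b.name: b.height for b in boxes}
    bottom = {}

    def place(name):
        # Reuse a position that has already been computed.
        if name in bottom:
            return bottom[name]
        parent = supported_by.get(name)
        if parent is None:
            bottom[name] = ground_z
        else:
            # The child's bottom coincides with the top of its supporter.
            bottom[name] = place(parent) + heights[parent]
        return bottom[name]

    for b in boxes:
        place(b.name)
    return bottom


# Hypothetical scene: a cup on a book on a table.
boxes = [Box("table", 0.75), Box("book", 0.05), Box("cup", 0.10)]
supported_by = {"book": "table", "cup": "book"}
layout = stack_layout(boxes, supported_by)
```

Placing each object exactly on its supporter gives a penetration-free starting point along the vertical axis; in a pipeline like the one summarized above, a physics-based refinement step would then adjust such an initial layout.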
