SIF3D: Sense‑Informed Forecasting of 3D Human Motion with Multimodal Attention
SIF3D is a scene-aware framework for forecasting 3D human motion. It fuses three modalities: the observed motion, a 3D point cloud of the scene, and human gaze. Inputs are encoded with PointNet++ and Transformers, then combined by two attention mechanisms, ternary intention-aware attention and semantic-coherence-aware attention. A graph-convolutional network decodes the fused features into the predicted motion. SIF3D achieves state-of-the-art results on the GIMO and GTA-1M benchmarks.
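To make the fusion idea concrete, here is a minimal sketch of gaze-biased cross-attention between motion tokens and scene point features. It is an illustration under stated assumptions, not the SIF3D attention modules themselves: the function names (`cross_attention`, `gaze_bias`), the Gaussian-style distance bias, and the `sigma` parameter are all hypothetical, and learned projections, multiple heads, and the PointNet++ and Transformer encoders are omitted.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def gaze_bias(points, gaze_point, sigma=1.0):
    """Hypothetical additive attention bias: scene points closer to the
    gaze point get a larger (less negative) logit offset."""
    return [
        -sum((p - g) ** 2 for p, g in zip(pt, gaze_point)) / (2.0 * sigma ** 2)
        for pt in points
    ]


def cross_attention(queries, keys, values, bias=None):
    """Scaled dot-product cross-attention with an optional per-key bias.

    queries: T x d motion tokens; keys: N x d scene features;
    values: N x d_v scene features; bias: N floats or None.
    Returns T x d_v fused features.
    """
    d = len(keys[0])
    fused = []
    for q in queries:
        logits = [sum(a * b for a, b in zip(q, k)) / math.sqrt(d) for k in keys]
        if bias is not None:
            logits = [l + b for l, b in zip(logits, bias)]
        weights = softmax(logits)
        fused.append([
            sum(w * v[j] for w, v in zip(weights, values))
            for j in range(len(values[0]))
        ])
    return fused


# Toy inputs (assumed shapes): 2 motion tokens, 2 scene points.
motion_tokens = [[0.0, 0.0], [0.0, 0.0]]
scene_xyz = [[0.0, 0.0, 0.0], [10.0, 0.0, 0.0]]
scene_feats = [[1.0, 0.0], [0.0, 1.0]]
gaze_target = [0.0, 0.0, 0.0]

fused = cross_attention(
    motion_tokens, scene_feats, scene_feats,
    bias=gaze_bias(scene_xyz, gaze_target),
)
```

In this toy setup the gaze target coincides with the first scene point, so each motion token attends almost entirely to that point's feature. A real model would learn the query, key, and value projections rather than using raw features.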