End-to-End 3D Spatial Video Generation via Monocular Depth Estimation, Novel View Synthesis, and MV-HEVC Encoding
The JD Retail Content R&D team presents an end-to-end pipeline that converts 2D video assets into high-quality, immersive 3D spatial videos by combining AI-driven monocular depth estimation, novel view synthesis, and MV-HEVC encoding. The team also introduces the large-scale StereoV1K dataset and reports that the pipeline outperforms existing methods.
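To make the depth-to-second-view step concrete, here is a minimal sketch of classical depth-image-based rendering: a left-eye frame is forward-warped into a right-eye frame using a per-pixel disparity derived from a depth map. This is not the team's method, which the summary does not describe; the function name `synthesize_right_view`, the "larger depth means farther" convention, and the `max_disparity` parameter are all assumptions made for illustration. Depth estimation itself and MV-HEVC encoding are outside the sketch.

```python
import numpy as np


def synthesize_right_view(left, depth, max_disparity=8):
    """Forward-warp a left-eye frame into a right-eye frame (illustrative only).

    left:  (H, W, 3) uint8 image.
    depth: (H, W) float array; larger values mean farther away (assumed convention).
    Returns the warped frame and a boolean mask of disoccluded (unfilled) pixels.
    """
    h, w = depth.shape

    # Disparity is proportional to inverse depth: near pixels shift the most.
    inv = 1.0 / np.maximum(depth, 1e-6)
    span = inv.max() - inv.min()
    norm = (inv - inv.min()) / span if span > 0 else np.zeros_like(inv)
    disparity = np.rint(norm * max_disparity).astype(np.int64)

    right = np.zeros_like(left)
    filled = np.zeros((h, w), dtype=bool)
    ys, xs = np.mgrid[0:h, 0:w]

    # Paint far pixels first so that nearer pixels overwrite them.
    for d in range(max_disparity + 1):
        sel = disparity == d
        src_x, src_y = xs[sel], ys[sel]
        dst_x = src_x - d  # content moves left when seen from the right eye
        ok = dst_x >= 0    # drop pixels that leave the frame
        right[src_y[ok], dst_x[ok]] = left[src_y[ok], src_x[ok]]
        filled[src_y[ok], dst_x[ok]] = True

    return right, ~filled
```

The returned mask marks the disocclusion holes that a real view-synthesis stage would need to fill, for example with a learned inpainting model. The resulting left and right frames form the stereo pair that a multiview encoder such as MV-HEVC would then compress.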