Tag

Depth Estimation

0 views collected around this technical thread.

JD Tech
JD Tech
Apr 21, 2025 · Artificial Intelligence

End-to-End 3D Spatial Video Generation via Monocular Depth Estimation, Novel View Synthesis, and MV‑HEVC Encoding

This article presents a comprehensive AI‑driven pipeline that converts 2D video into immersive 3D spatial video by leveraging monocular depth estimation, depth‑warping novel view synthesis, a multi‑branch inpainting module, a large‑scale StereoV1K dataset, and efficient MV‑HEVC compression, with results validated at ICME 2025 and deployed in JD Vision services.

3D videoAIAIGC
0 likes · 20 min read
End-to-End 3D Spatial Video Generation via Monocular Depth Estimation, Novel View Synthesis, and MV‑HEVC Encoding
JD Retail Technology
JD Retail Technology
Apr 16, 2025 · Artificial Intelligence

AI‑Driven 3D Spatial Video Generation from Monocular 2D Content with MV‑HEVC Encoding

This work presents an end‑to‑end AI pipeline that transforms existing monocular 2D videos into immersive 3D spatial streams by combining DINO‑v2‑based depth estimation, multi‑branch view synthesis, and MV‑HEVC encoding, achieving up to 33 % BD‑Rate reduction, 31 % speed gains, state‑of‑the‑art visual quality, and real‑time production suitability, validated on the new StereoV1K benchmark and deployed in JD.Vision’s e‑commerce catalog.

3D videoAI generationAIGC
0 likes · 21 min read
AI‑Driven 3D Spatial Video Generation from Monocular 2D Content with MV‑HEVC Encoding
Kuaishou Audio & Video Technology
Kuaishou Audio & Video Technology
Dec 30, 2022 · Artificial Intelligence

Unlocking Realistic Bokeh: Depth‑Aware Algorithms Behind Holiday Video Effects

This article explains the optical principles of bokeh (scatter blur), describes a depth‑aware variable‑focus algorithm developed by Kuaishou’s audio‑video team, and details practical optimizations such as saliency detection, edge‑preserving weighting, and adaptive spot‑light effects that enable realistic, customizable holiday video filters.

BokehDepth EstimationVideo Effects
0 likes · 11 min read
Unlocking Realistic Bokeh: Depth‑Aware Algorithms Behind Holiday Video Effects
DataFunTalk
DataFunTalk
Jun 30, 2022 · Artificial Intelligence

Self‑Augmented Unpaired Image Dehazing via Density and Depth Decomposition (D4)

The paper introduces D4, a self‑augmented unpaired image dehazing framework that decomposes the transmission map into fog density and scene depth, enabling realistic fog synthesis for data augmentation and achieving superior dehazing performance with fewer parameters and FLOPs on multiple benchmarks.

CVPR2022Depth Estimationcomputer vision
0 likes · 14 min read
Self‑Augmented Unpaired Image Dehazing via Density and Depth Decomposition (D4)
Kuaishou Tech
Kuaishou Tech
Feb 9, 2022 · Mobile Development

Kuaishou Mobile Mixed Reality System: Architecture, Algorithms, and Applications

This article presents Kuaishou's mobile mixed reality (MR) system, detailing its integration of deep learning, SLAM, and scene reconstruction for real‑time spatial computing, the design of a monocular depth‑estimation model, a lightweight 3D rendering engine, and its deployment across iOS and Android devices with various user‑facing effects.

Depth EstimationKuaishouMixed Reality
0 likes · 16 min read
Kuaishou Mobile Mixed Reality System: Architecture, Algorithms, and Applications
JD Retail Technology
JD Retail Technology
Aug 2, 2021 · Artificial Intelligence

Real-time Monocular Human Depth Estimation and Segmentation on Embedded Systems (HDES-Net)

The paper presents HDES‑Net, a lightweight real‑time monocular human depth estimation and segmentation network designed for embedded platforms, using MobileNetV1 backbone with ASPP and depth‑wise separable convolutions, achieving high accuracy on CAD‑60 and EPFL‑RGBD datasets while running at up to 199.93 FPS on a Tesla P40 and 17.23 FPS on a Jetson Nano after TensorRT optimization.

Depth EstimationEmbedded AIHDES-Net
0 likes · 8 min read
Real-time Monocular Human Depth Estimation and Segmentation on Embedded Systems (HDES-Net)
TAL Education Technology
TAL Education Technology
Jun 18, 2020 · Artificial Intelligence

An Overview of Virtual Reality, Augmented Reality, and Vision‑Based Techniques

This article explains the fundamentals of virtual reality and its distinction from augmented reality, describes VR hardware, outlines depth‑estimation and eye‑tracking methods such as projection, Hough transform, AdaBoost and sample matching, discusses Sobel edge detection, and explores the importance of audio, haptic feedback, and immersive VR applications in education.

ARDepth EstimationImmersive Education
0 likes · 11 min read
An Overview of Virtual Reality, Augmented Reality, and Vision‑Based Techniques
iQIYI Technical Product Team
iQIYI Technical Product Team
May 8, 2020 · Artificial Intelligence

Deep Learning‑Based 2D‑to‑3D Conversion for VR Content

iQIYI’s deep‑learning pipeline converts single‑view images into high‑quality stereo pairs for VR by training on side‑by‑side 3D movies, employing a Monodepth‑based encoder‑decoder, a CVAE to encode camera parameters, ConvLSTM for temporal consistency, and disparity‑guided inpainting to fill occlusion holes, achieving stable, continuous depth maps validated through extensive human 3‑D effect assessments.

2D-to-3DDepth EstimationVR
0 likes · 12 min read
Deep Learning‑Based 2D‑to‑3D Conversion for VR Content