Tagged articles

Video Inpainting

5 articles · Page 1 of 1

Mar 16, 2025 · Artificial Intelligence

VideoPainter: Plug‑and‑Play Video Inpainting and Editing Sets 8 SOTA Benchmarks

VideoPainter introduces a plug‑and‑play dual‑branch framework for video inpainting and editing, featuring a lightweight context encoder, ID‑consistent resampling, and the large VPData/VPBench datasets, and achieves state‑of‑the‑art results across eight quantitative and qualitative metrics.

Diffusion Models · Dual-Branch Architecture · ID resampling

0 likes · 15 min read


AIWalker

Mar 13, 2025 · Artificial Intelligence

VideoPainter: Plug‑and‑Play Video Inpainting and Editing Achieves 8 SOTA Benchmarks

VideoPainter introduces a plug‑and‑play dual‑branch framework with a lightweight context encoder and ID‑resampling adapter, built on the massive VPData/VPBench dataset, and demonstrates state‑of‑the‑art performance across eight video restoration and editing metrics, while supporting flexible model integration and long‑video consistency.

Dataset Construction · Dual-Branch Architecture · ID Consistency

0 likes · 18 min read


Kuaishou Audio & Video Technology

Sep 29, 2022 · Artificial Intelligence

How DeViT Revolutionizes Video Inpainting with Deformed Vision Transformers

The article introduces DeViT, a Deformed Vision Transformer framework for video inpainting built on a deformable patch homography estimator, mask‑pruned attention, and spatio‑temporal weight adaptation. It reports state‑of‑the‑art results on benchmark datasets and points to potential uses in video editing tools.

DeViT · Transformer · Video Inpainting

0 likes · 10 min read


Cyber Elephant Tech Team

Mar 30, 2022 · Artificial Intelligence

Can AI Make Real-Life Invisibility Cloaks? Inside the STTN Video Restoration Breakthrough

This article reviews the challenges of video inpainting, surveys traditional methods, and introduces the Spatial‑Temporal Transformer Network (STTN), which combines multi‑scale attention with a Temporal Patch‑GAN discriminator. It details the architecture, loss functions, training on YouTube‑VOS, and restoration results.

AI video restoration · Deep Learning · Video Inpainting

0 likes · 10 min read


Youku Technology

Jul 8, 2021 · Artificial Intelligence

Key Findings from Alibaba Moku Lab at ACM MM 2021

At ACM MM 2021, Alibaba’s Moku Lab presented four cutting‑edge studies: an interactive video inpainting system using user doodles, a decoupled IoU regression model for object detection, a spatio‑temporal distortion‑aware video quality assessment framework, and a multimodal emotional relationship recognition dataset and benchmark.

Video Inpainting · computer vision · multimodal emotion recognition

0 likes · 8 min read
