Tagged articles
5 articles
AIWalker
Mar 16, 2025 · Artificial Intelligence

VideoPainter: Plug‑and‑Play Video Inpainting and Editing Sets 8 SOTA Benchmarks

VideoPainter introduces a plug‑and‑play dual‑branch framework for video inpainting and editing, featuring a lightweight context encoder, ID‑consistent resampling, and the large VPData/VPBench datasets, and achieves state‑of‑the‑art results across eight quantitative and qualitative metrics.

Diffusion Models · Dual-Branch Architecture · ID resampling
15 min read
AIWalker
Mar 13, 2025 · Artificial Intelligence

VideoPainter: Plug‑and‑Play Video Inpainting and Editing Achieves 8 SOTA Benchmarks

VideoPainter introduces a plug‑and‑play dual‑branch framework with a lightweight context encoder and ID‑resampling adapter, built on the massive VPData/VPBench dataset, and demonstrates state‑of‑the‑art performance across eight video restoration and editing metrics, while supporting flexible model integration and long‑video consistency.

Dual-Branch Architecture · ID Consistency · Plug-and-Play
18 min read
Kuaishou Audio & Video Technology
Sep 29, 2022 · Artificial Intelligence

How DeViT Revolutionizes Video Inpainting with Deformed Vision Transformers

The article introduces DeViT, a novel Deformed Vision Transformer framework for video inpainting that leverages a deformable patch homography estimator, mask‑pruned attention, and spatio‑temporal weight adaptation, achieving state‑of‑the‑art results on benchmark datasets and highlighting its potential for advanced video editing tools.

DeViT · Multimedia · Transformer
10 min read
Cyber Elephant Tech Team
Mar 30, 2022 · Artificial Intelligence

Can AI Make Real-Life Invisibility Cloaks? Inside the STTN Video Restoration Breakthrough

This article reviews the challenges of video inpainting, surveys traditional methods, and introduces the Spatial‑Temporal Transformer Network (STTN), which uses multi‑scale attention and a Temporal Patch‑GAN discriminator. It details the architecture, loss functions, training on YouTube‑VOS, and restoration results.

AI video restoration · Deep Learning · Video Inpainting
10 min read
Youku Technology
Jul 8, 2021 · Artificial Intelligence

Key Findings from Alibaba Moku Lab at ACM MM 2021

At ACM MM 2021, Alibaba’s Moku Lab presented four cutting‑edge studies: an interactive video inpainting system using user doodles, a decoupled IoU regression model for object detection, a spatio‑temporal distortion‑aware video quality assessment framework, and a multimodal emotional relationship recognition dataset and benchmark.

Computer Vision · Video Inpainting · multimodal emotion recognition
8 min read