Tagged articles

video segmentation

10 articles · Page 1 of 1

May 15, 2026 · Artificial Intelligence

How X2SAM Empowers Multimodal Models to Segment Images and Videos at Pixel Level

X2SAM is a unified multimodal large model that combines image and video segmentation with language and visual prompts, introduces a Mask Memory for temporal consistency, defines a new V‑VGD task, and achieves state‑of‑the‑art results while cutting training cost by over 30%.

V-VGDX2SAMcomputer vision

0 likes · 9 min read

How X2SAM Empowers Multimodal Models to Segment Images and Videos at Pixel Level

Kuaishou Large Model

Sep 27, 2023 · Artificial Intelligence

DVIS: Decoupled Framework that Sets New SOTA in Video Instance Segmentation

DVIS introduces a decoupled video instance segmentation framework that splits the task into segmentation, tracking, and refinement modules, achieving state-of-the-art performance across VIS, VPS, and VSS benchmarks while maintaining low computational overhead, and demonstrates robustness in both online and offline settings.

Deep LearningTransformercomputer vision

0 likes · 12 min read

DVIS: Decoupled Framework that Sets New SOTA in Video Instance Segmentation

Alimama Tech

Feb 1, 2023 · Artificial Intelligence

Video Object of Interest Segmentation (VOIS): Task, Dataset, and Dual-Path Transformer Approach

The paper presents Video Object of Interest Segmentation (VOIS), a new e‑commerce task that locates and segments video instances matching a given product image, introduces the LiveVideos dataset of 2,418 Taobao live‑stream clips, and proposes a dual‑path Swin‑Transformer with cross‑fusion modules that outperforms existing VOS/VIS baselines.

Transformerdatasetinstance segmentation

0 likes · 11 min read

Video Object of Interest Segmentation (VOIS): Task, Dataset, and Dual-Path Transformer Approach

Meituan Technology Team

Jun 23, 2022 · Artificial Intelligence

Highlights of Six Meituan Papers Accepted at CVPR 2022

Meituan’s six CVPR 2022 papers advance computer vision by introducing a few‑sample model compression method, a language‑bridged video object segmentation approach, a single‑stage 3D visual grounding technique, a dynamic early‑exit image captioning system, a boosted black‑box adversarial attack, and a semi‑supervised video paragraph grounding framework.

3D groundingCVPR 2022adversarial attacks

0 likes · 15 min read

Highlights of Six Meituan Papers Accepted at CVPR 2022

Kuaishou Tech

Jun 21, 2021 · Artificial Intelligence

Kuaishou’s CVPR 2021 Paper Highlights: 3D Vision, Domain Adaptation, Point Cloud Completion, Video Segmentation, and Face Forgery Detection

Kuaishou secured 14 accepted papers at CVPR 2021, spanning 3D hand mesh recovery, unsupervised keypoint detection, point cloud completion, modular interactive video segmentation, deep video matting, co‑salient object detection, occlusion‑aware instance segmentation, semantic image matting, and face forgery detection, showcasing the maturity of its research collaborations.

CVPRDomain AdaptationFace Forgery Detection

0 likes · 14 min read

Kuaishou’s CVPR 2021 Paper Highlights: 3D Vision, Domain Adaptation, Point Cloud Completion, Video Segmentation, and Face Forgery Detection

Kuaishou Audio & Video Technology

Jun 18, 2021 · Artificial Intelligence

MiVOS: Achieving Precise Video Segmentation with Minimal User Interaction

MiVOS introduces a highly decoupled, three‑module framework—Interaction‑to‑Mask, Mask Propagation, and Difference‑aware Fusion—for interactive video object segmentation, delivering precise masks with fewer user interactions, validated on the DAVIS benchmark and supported by a new large‑scale synthetic VOS dataset (BL30K).

interactive segmentationmodular networkvideo segmentation

0 likes · 10 min read

MiVOS: Achieving Precise Video Segmentation with Minimal User Interaction

Youku Technology

Jul 10, 2020 · Artificial Intelligence

Mastering Video Object Segmentation: Cutting-Edge Models and Design Tricks

This technical talk introduces video object segmentation tasks, reviews leading datasets and state-of-the-art deep learning models, and shares practical network design rules and performance‑boosting techniques, presented by Prof. Wang Xinggang as part of Alibaba's MEDIA AI challenge series.

AIDeep Learningcomputer vision

0 likes · 4 min read

Mastering Video Object Segmentation: Cutting-Edge Models and Design Tricks

Youku Technology

Jun 2, 2020 · Artificial Intelligence

MEDIA AI Alibaba Entertainment Algorithm Challenge – High‑Precision Video Person Segmentation & Video Temporal Event Detection

The MEDIA AI Alibaba Entertainment Algorithm Challenge, co‑hosted by Alibaba Entertainment Technology and Alibaba Cloud Tianchi, invites researchers and teams to tackle high‑precision video person segmentation and video temporal event detection using large‑scale datasets, with a total prize pool of 380,000 CNY and recruitment opportunities.

AI competitionMedia AIvideo event detection

0 likes · 7 min read

MEDIA AI Alibaba Entertainment Algorithm Challenge – High‑Precision Video Person Segmentation & Video Temporal Event Detection

NetEase Media Technology Team

Jul 12, 2019 · Artificial Intelligence

Video Highlight Detection and GIF Cover Generation Using 3D Convolutional Scoring

The paper proposes a 3D‑CNN scoring system that ranks short, information‑dense video segments, selects the most exciting one, and converts it into a looping GIF cover, replacing static thumbnails; trained on large video‑GIF datasets with pairwise ranking loss, it improves click‑through rates while reducing bad‑case generation.

3D CNNGIF generationcontent recommendation

0 likes · 13 min read

Video Highlight Detection and GIF Cover Generation Using 3D Convolutional Scoring

Youku Technology

Apr 29, 2019 · Artificial Intelligence

Precise and Fast Object Segmentation Algorithms – Talk by Ren Haibing (Youku Cognitive Lab)

Ren Haibing’s Youku Cognitive Lab talk reviews object segmentation’s motivation, explains semantic and instance concepts, presents UNet‑based and category‑agnostic methods—including fast video segmentation with motion cues—and reports high IoU results while outlining future edge‑aware, label‑free, and non‑online video segmentation research directions.

AIDeep Learningcategory-agnostic

0 likes · 19 min read

Precise and Fast Object Segmentation Algorithms – Talk by Ren Haibing (Youku Cognitive Lab)