Tagged articles
3 articles
Page 1 of 1
Xiaohongshu Tech REDtech
Xiaohongshu Tech REDtech
Sep 21, 2023 · Artificial Intelligence

Xiaohongshu Team’s Four ICCV 2023 Papers on Open‑Vocabulary Video Instance Segmentation, One‑Shot 3D Avatar Learning, Test‑Time Personalized Human Pose Forecasting, and MPI‑Flow for Realistic Optical Flow

The Xiaohongshu technical team secured four ICCV 2023 papers—including an oral presentation—introducing an open‑vocabulary video instance segmentation benchmark and model, a one‑shot neural‑radiance‑field avatar method, a test‑time personalized 3D pose forecasting framework, and an MPI‑based realistic optical‑flow generation technique, all achieving state‑of‑the‑art performance.

3D AvatarHuman Pose ForecastingICCV 2023
0 likes · 14 min read
Xiaohongshu Team’s Four ICCV 2023 Papers on Open‑Vocabulary Video Instance Segmentation, One‑Shot 3D Avatar Learning, Test‑Time Personalized Human Pose Forecasting, and MPI‑Flow for Realistic Optical Flow
Meituan Technology Team
Meituan Technology Team
Jun 3, 2021 · Artificial Intelligence

VisTR: End-to-End Video Instance Segmentation with Transformers

VisTR redefines video instance segmentation as an end‑to‑end sequence‑to‑sequence task, using a CNN backbone, Transformer encoder‑decoder with instance queries, and Hungarian matching to jointly predict masks, classes, and tracks across frames, achieving state‑of‑the‑art accuracy (40.1 AP) and 57.7 FPS on YouTube‑VIS.

TransformerVideo Instance SegmentationVisTR
0 likes · 21 min read
VisTR: End-to-End Video Instance Segmentation with Transformers
Meituan Technology Team
Meituan Technology Team
Apr 15, 2021 · Artificial Intelligence

Meituan Technical Team Shares CVPR 2021 Pre-lecture: Five Papers on Video Instance Segmentation, Facial Expression Recognition, Real-time Semantic Segmentation, Weakly Supervised Semantic Segmentation, and Multi-source Domain Adaptation

At a CVPR 2021 pre‑lecture, Meituan’s Visual Intelligence Center showcased five cutting‑edge papers—VisTR transformer‑based video instance segmentation, a feature‑decomposition facial expression recognizer, an accelerated BiSeNet for real‑time semantic segmentation, an embedded discriminative attention mechanism for weakly supervised segmentation, and a partial‑feature selection framework for multi‑source domain adaptation—highlighting the company’s large AI R&D team, university collaborations, real‑world deployment across its services, and ongoing recruitment.

CVPR2021Facial Expression RecognitionMeituan Research
0 likes · 10 min read
Meituan Technical Team Shares CVPR 2021 Pre-lecture: Five Papers on Video Instance Segmentation, Facial Expression Recognition, Real-time Semantic Segmentation, Weakly Supervised Semantic Segmentation, and Multi-source Domain Adaptation