Tag

person recognition

1 views collected around this technical thread.

iQIYI Technical Product Team
iQIYI Technical Product Team
Sep 27, 2019 · Artificial Intelligence

iQIYI-VID: A Large-Scale Multimodal Video Dataset for Person Recognition

iQIYI-VID is the world’s largest multimodal video dataset for person recognition, containing 10,000 celebrity identities and 600,000 video clips drawn from millions of videos, supporting tasks such as detection, identification, attribute and audio analysis, and serving as the basis for 2018‑2019 challenges and a face‑recognition subset, thereby driving research while performance gaps remain.

AIcomputer visioniQIYI-VID
0 likes · 7 min read
iQIYI-VID: A Large-Scale Multimodal Video Dataset for Person Recognition
iQIYI Technical Product Team
iQIYI Technical Product Team
Aug 9, 2019 · Artificial Intelligence

iQIYI 2019 Multimodal Video Person Recognition Competition Report by Zheey Team

The Zheey team from Beijing University of Posts and Telecommunications tackled the iQIYI 2019 Multimodal Video Person Recognition Challenge with a three‑layer MLP on official face features, boosting a baseline 0.8742 to 0.8949 through model fusion, quality filtering and fine‑tuning, ultimately ranking sixth and open‑sourcing their code.

MLPcompetitionface features
0 likes · 9 min read
iQIYI 2019 Multimodal Video Person Recognition Competition Report by Zheey Team
iQIYI Technical Product Team
iQIYI Technical Product Team
Jul 5, 2019 · Artificial Intelligence

Residual Dense Network with Feature Fusion for Multimodal Video Person Identification (iQIYI-VID-2019)

The authors introduce a feature‑fusion pipeline and a Residual Dense Net that leverages multi‑frame face embeddings to identify persons in iQIYI‑VID‑2019 videos, achieving 0.9035 mAP (second place) with only ≈0.5 GFLOPs and processing the full test set in minutes.

feature fusioniQIYI-VID-2019multimodal learning
0 likes · 11 min read
Residual Dense Network with Feature Fusion for Multimodal Video Person Identification (iQIYI-VID-2019)
iQIYI Technical Product Team
iQIYI Technical Product Team
Apr 4, 2019 · Artificial Intelligence

My Experience and Methods in the iQIYI Multimodal Person Recognition Challenge

In the iQIYI Multimodal Person Recognition Challenge, I leveraged the provided facial features, weighted face‑quality averaging, DBSCAN‑based noise clustering and a dynamic extra noise class within an iterative KNN‑to‑neural‑network training pipeline, ultimately reaching the top‑5 and open‑sourcing the full workflow on GitHub.

DBSCANcomputer visioniQIYI
0 likes · 7 min read
My Experience and Methods in the iQIYI Multimodal Person Recognition Challenge
iQIYI Technical Product Team
iQIYI Technical Product Team
Aug 10, 2018 · Artificial Intelligence

iQIYI Releases World's First Multimodal, Multi-angle Celebrity Video Dataset (iQIYI-VID) and Announces AI Competition

iQIYI released iQIYI-VID, the world’s first multimodal, multi-angle celebrity video dataset (1,000 hours, 500,000 clips, 5,000 celebrities) for a new AI competition focusing on multimodal video person recognition, which has attracted global university teams and top computer‑vision judges to advance AI understanding in entertainment.

AI datasetcompetitioncomputer vision
0 likes · 7 min read
iQIYI Releases World's First Multimodal, Multi-angle Celebrity Video Dataset (iQIYI-VID) and Announces AI Competition