Tag

NeXtVLAD

0 views collected around this technical thread.

HomeTech
HomeTech
Jul 7, 2023 · Artificial Intelligence

Multi-Modal Video Understanding and AIGC Video Generation at Autohome

This article presents a comprehensive multi-modal video understanding system for AIGC video generation, detailing technical architecture, GCN-based semi-supervised learning, and practical applications across automotive content scenarios.

AIGCBERTNeXtVLAD
0 likes · 8 min read
Multi-Modal Video Understanding and AIGC Video Generation at Autohome
HomeTech
HomeTech
Mar 4, 2020 · Artificial Intelligence

Video Multi-Label Classification Using Graph Convolutional Networks

This paper introduces a method for video multi-label classification that incorporates label correlation features using graph convolutional networks, significantly improving classification performance.

GCNInceptionV3NeXtVLAD
0 likes · 7 min read
Video Multi-Label Classification Using Graph Convolutional Networks
NetEase Media Technology Team
NetEase Media Technology Team
Apr 4, 2019 · Artificial Intelligence

Video Recommendation System: Framework, Topic Clustering, and Related Video Retrieval

The paper proposes a video recommendation framework that combines recall and ranking modules, using a multi‑modal topic clustering approach—integrating audio, visual, and textual features via NeXtVLAD, PCA, and K‑Means—to generate unified video representations, improve candidate selection, and boost click‑through and viewing time, while addressing cold‑start and semantic relevance challenges.

A/B testingNeXtVLADcold-start problem
0 likes · 7 min read
Video Recommendation System: Framework, Topic Clustering, and Related Video Retrieval