Tag

video retrieval

0 views collected around this technical thread.

DataFunTalk
DataFunTalk
May 15, 2024 · Artificial Intelligence

Advances in Video Multimodal Retrieval: Video‑Text Semantic Search and Video‑Video Same‑Source Search

This article presents Ant Group's multimodal research on video retrieval, detailing video‑text semantic search and video‑video same‑source search, introducing a large Chinese pre‑training dataset, novel pre‑training, hard‑sample mining, fine‑grained modeling techniques, and an efficient end‑to‑end copyright detection framework.

Pretrainingcopyright detectionfine-grained modeling
0 likes · 38 min read
Advances in Video Multimodal Retrieval: Video‑Text Semantic Search and Video‑Video Same‑Source Search
Kuaishou Tech
Kuaishou Tech
Jan 23, 2024 · Artificial Intelligence

Highlights of Five Selected AAAI 2024 Papers on Recommendation, Retrieval, and Video Generation

This article presents concise overviews of five AAAI 2024 accepted papers covering multi‑stage reinforcement‑learning recommendation, error‑adaptive watch‑time prediction, coarse‑to‑fine text‑to‑video retrieval, enhanced fashion image retrieval, and conditional image‑to‑video generation, each with authors, download links, and reported performance gains.

AAAI 2024Artificial IntelligenceRecommendation systems
0 likes · 14 min read
Highlights of Five Selected AAAI 2024 Papers on Recommendation, Retrieval, and Video Generation
DataFunTalk
DataFunTalk
May 20, 2022 · Artificial Intelligence

Hierarchical Graph Convolutional Networks for Video Social Relationship Modeling

This article presents a multimodal approach that combines dynamic analysis and graph machine learning to generate and apply social relationship graphs in videos, detailing problem background, graph generation modules, applications such as video retrieval, experimental results, and future research directions.

AIWeak Supervisiongraph neural network
0 likes · 11 min read
Hierarchical Graph Convolutional Networks for Video Social Relationship Modeling
Tencent Tech
Tencent Tech
Apr 21, 2022 · Artificial Intelligence

How Tencent’s HunYuan Model Dominated All Major Video Retrieval Benchmarks

Tencent’s newly unveiled HunYuan AI model achieved a grand‑slam by ranking first on the five most authoritative cross‑modal video retrieval datasets, showcasing a hierarchical multimodal approach that dramatically boosts retrieval precision and promises broad impact for both research and industry applications.

AIHunyuanTencent
0 likes · 5 min read
How Tencent’s HunYuan Model Dominated All Major Video Retrieval Benchmarks
Youku Technology
Youku Technology
Mar 23, 2021 · Artificial Intelligence

Text-Video Alignment Algorithm for Automated Short Video Production at Youku

Youku’s new text‑video alignment system automatically generates short video summaries by extracting multimodal video and linguistic features, matching sentences to clips through embedding and tag‑level models, and enabling AI‑driven auto‑editing that cuts production time from days to minutes.

BERTNLPcross-modal matching
0 likes · 10 min read
Text-Video Alignment Algorithm for Automated Short Video Production at Youku
Youku Technology
Youku Technology
Jul 9, 2020 · Artificial Intelligence

Multi-level Multi-modal Search Engine and Graph Engine for Billion-scale Video Content

An advanced multi‑level, multi‑modal search and graph engine for Youku processes text, voice, image and video queries across hierarchical video elements, using combined vector and inverted indexes to merge cross‑level and cross‑modal results, while a distributed knowledge‑graph layer enables multimodal graph traversal for billion‑scale video retrieval.

AIgraph engineknowledge graph
0 likes · 12 min read
Multi-level Multi-modal Search Engine and Graph Engine for Billion-scale Video Content
iQIYI Technical Product Team
iQIYI Technical Product Team
Feb 28, 2020 · Artificial Intelligence

Video Copyright Detection Using CDVS and CDVA: Solution Overview and Technical Details

The BoYun Vision team developed a CDVS‑ and CDVA‑based video copyright detection system that extracts key frames, combines handcrafted and deep features, aligns timestamps with a CDVS temporal node filter, and achieved a top‑2 result in the 2019 iQIYI‑CCF competition.

CDVACDVSVideo Copyright Detection
0 likes · 9 min read
Video Copyright Detection Using CDVS and CDVA: Solution Overview and Technical Details
iQIYI Technical Product Team
iQIYI Technical Product Team
Aug 9, 2019 · Artificial Intelligence

iQIYI 2019 Multimodal Video Person Recognition Competition Report by Zheey Team

The Zheey team from Beijing University of Posts and Telecommunications tackled the iQIYI 2019 Multimodal Video Person Recognition Challenge with a three‑layer MLP on official face features, boosting a baseline 0.8742 to 0.8949 through model fusion, quality filtering and fine‑tuning, ultimately ranking sixth and open‑sourcing their code.

MLPcompetitionface features
0 likes · 9 min read
iQIYI 2019 Multimodal Video Person Recognition Competition Report by Zheey Team