Tagged articles
13 articles
Page 1 of 1
Kuaishou Tech
Kuaishou Tech
Nov 20, 2025 · Artificial Intelligence

How UniDex and UniSearch Redefine Video Search with Semantic Indexing and Generative Models

This article explains how Kuaishou’s UniDex replaces traditional term‑based inverted indexes with model‑driven semantic posting lists and how the end‑to‑end UniSearch framework generates video IDs directly from queries, delivering higher relevance, lower latency, and significant online performance gains.

AIGenerative ModelsSearch
0 likes · 17 min read
How UniDex and UniSearch Redefine Video Search with Semantic Indexing and Generative Models
NewBeeNLP
NewBeeNLP
May 29, 2024 · Artificial Intelligence

How Ant’s Multimodal Team Boosted Video‑Text Retrieval by 24% and Cut Copyright Search Costs 85%

This article presents Ant Group's multimodal research on video retrieval, detailing a large Chinese video‑text pre‑training dataset, three techniques that raise video‑text semantic search performance by up to 24.5%, and an end‑to‑end video‑video copyright detection system that reduces storage by 85% and speeds up inference 18‑fold.

copyright detectionfine-grained modelinghard sample mining
0 likes · 40 min read
How Ant’s Multimodal Team Boosted Video‑Text Retrieval by 24% and Cut Copyright Search Costs 85%
DataFunTalk
DataFunTalk
May 15, 2024 · Artificial Intelligence

Advances in Video Multimodal Retrieval: Video‑Text Semantic Search and Video‑Video Same‑Source Search

This article presents Ant Group's multimodal research on video retrieval, detailing video‑text semantic search and video‑video same‑source search, introducing a large Chinese pre‑training dataset, novel pre‑training, hard‑sample mining, fine‑grained modeling techniques, and an efficient end‑to‑end copyright detection framework.

copyright detectionfine-grained modelinghard sample mining
0 likes · 38 min read
Advances in Video Multimodal Retrieval: Video‑Text Semantic Search and Video‑Video Same‑Source Search
NetEase Smart Enterprise Tech+
NetEase Smart Enterprise Tech+
Mar 12, 2024 · Artificial Intelligence

How Advanced Video AI Transforms Content Moderation and Retrieval

This article explores how modern video AI techniques—ranging from transformer‑based classification to semi‑supervised retrieval and token‑halting acceleration—enable efficient, accurate detection of prohibited content and fast, scalable video search in the era of short‑form media.

AI moderationSemi-supervised LearningTransformer
0 likes · 18 min read
How Advanced Video AI Transforms Content Moderation and Retrieval
Kuaishou Tech
Kuaishou Tech
Jan 23, 2024 · Artificial Intelligence

Highlights of Five Selected AAAI 2024 Papers on Recommendation, Retrieval, and Video Generation

This article presents concise overviews of five AAAI 2024 accepted papers covering multi‑stage reinforcement‑learning recommendation, error‑adaptive watch‑time prediction, coarse‑to‑fine text‑to‑video retrieval, enhanced fashion image retrieval, and conditional image‑to‑video generation, each with authors, download links, and reported performance gains.

AAAI 2024Recommendation Systemsartificial intelligence
0 likes · 14 min read
Highlights of Five Selected AAAI 2024 Papers on Recommendation, Retrieval, and Video Generation
DataFunTalk
DataFunTalk
May 20, 2022 · Artificial Intelligence

Hierarchical Graph Convolutional Networks for Video Social Relationship Modeling

This article presents a multimodal approach that combines dynamic analysis and graph machine learning to generate and apply social relationship graphs in videos, detailing problem background, graph generation modules, applications such as video retrieval, experimental results, and future research directions.

AIGraph Neural NetworkWeak Supervision
0 likes · 11 min read
Hierarchical Graph Convolutional Networks for Video Social Relationship Modeling
Tencent Tech
Tencent Tech
Apr 21, 2022 · Artificial Intelligence

How Tencent’s HunYuan Model Dominated All Major Video Retrieval Benchmarks

Tencent’s newly unveiled HunYuan AI model achieved a grand‑slam by ranking first on the five most authoritative cross‑modal video retrieval datasets, showcasing a hierarchical multimodal approach that dramatically boosts retrieval precision and promises broad impact for both research and industry applications.

AITencentmultimodal
0 likes · 5 min read
How Tencent’s HunYuan Model Dominated All Major Video Retrieval Benchmarks
Youku Technology
Youku Technology
Mar 23, 2021 · Artificial Intelligence

Text-Video Alignment Algorithm for Automated Short Video Production at Youku

Youku’s new text‑video alignment system automatically generates short video summaries by extracting multimodal video and linguistic features, matching sentences to clips through embedding and tag‑level models, and enabling AI‑driven auto‑editing that cuts production time from days to minutes.

BERTNLPcross-modal matching
0 likes · 10 min read
Text-Video Alignment Algorithm for Automated Short Video Production at Youku
Youku Technology
Youku Technology
Jul 9, 2020 · Artificial Intelligence

Multi-level Multi-modal Search Engine and Graph Engine for Billion-scale Video Content

An advanced multi‑level, multi‑modal search and graph engine for Youku processes text, voice, image and video queries across hierarchical video elements, using combined vector and inverted indexes to merge cross‑level and cross‑modal results, while a distributed knowledge‑graph layer enables multimodal graph traversal for billion‑scale video retrieval.

AIgraph enginelarge-scale indexing
0 likes · 12 min read
Multi-level Multi-modal Search Engine and Graph Engine for Billion-scale Video Content
iQIYI Technical Product Team
iQIYI Technical Product Team
Mar 13, 2020 · Artificial Intelligence

How to Detect Video Copyright Infringement with Two‑Stage Frame Matching

This article details a two‑stage video copyright detection pipeline that builds a frame‑level feature library, uses Hessian‑Affine + SIFT and Fisher Vectors for robust feature extraction, applies weighted bipartite graph matching and longest increasing subsequence localization, and achieves an F1‑score of 0.9086 on the CCF 2019 competition dataset.

AIfeature extractionframe matching
0 likes · 14 min read
How to Detect Video Copyright Infringement with Two‑Stage Frame Matching
iQIYI Technical Product Team
iQIYI Technical Product Team
Aug 9, 2019 · Artificial Intelligence

iQIYI 2019 Multimodal Video Person Recognition Competition Report by Zheey Team

The Zheey team from Beijing University of Posts and Telecommunications tackled the iQIYI 2019 Multimodal Video Person Recognition Challenge with a three‑layer MLP on official face features, boosting a baseline 0.8742 to 0.8949 through model fusion, quality filtering and fine‑tuning, ultimately ranking sixth and open‑sourcing their code.

MLPcompetitionface features
0 likes · 9 min read
iQIYI 2019 Multimodal Video Person Recognition Competition Report by Zheey Team
Suning Technology
Suning Technology
Dec 17, 2018 · Artificial Intelligence

How Search & Recommendation Technologies Evolve: Insights from Suning’s 2018 Conference

The 2018 Suning Search & Recommendation Technology Conference in Nanjing gathered over 400 industry experts to discuss search engine evolution, recommendation algorithm models, multi‑source data fusion, multimedia video retrieval, and AI‑driven advertising, highlighting practical implementations and future research directions.

data fusionmachine learningrecommendation
0 likes · 5 min read
How Search & Recommendation Technologies Evolve: Insights from Suning’s 2018 Conference