Tagged articles
7 articles
Page 1 of 1
Zhihu Tech Column
Zhihu Tech Column
Nov 4, 2025 · Artificial Intelligence

How Multimodal Large Models Transform Recommendation Systems: From Tags to Embeddings

This article explores how multimodal large models like Qwen2.5‑VL enable high‑dimensional tag generation and universal embeddings for recommendation systems, detailing data synthesis, model training, quantization, fine‑tuning, and the resulting improvements in click‑through rate and exposure interaction.

EmbeddingLarge Language ModelsMultimodal AI
0 likes · 17 min read
How Multimodal Large Models Transform Recommendation Systems: From Tags to Embeddings
iQIYI Technical Product Team
iQIYI Technical Product Team
Feb 25, 2022 · Artificial Intelligence

Short Video Content Tagging: Multimodal AI Model Framework and Applications

The framework tags short videos by fusing text, image and audio‑video features through specialized extraction, classification, generative and retrieval modules, then ranking candidates with a multimodal BERT model, delivering accurate, business‑specific tags that boost recommendation, search and advertising.

Deep LearningMultimodal AIcontent tagging
0 likes · 10 min read
Short Video Content Tagging: Multimodal AI Model Framework and Applications
iQIYI Technical Product Team
iQIYI Technical Product Team
Mar 27, 2020 · Artificial Intelligence

Multimodal Short Video Content Tagging Techniques and Applications at iQIYI

The article surveys iQIYI’s multimodal short‑video content‑tagging pipeline, detailing extraction‑ and generation‑based methods, challenges of open‑world tags, model evolution from rule‑based to Transformer generators, visual‑text fusion techniques, and applications such as recommendation, search, clustering, and future enhancements.

MultimodalNLPcontent tagging
0 likes · 18 min read
Multimodal Short Video Content Tagging Techniques and Applications at iQIYI
Qunar Tech Salon
Qunar Tech Salon
Mar 5, 2020 · Artificial Intelligence

Content Tagging Technology for Short Videos at iQIYI: Challenges and Model Evolution

This article describes iQIYI's short‑video content tagging system, outlining the challenges of extracting type and abstract tags from multimodal data, detailing the evolution from text‑only models to image‑fusion, BERT‑enhanced, and video‑frame models, and discussing their applications and future directions.

BERTMultimodal LearningTransformer
0 likes · 11 min read
Content Tagging Technology for Short Videos at iQIYI: Challenges and Model Evolution
iQIYI Technical Product Team
iQIYI Technical Product Team
Feb 14, 2020 · Artificial Intelligence

Content Tagging Technology for Short Videos: Challenges and Multi‑Modal Model Evolution at iQIYI

iQIYI’s short‑video tagging system tackles multimodal fusion, open‑set and abstract tags by evolving from a text‑only model through cover‑image, BERT‑vector, and video‑frame fusion architectures, enabling automated labeling, personalized recommendation, and semantic search while planning to add OCR, audio, and knowledge‑graph enhancements.

BERTMultimodal LearningTransformer
0 likes · 13 min read
Content Tagging Technology for Short Videos: Challenges and Multi‑Modal Model Evolution at iQIYI
DataFunTalk
DataFunTalk
Jul 26, 2019 · Artificial Intelligence

Hulu’s Video Content Understanding: Challenges, Practices, and Applications

This article summarizes Hulu Chief Research Officer Xie Xiaohui’s presentation on why video content understanding is essential, the technical challenges involved, and Hulu’s end‑to‑end solutions—including fine‑grained segmentation, logo and subtitle detection, automated pipelines, tagging taxonomy, content generation, and vector embeddings—to improve recommendation, advertising, and search for massive video libraries.

AIHulucontent tagging
0 likes · 14 min read
Hulu’s Video Content Understanding: Challenges, Practices, and Applications