Artificial Intelligence 18 min read

Tencent Advertising Multimedia AI Technology: Research and Application

Liu Wei outlines Tencent's Advertising Multimedia AI ecosystem on the Taiji platform, describing a five-platform matrix: Jue for content understanding, Qiankun for automated video creation, Shenzhen for AI-driven review, Tianyin for hierarchical fingerprinting, and Hunyuan as a multimodal large model. Highlighted innovations include large-scale multimodal pre-training, logo retrieval, QA-style attribute extraction, spatiotemporal video analysis, multimodal auto-judgment, and hashing algorithms. The cross-modal retrieval model reached Top 1 on five international benchmarks.

Tencent Cloud Developer

This article, authored by Liu Wei, Director and Distinguished Scientist of Tencent's Advertising Multimedia AI Center, presents a comprehensive overview of multimedia AI technology research and applications in Tencent's advertising ecosystem. The article introduces a complete multimedia AI technology matrix built on the Taiji Machine Learning Platform, featuring the Hunyuan AI large model and advertising-specific models.

The core content covers five major technology platforms:

Jue (巨阙), Advertising Content Understanding: provides multi-dimensional, multi-granularity semantic understanding of products, creatives, and landing pages. It includes multi-modal pre-training for product classification, large-scale logo retrieval, QA-style attribute recognition, and temporal video understanding.

Qiankun (乾坤), Intelligent Advertising Creation: an automated video creation engine that supports video adaptation, image-to-video generation, video-to-video derivation, and virtual effects.

Shenzhen (神针), Intelligent Ad Review: a platform with 100+ AI review capabilities covering automatic judgment, similarity reuse, negative detection, and a rule engine.

Tianyin (天印), Ad Fingerprint System: provides hierarchical fingerprint IDs and embeddings for similarity detection across four levels.

Hunyuan (混元), AI Large Model: offers multi-modal content understanding, multi-modal copy generation, and cross-modal retrieval.
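To make the idea of embedding-based similarity detection concrete, here is a minimal sketch of how fingerprint embeddings might be compared. It is not Tianyin's implementation: the function names, the fingerprint IDs, the toy three-dimensional vectors, and the threshold are all hypothetical, and the four fingerprint levels are not modeled because the article does not describe them.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # treat a zero vector as similar to nothing
    return dot / (norm_a * norm_b)


def find_similar(query, index, threshold=0.9):
    """Return (fingerprint_id, score) pairs scoring at or above threshold, best first."""
    scored = ((fid, cosine_similarity(query, emb)) for fid, emb in index.items())
    hits = [(fid, score) for fid, score in scored if score >= threshold]
    return sorted(hits, key=lambda pair: pair[1], reverse=True)


# Hypothetical index: fingerprint ID -> embedding (toy values, not real data).
index = {
    "ad-001": [1.0, 0.0, 0.0],
    "ad-002": [0.9, 0.1, 0.0],
    "ad-003": [0.0, 1.0, 0.0],
}

# A new creative whose embedding is close to ad-001 and ad-002.
matches = find_similar([1.0, 0.05, 0.0], index, threshold=0.95)
matched_ids = [fid for fid, _ in matches]
```

In a review setting, a match above the threshold could let a new creative reuse the decision already made for a near-duplicate, which is one plausible reading of the "similarity reuse" capability. A linear scan like this only suits small indexes; production systems typically rely on approximate nearest-neighbor search.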

The technical innovations include multi-modal pre-training models trained on tens of millions of advertising samples, DML-based logo retrieval systems, multi-modal QA frameworks for attribute extraction, video temporal segmentation and spatiotemporal detection, multi-modal auto-judgment review technology, and hashing algorithms such as Angular Quantization and Hash Bit Selection. The cross-modal retrieval model has achieved Top 1 results on five authoritative international cross-modal retrieval benchmarks.
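The sketch below illustrates why hashing helps with large-scale similarity search: embeddings are compressed into short binary codes that can be compared by Hamming distance. It uses random-hyperplane hashing, a simpler and well-known technique swapped in for illustration. It is not Angular Quantization or Hash Bit Selection, and every name and parameter here is an assumption rather than something taken from the article.

```python
import random


def make_hyperplanes(num_bits, dim, seed=0):
    """Draw num_bits random Gaussian hyperplanes in dim dimensions."""
    rng = random.Random(seed)
    return [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(num_bits)]


def hash_vector(vec, hyperplanes):
    """Pack one bit per hyperplane: 1 if vec lies on its non-negative side."""
    code = 0
    for plane in hyperplanes:
        projection = sum(p * v for p, v in zip(plane, vec))
        code = (code << 1) | (1 if projection >= 0.0 else 0)
    return code


def hamming_distance(a, b):
    """Number of differing bits between two integer codes."""
    return bin(a ^ b).count("1")


NUM_BITS = 16
planes = make_hyperplanes(NUM_BITS, dim=4)

v = [0.3, -1.2, 0.8, 0.5]
scaled = [2.0 * x for x in v]   # same direction, different length
opposite = [-x for x in v]      # opposite direction

d_scaled = hamming_distance(hash_vector(v, planes), hash_vector(scaled, planes))
d_opposite = hamming_distance(hash_vector(v, planes), hash_vector(opposite, planes))
```

Because each bit depends only on the sign of a projection, the code ignores vector length: `d_scaled` is 0, while the opposite vector flips every bit. Vectors separated by a small angle tend to differ in few bits, so Hamming distance serves as a cheap proxy for angular similarity.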

multimodal AI, computer vision, deep learning, large language models, Advertising Technology, video analysis, content understanding, cross-modal retrieval