Tagged articles
10 articles
Page 1 of 1
NetEase Smart Enterprise Tech+
NetEase Smart Enterprise Tech+
Mar 12, 2024 · Artificial Intelligence

How Advanced Video AI Transforms Content Moderation and Retrieval

This article explores how modern video AI techniques—ranging from transformer‑based classification to semi‑supervised retrieval and token‑halting acceleration—enable efficient, accurate detection of prohibited content and fast, scalable video search in the era of short‑form media.

AI moderationSemi-supervised LearningTransformer
0 likes · 18 min read
How Advanced Video AI Transforms Content Moderation and Retrieval
Tencent Advertising Technology
Tencent Advertising Technology
Jun 22, 2021 · Artificial Intelligence

Technical Insights and Solution Strategies from the Tencent Advertising Algorithm Competition – Video Ad Track

The article outlines the Tencent Advertising Algorithm Competition’s video ad challenge, details the paper submission guidelines, and shares a participant’s step‑by‑step technical approach—including baseline experiments, model re‑implementation with Paddle, multimodal feature extraction, optimizer choices, and future improvement directions—providing practical AI insights for multimedia video classification.

Deep LearningMultimodal LearningTencent competition
0 likes · 7 min read
Technical Insights and Solution Strategies from the Tencent Advertising Algorithm Competition – Video Ad Track
Baidu Geek Talk
Baidu Geek Talk
Jun 21, 2021 · Artificial Intelligence

Detecting Pornographic Videos with Dual‑Modal AI: Images + Audio

This article presents a technical overview of a multimodal AI framework that combines image and audio analysis to identify pornographic video content, detailing model architectures, feature extraction methods, and experimental results achieving 93.4% accuracy on a 3,000‑sample test set.

Audio AnalysisDeep LearningMultimodal AI
0 likes · 6 min read
Detecting Pornographic Videos with Dual‑Modal AI: Images + Audio
DataFunTalk
DataFunTalk
May 22, 2021 · Artificial Intelligence

Baidu's Video Foundation Technology Architecture and Key AI Techniques

This article presents an overview of Baidu's video foundation technology architecture, covering the video R&D platform, core AI techniques for video understanding, editing, surveillance, and general vision, and detailing innovations such as Attention‑Cluster networks, cross‑modality attention with graph convolution, GANs, super‑resolution, and adaptive encoding.

Adaptive EncodingAttention MechanismGAN
0 likes · 14 min read
Baidu's Video Foundation Technology Architecture and Key AI Techniques
Amap Tech
Amap Tech
Feb 1, 2021 · Artificial Intelligence

AMAP-TECH Algorithm Competition: Dynamic Road Condition Analysis Using In-Vehicle Video

The AMAP‑TECH competition challenged participants to infer real‑time road conditions from in‑vehicle video, prompting the authors to combine lane‑wise vehicle detection with LightGBM and later an end‑to‑end DenseNet‑GRU model, augment data, ensemble five networks, and achieve a 0.7237 F1 score while outlining future deployment and research directions.

Computer VisionDeep LearningModel Deployment
0 likes · 15 min read
AMAP-TECH Algorithm Competition: Dynamic Road Condition Analysis Using In-Vehicle Video
Amap Tech
Amap Tech
Jan 22, 2021 · Artificial Intelligence

Dynamic Road Condition Analysis Using EfficientNet and ConvLSTM: Solution to the AMAP‑TECH Algorithm Competition

The winning solution for the AMAP‑TECH competition combines EfficientNet spatial features with ConvLSTM temporal modeling, adds a mask‑prediction branch for closed‑road obstacles, and mitigates severe class imbalance through a two‑stage decoupled training pipeline, achieving first place on leaderboard A and top‑5 on leaderboard B without ensembling.

AI competitionConvLSTMEfficientNet
0 likes · 7 min read
Dynamic Road Condition Analysis Using EfficientNet and ConvLSTM: Solution to the AMAP‑TECH Algorithm Competition
Youku Technology
Youku Technology
Apr 16, 2020 · Artificial Intelligence

Multimodal Video Classification: Image Feature Improvements and System Insights

The talk presents Alibaba’s hierarchical video‑category system and a multimodal classification pipeline—leveraging EfficientNet, NeXtVLAD fusion, attention‑dropping augmentation, and MoCo contrastive learning—that together boost cold‑start recall by 43%, improve program classification over 20%, and set the stage for larger models and advanced unsupervised methods.

AIEfficientNetMultimodal
0 likes · 17 min read
Multimodal Video Classification: Image Feature Improvements and System Insights
DataFunTalk
DataFunTalk
Mar 30, 2020 · Artificial Intelligence

Enhancing Multimodal Video Classification with Improved Image Features and Category System

This article presents a comprehensive overview of Alibaba Entertainment's category system and multimodal video classification algorithm, detailing the construction of a high‑accuracy hierarchical taxonomy, improvements to image feature extraction using EfficientNet and data augmentation, unsupervised training techniques, experimental results, practical pitfalls, and future research directions.

AIMultimodalUnsupervised Learning
0 likes · 17 min read
Enhancing Multimodal Video Classification with Improved Image Features and Category System
iQIYI Technical Product Team
iQIYI Technical Product Team
Feb 21, 2019 · Artificial Intelligence

Multimodal Soft‑Porn Detection for Short Videos: Models, Challenges, and Lessons Learned

The article describes iQIYI's multimodal soft‑porn detection system for short videos, covering challenges like subjective definitions and class imbalance, and detailing text (Convolutional Bi‑LSTM), image (Xception‑CBAM), video (NeXtVLAD) models, integration strategies, key takeaways, and future improvements.

AIcontent moderationmultimodal detection
0 likes · 15 min read
Multimodal Soft‑Porn Detection for Short Videos: Models, Challenges, and Lessons Learned