Tagged articles
3 articles
Page 1 of 1
Baidu Geek Talk
Baidu Geek Talk
Feb 17, 2022 · Artificial Intelligence

AI-Powered Sports Video Applications: Figure Skating Action Recognition, Multimodal Classification, and Football Highlight Clipping

The article showcases three AI‑driven sports video solutions—real‑time figure‑skating action recognition with ST‑GCN, multimodal video classification merging text, image and audio via ERNIE and TextCNN, and automated football highlight clipping using TSN‑BMN‑LSTM—each achieving over 85% accuracy, fully open‑source on PaddlePaddle with one‑click notebooks and a live developer session.

AIDeep LearningMultimodal Classification
0 likes · 8 min read
AI-Powered Sports Video Applications: Figure Skating Action Recognition, Multimodal Classification, and Football Highlight Clipping
Baidu Geek Talk
Baidu Geek Talk
Jan 17, 2022 · Artificial Intelligence

Unlocking Video AI: PaddleVideo’s Open‑Source Solutions for Sports, Media, and Safety

This article surveys PaddleVideo, Baidu's open‑source video AI toolkit, detailing its industry‑focused models for sports action recognition, multimodal tagging, intelligent production, interactive segmentation, drone detection, and medical imaging, while providing performance metrics and GitHub resources for each solution.

Computer VisionMultimodal LearningOpen-source
0 likes · 14 min read
Unlocking Video AI: PaddleVideo’s Open‑Source Solutions for Sports, Media, and Safety
Youku Technology
Youku Technology
May 13, 2019 · Artificial Intelligence

How Youku Tackles Multimodal Video Understanding and Quality Control

This article outlines Youku's multimodal video content understanding pipeline, covering business needs, problem decomposition, data construction, model selection, OCR subtitle extraction, scene and action recognition, sample augmentation, noise handling, and multimodal fusion strategies for robust content moderation.

AIComputer VisionOCR
0 likes · 11 min read
How Youku Tackles Multimodal Video Understanding and Quality Control