Tagged articles
1 articles
Page 1 of 1
DataFunSummit
DataFunSummit
Sep 17, 2024 · Artificial Intelligence

Multimodal Video Understanding for Real-World Surveillance: Tasks, Dataset, Models, and Challenges

This article presents a comprehensive overview of multimodal video understanding for real-world surveillance, covering task definitions, the new UCA multimodal surveillance dataset, baseline models for video moment localization, captioning, and anomaly detection, experimental results, challenges, and future research directions.

AI modelsmultimodal video understandingsurveillance dataset
0 likes · 19 min read
Multimodal Video Understanding for Real-World Surveillance: Tasks, Dataset, Models, and Challenges