AI Secrets Behind City Brain: Video Anomaly Detection, Person Re‑ID & Image Generation

This article introduces the concept of a city brain powered by internet data, then showcases three ACM MM 2017 papers—spatio‑temporal auto‑encoders for video anomaly detection, deep siamese networks for person re‑identification, and stylized adversarial auto‑encoders for image generation—offering free access to the mini‑e‑book.

Alibaba Cloud Developer
Alibaba Cloud Developer
Alibaba Cloud Developer
AI Secrets Behind City Brain: Video Anomaly Detection, Person Re‑ID & Image Generation

City Brain leverages internet infrastructure and abundant urban data to perform real‑time global analysis, efficiently allocate public resources, improve social governance, and promote sustainable city development.

As Alibaba’s technical committee chair Wang Jian noted, each technological revolution—steam, electricity, now data—drives urban civilization forward, making a data‑driven “brain” essential.

Paper 1: Spatio‑Temporal AutoEncoder for Video Anomaly Detection

This work tackles the challenging problem of detecting abnormal events in video streams, which is critical for traffic safety and security monitoring. Inspired by recent advances in action recognition, the authors design a spatio‑temporal auto‑encoder and introduce a weight‑decay prediction error metric. Evaluations on real traffic scenes show the method surpasses previous best results on key indicators.

Paper 2: Deep Siamese Network with Multi‑level Similarity Perception for Person Re‑identification

Person re‑identification aims to match a pedestrian’s image across multiple cameras, a key technology for smart‑city surveillance. The proposed approach combines deep siamese and classification networks, extending similarity perception to multiple levels. Compared with major public datasets, it achieves the highest retrieval accuracy currently reported.

Paper 3: Stylized Adversarial Autoencoder for Image Generation

License‑plate recognition is a foundational model for smart cities, requiring diverse, accurately labeled data. To address data scarcity, the authors adopt a stylized adversarial auto‑encoder that merges content and style features via separate extraction networks, enabling controlled generation of realistic license‑plate images and improving GAN‑based data synthesis.

All three papers are compiled into a mini‑e‑book that can be read or downloaded for free by following the QR‑code instructions.

Original Source

Signed-in readers can open the original source through BestHub's protected redirect.

Sign in to view source
Republication Notice

This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactadmin@besthub.devand we will review it promptly.

Image Generationperson re-identificationcity brainVideo Anomaly Detection
Alibaba Cloud Developer
Written by

Alibaba Cloud Developer

Alibaba's official tech channel, featuring all of its technology innovations.

0 followers
Reader feedback

How this landed with the community

Sign in to like

Rate this article

Was this worth your time?

Sign in to rate
Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.