How Alibaba’s AI Powers Voice Ticketing and Facial Recognition in Shanghai Metro

Alibaba’s AI technology lets Shanghai Metro passengers buy tickets simply by speaking, verifies faces at turnstiles, and analyzes crowd flow in real time. The deployment showcases multimodal voice‑vision interaction, far‑field speech recognition in noisy stations, and advanced computer‑vision techniques.

Alibaba Cloud Developer

Voice Ticketing Machine

Alibaba, Ant Financial and Shanghai Metro jointly launched a new‑generation voice ticketing machine that lets passengers buy tickets using natural spoken language, even in noisy subway environments. Passengers can name a station or a ticket price directly, or describe a destination for fuzzy search, and then pay via an Alipay QR code.
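As a rough illustration of fuzzy destination lookup, the sketch below matches recognized text against a station list by string similarity. The article does not describe how the real system resolves destinations (it mentions semantic understanding), so the station list, the `find_station` helper and the `cutoff` value are all assumptions made for this example.

```python
from difflib import get_close_matches

# Hypothetical, romanized station list used only for illustration.
STATIONS = [
    "People's Square",
    "Lujiazui",
    "Xujiahui",
    "Hongqiao Railway Station",
    "Jing'an Temple",
]


def find_station(query, stations=STATIONS, cutoff=0.6):
    """Return up to three station names that approximately match the query, best first."""
    # Compare in lowercase so recognition casing does not affect the score.
    lowered = {name.lower(): name for name in stations}
    hits = get_close_matches(query.lower(), list(lowered), n=3, cutoff=cutoff)
    return [lowered[hit] for hit in hits]
```

A slightly misrecognized name such as `find_station("xujiahu")` still ranks "Xujiahui" first, while unrelated text returns an empty list so the dialogue can ask the passenger to repeat.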

According to iDST AI expert Yan Zhijie, it is the world’s first far‑field voice interaction product that works accurately amid the heavy noise of public spaces. It uses a multimodal “speech + vision” solution that detects passengers approaching the machine and starts the interaction without a wake‑up word.
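One way to picture wake‑word‑free interaction is as a gate that opens only when visual and acoustic cues agree. The following is a minimal sketch under that assumption; the `Observation` fields, the `should_listen` function and both thresholds are hypothetical and are not taken from the deployed system.

```python
from dataclasses import dataclass


@dataclass
class Observation:
    face_distance_m: float  # estimated distance of the nearest face from the machine
    facing_machine: bool    # whether that face is oriented toward the machine
    lips_moving: bool       # lip-motion flag reported by the vision module
    sound_angle_deg: float  # direction of the dominant sound source
    face_angle_deg: float   # direction of the detected face


def should_listen(obs, max_distance_m=1.0, max_angle_gap_deg=15.0):
    """Start listening only when a nearby, engaged face lines up with the sound source."""
    near = obs.face_distance_m <= max_distance_m
    aligned = abs(obs.sound_angle_deg - obs.face_angle_deg) <= max_angle_gap_deg
    return near and obs.facing_machine and obs.lips_moving and aligned
```

Requiring agreement between the two modalities is what lets such a gate ignore a passer‑by who is talking but not facing the machine, or station announcements that come from a different direction.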

The solution consists of four subsystems: a large microphone array for sound source localization and enhancement; a computer‑vision module for face, eye and lip detection; a multimodal fusion engine that combines the audio and video signals; and a speech pipeline covering far‑field speech recognition, semantic understanding, dialogue management and speech synthesis.
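To make sound source localization concrete, the sketch below estimates the time difference of arrival between two microphones by brute‑force cross‑correlation and converts it to an arrival angle. This is a deliberately simplified two‑microphone illustration, not the large‑array method used in the product; the function names and the far‑field, single‑source assumptions are ours.

```python
import math

SPEED_OF_SOUND = 343.0  # m/s, approximate value in air at room temperature


def estimate_delay(a, b, max_lag):
    """Return the lag in samples by which signal b trails signal a."""
    best_lag, best_score = 0, float("-inf")
    for lag in range(-max_lag, max_lag + 1):
        # Correlate a[i] with b[i + lag], skipping indices outside b.
        score = 0.0
        for i, x in enumerate(a):
            j = i + lag
            if 0 <= j < len(b):
                score += x * b[j]
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag


def arrival_angle_deg(lag, sample_rate, mic_spacing_m):
    """Convert a sample lag into an arrival angle relative to broadside."""
    ratio = SPEED_OF_SOUND * (lag / sample_rate) / mic_spacing_m
    ratio = max(-1.0, min(1.0, ratio))  # clamp so asin stays defined under noise
    return math.degrees(math.asin(ratio))
```

With more microphones, pairwise delays of this kind can be combined to localize the speaker and steer enhancement toward that direction.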

Key technical highlights include high‑accuracy far‑field recognition in noisy stations, wake‑word‑free interaction, and a dialogue system that understands colloquial speech and continuously learns from real passenger conversations.

[Figure: Voice ticketing machine]
[Figure: Voice ticketing demo]

Smart Passenger Flow Analysis

The AI‑driven passenger‑flow analysis system uses video‑based human detection, machine‑learning data analysis and Alibaba Cloud DataV visualization to monitor crowd density, predict congestion, and support emergency response by estimating passenger numbers on platforms and trains.
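The article does not say which forecasting model the system uses. As a stand‑in, the sketch below applies simple exponential smoothing to recent platform head counts and flags congestion when the predicted density crosses a threshold. The function names, the smoothing factor `alpha` and the `max_density` value are assumptions chosen for illustration.

```python
def forecast_next(counts, alpha=0.5):
    """One-step-ahead forecast of passenger count using simple exponential smoothing."""
    if not counts:
        raise ValueError("need at least one observation")
    level = float(counts[0])
    for count in counts[1:]:
        # Blend the newest observation with the running level.
        level = alpha * count + (1 - alpha) * level
    return level


def is_congested(predicted_count, platform_area_m2, max_density=2.0):
    """Flag congestion when predicted people per square metre exceeds max_density."""
    return predicted_count / platform_area_m2 > max_density
```

For example, `is_congested(forecast_next(recent_counts), platform_area_m2=200)` would turn a short history of per‑minute counts (here a hypothetical `recent_counts` list) into a single alert flag for operators.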

The technical challenges addressed include robust human detection with existing low‑resolution cameras under varying lighting, de‑duplication of the same passenger seen by multiple cameras, and ultra‑short‑term flow prediction.
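Cross‑camera de‑duplication can be sketched as grouping detections whose appearance embeddings are similar. The example below assumes each detection already comes with an embedding vector from some upstream model; the greedy grouping strategy, the `count_unique` helper and the 0.9 threshold are illustrative assumptions, not the method described by Alibaba.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def count_unique(detections, threshold=0.9):
    """Greedily group similar embeddings and return the number of distinct groups."""
    representatives = []
    for embedding in detections:
        # A detection starts a new group only if it matches no existing representative.
        if not any(cosine(embedding, rep) >= threshold for rep in representatives):
            representatives.append(embedding)
    return len(representatives)
```

Counting groups rather than raw detections keeps a passenger who appears in two overlapping camera views from being counted twice.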

[Figure: Smart passenger flow analysis]

Facial‑Recognition Turnstile

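Shanghai Metro’s facial‑recognition turnstiles employ Alibaba’s face detection, tracking and verification algorithms. These achieve more than 99.5% accuracy on the LFW (Labeled Faces in the Wild) benchmark and a recognition rate above 95% in 1:3000 scenarios, where one face is matched against a gallery of 3,000 identities, with latency under 200 ms on an Intel i3 PC.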
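A 1:N scenario such as 1:3000 can be illustrated as a nearest‑neighbour search over enrolled face embeddings with a rejection threshold. The sketch below is a minimal example under that assumption; the embedding vectors, the `identify` function and the 0.8 threshold are hypothetical and do not reflect Alibaba's actual algorithms.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def identify(probe, gallery, threshold=0.8):
    """Return the best-matching gallery id, or None if no score reaches the threshold."""
    best_id, best_score = None, threshold
    for person_id, embedding in gallery.items():
        score = cosine(probe, embedding)
        if score >= best_score:
            best_id, best_score = person_id, score
    return best_id
```

Returning `None` when nothing clears the threshold matters at a turnstile: an unenrolled passenger should be rejected rather than matched to the closest enrolled face.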

The system leverages Alibaba Cloud’s distributed computing and storage, offering high concurrency, speed, precision and anti‑spoofing capabilities.

[Figure: Facial recognition turnstile]