How Alibaba’s AI Beats the KITTI Benchmark and Revolutionizes Visual Shopping

Alibaba’s AI breakthroughs—from a foot‑scanning shopping demo that lets a Google engineer instantly find matching shoes, to a record‑setting vehicle detection model on KITTI and world‑leading OCR for real‑time image review—showcase the power and commercial potential of modern computer‑vision research.

Alibaba Cloud Developer
Alibaba Cloud Developer
Alibaba Cloud Developer
How Alibaba’s AI Beats the KITTI Benchmark and Revolutionizes Visual Shopping

At CVPR 2017 in Honolulu, Alibaba’s “foot‑shopping” technology attracted attention.

A Google engineer visited Alibaba’s booth and used Taobao’s “拍立淘” feature to scan a person’s shoes; the system accurately identified the same shoe model.

“拍立淘”, launched in 2014, lets users tap the camera icon in the Taobao app, point at a product, and instantly find matching items.

During the China AI Conference (CCAI 2017) in Hangzhou, Alibaba showcased visual‑shopping demos such as “拍立淘” and a virtual try‑on mirror, drawing many visitors.

In May, Alibaba iDST visual computing researcher Hua Xiansheng’s team set a new world record on the KITTI vehicle‑detection benchmark, raising accuracy to 90.46 %.

The team’s solution combines region‑fusion decision making with a context‑aware multi‑task deep neural network to handle multi‑view, pose variation, and occlusion in complex scenes.

Alibaba’s advertising platform also achieved the best result in the ICDAR Robust Reading competition with its OCR technology, far surpassing the runner‑up.

Today the OCR system powers real‑time image review across the entire Alibaba advertising business, processing tens of millions of images daily with over 95 % accuracy and reducing risk‑detection time from days to seconds.

These advances illustrate that computer‑vision research continues to face challenges but also offers abundant opportunities, especially when combined with deep learning and big‑data techniques.

Original Source

Signed-in readers can open the original source through BestHub's protected redirect.

Sign in to view source
Republication Notice

This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactadmin@besthub.devand we will review it promptly.

Computer VisionAIDeep Learningobject detectionOCR
Alibaba Cloud Developer
Written by

Alibaba Cloud Developer

Alibaba's official tech channel, featuring all of its technology innovations.

0 followers
Reader feedback

How this landed with the community

Sign in to like

Rate this article

Was this worth your time?

Sign in to rate
Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.