Artificial Intelligence 6 min read

Computer Vision in Transportation Workshop – Course Overview and Highlights

The Didi Computer Vision in Transportation workshop teaches fundamentals, advanced domain‑adaptation and lightweight model techniques, and real‑world applications such as driver identification and driving‑scenario analysis, delivered by Didi AI Labs experts, emphasizing practical use cases and cutting‑edge research.

Didi Tech

Jul 13, 2019

Overview

The Didi "Computer Vision in Transportation" workshop focuses on practical applications of visual technology in the transportation domain. It covers the underlying methods, advanced research topics, and real‑world use cases.

Course Structure

1. Introduction to Computer Vision – fundamentals such as representation learning, activation functions, neural network architectures, convolution operators, pooling layers, and batch normalization.

2. Advanced Topics – domain adaptation and lightweight models, addressing challenges like model dependence on expensive labeled data, difficulty of transferring models to new environments, high computational resource demands, and deployment constraints on edge devices.

3. Applications – driver identification (face detection & recognition) and driving‑scenario understanding (vision perception, 3D reconstruction, behavior analysis).

Instructors

• Dr. Che Zhengping – Didi AI Labs

• Prof. Guo Yuhong – Chief Researcher, Didi AI Labs & Carleton University

• Dr. Shen Haifeng – Didi AI Labs (face technology)

• Dr. Li Guangyu – Didi AI Labs (driving‑scenario understanding)

Key Topics in the Syllabus

Computer Vision Basics: representation learning, activation functions, network structures, convolutions, pooling, batch normalization.

Image Classification: datasets, roadmap, classification networks, experiments.

Object Detection: region‑based methods, region‑free methods, experiments.

Domain Adaptation: introduction, CV applications, methods.

Lightweight Models: basics, Inception/Xception, SqueezeNet, MobileNet/V2, ShuffleNet/V2.

Driver Identification: application, overview, experiments.

Driving Scenario Understanding: vision perception, 3D reconstruction, behavior analysis.

Highlights

The workshop emphasizes two frontier topics: domain adaptation to enable models to generalize across different environments, and lightweight models that maintain accuracy while reducing size and computational cost.

Practical examples include driver authentication using face detection/recognition and comprehensive driving‑scenario analysis covering pedestrian detection, vehicle detection, lane‑line segmentation, traffic‑light detection, and 3D semantic reconstruction.

Didi Tech

Official Didi technology account

0 followers

Reader feedback

How this landed with the community

Rate this article

Was this worth your time?

Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.