How Alibaba Sports Built AI‑Powered Home Exercise with Real‑Time Pose Detection

This article explains how Alibaba Sports created an AI‑driven home‑exercise solution that uses on‑device pose estimation, describes the underlying MNN inference engine, outlines challenges such as accuracy, performance and testing, and shares the business impact of supporting dozens of workout motions.

Alibaba Cloud Developer
Alibaba Cloud Developer
Alibaba Cloud Developer
How Alibaba Sports Built AI‑Powered Home Exercise with Real‑Time Pose Detection

Background

Over the past year, Alibaba Sports' technology team explored "endpoint intelligence" and launched an AI‑exercise project that digitizes sports, enabling users to exercise at home with only a phone and a small space.

Endpoint Intelligence Practice

After a year of development, the team built a systematic client‑side intelligent sport platform that runs inference on the phone using Alibaba's deep‑learning engine, detects human pose and actions, analyzes trajectories and angles, and provides real‑time feedback and correction for more than ten exercise types.

Technical Support

The core technique uses the MNN inference engine for pose detection:

Real‑time detection of 14 key skeletal points from images or video.

Connecting points to form lines creates action representations for posture, angle and trajectory analysis.

Action‑pose matching enables timing, counting, standardization feedback and interactive correction.

Improving Recognition Accuracy

Accuracy depends on the matching algorithm; improvements focus on stabilizing skeletal points, selecting representative actions for state machines, and ensuring frame rates cover all states. Examples show how mis‑detected arm points or low frame rates lead to incorrect action matching and how historical motion data can be used for correction.

Reducing Performance Consumption

Mobile devices have limited CPU, memory and power, so the three inference stages—pre‑processing, inference, and post‑processing—are optimized. Pre‑processing minimizes format conversions, inference relies on MNN, and post‑processing uses platform‑specific rendering (e.g., Metal on iOS) to lower energy use and heat.

Improving Testing Efficiency

Traditional manual testing is costly and inconsistent. The team built an AI‑exercise automated testing tool that batch‑processes video samples, extracts skeletal data, runs business‑logic tests, and generates reports, dramatically reducing labor, improving coverage, and enabling quantitative evaluation of model accuracy, algorithm efficiency, and performance.

Business Results

The AI‑exercise system now supports dozens of motions, offers AI training courses, and runs continuous "celebrity coaching" sessions to boost user engagement. Ongoing development will expand motion libraries and enrich product features, establishing a distinctive intelligent‑sports brand.

Original Source

Signed-in readers can open the original source through BestHub's protected redirect.

Sign in to view source
Republication Notice

This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactadmin@besthub.devand we will review it promptly.

AIAutomated Testingmobile inferencepose detectionMNN engine
Alibaba Cloud Developer
Written by

Alibaba Cloud Developer

Alibaba's official tech channel, featuring all of its technology innovations.

0 followers
Reader feedback

How this landed with the community

Sign in to like

Rate this article

Was this worth your time?

Sign in to rate
Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.