How PaddleX Enables Early Detection of Malignant Skin Tumors with AI Segmentation

This article examines the urgent need for early skin cancer detection in China, outlines the challenges of dermatological imaging, and details a low‑code PaddleX solution that leverages PP‑LiteSeg‑T for data preparation, model training, optimization, and deployment to improve diagnostic accuracy and efficiency.

Baidu Tech Salon
Baidu Tech Salon
Baidu Tech Salon
How PaddleX Enables Early Detection of Malignant Skin Tumors with AI Segmentation

Background and Motivation

In early 2024, the National Cancer Center reported a malignant skin tumor incidence of 2.4 per 100,000 in China, with roughly 35,000 new cases annually. Advanced melanoma has a five‑year survival rate of only 4.6% and a median survival of 1.42 years, highlighting the critical need for early diagnosis.

Industry Challenges

Dermatology faces high professional thresholds, uneven medical resource distribution, and a shortage of specialists, leading to delayed or missed diagnoses and increased patient burden. Imaging devices also require advanced image‑processing techniques to remain competitive.

Solution Overview

The Zhengzhou University Light Industry College Meikel Studio built an intelligent early‑screening system using the low‑code PaddleX platform. The system captures dermoscopic images, applies the PP‑LiteSeg semantic segmentation model, and automates lesion detection to reduce manual effort and misdiagnosis.

Data Collection and Preparation

To address data scarcity, the team aggregated the ISIC 2018‑2020 datasets, totaling 61,051 images from 2,056 patients covering seven disease categories. After filtering for quality, approximately 5,400 images (about 540 per class) were annotated using LabelMe and converted via PaddleX tools.

Data augmentation was performed with the EasyData platform to balance class distribution.

Model Selection

PaddleX offers two model families; benchmark results favored the lightweight PP‑LiteSeg‑T for its balance of accuracy and inference speed.

Zero‑Code Development and Data Validation

The zero‑code pipeline split and validated the dataset, producing visualizations of labeled samples and class distribution.

Model Training

After validation, the team trained the model with four GPUs, exposing key hyperparameters (learning rate, iteration count) in the front‑end for easy tuning.

Hyperparameter Optimization

Learning rate experiments identified 0.02 as optimal (fixed 1000 iterations). Subsequent iteration‑count experiments raised mIoU from 0.838 to 0.868, achieving a GPU inference time of 5.98 ms.

Deployment and Online Service

The model was deployed via the zero‑code pipeline, enabling API calls and a web UI for single‑image inference. Offline deployment packages are also available for edge devices.

Future Directions

PaddleX’s low‑code, private‑deployment capabilities allow the system to expand to other medical imaging tasks such as spinal NIFTI segmentation, and to integrate into edge devices for real‑time assistance, promoting broader AI adoption in healthcare and other industries.

AIdeep learningimage segmentationMedical ImagingPaddleXSkin Cancer
Baidu Tech Salon
Written by

Baidu Tech Salon

Baidu Tech Salon, organized by Baidu's Technology Management Department, is a monthly offline event that shares cutting‑edge tech trends from Baidu and the industry, providing a free platform for mid‑to‑senior engineers to exchange ideas.

0 followers
Reader feedback

How this landed with the community

Sign in to like

Rate this article

Was this worth your time?

Sign in to rate
Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.