Tagged articles
2 articles
Sohu Tech Products
Nov 26, 2025 · Artificial Intelligence

How Cleanlab Cut Data Review by 34×: A Real‑World Text Classification Case Study

This article walks through a real text‑classification project in which noisy labels inflated the manual review workload to more than 15,000 samples. Applying cleanlab's confident‑learning framework cut the audit set to 438 items, roughly a 34‑fold reduction, while also improving model performance.

Data Quality · Data‑Centric AI · cleanlab
16 min read
DataFunTalk
Jul 3, 2020 · Artificial Intelligence

Confident Learning: Detecting and Cleaning Noisy Labels with cleanlab

This article introduces confident learning, a principled framework for identifying and correcting mislabeled data in machine‑learning datasets. It explains the three‑step process (count, clean, retrain), demonstrates the open‑source cleanlab library with code examples, and presents experimental results on benchmarks such as CIFAR‑10 and ImageNet.

cleanlab · confident learning · data cleaning
13 min read
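
The count, clean, retrain loop mentioned in the confident learning summary can be illustrated with a small pure-Python sketch. This is a simplified illustration under stated assumptions, not the cleanlab API: the function names (`class_thresholds`, `count_step`, `clean_step`) and the toy data are hypothetical, the per-class threshold is taken as the mean predicted probability among examples carrying that label, and the predicted probabilities are assumed to come from a model evaluated out of sample.

```python
# Simplified sketch of the "count" and "clean" steps of confident learning.
# All function names and the toy data are hypothetical, for illustration only.

def class_thresholds(labels, pred_probs, num_classes):
    """Per-class threshold: mean predicted probability of class k
    over the examples whose given label is k."""
    thresholds = []
    for k in range(num_classes):
        probs_k = [p[k] for y, p in zip(labels, pred_probs) if y == k]
        # If a class has no examples, use 1.0 so nothing is confidently assigned to it.
        thresholds.append(sum(probs_k) / len(probs_k) if probs_k else 1.0)
    return thresholds


def count_step(labels, pred_probs, num_classes):
    """Count step: tally (given label, confidently predicted label) pairs
    and collect indices where the two disagree."""
    t = class_thresholds(labels, pred_probs, num_classes)
    counts = [[0] * num_classes for _ in range(num_classes)]
    suspects = []
    for i, (y, p) in enumerate(zip(labels, pred_probs)):
        confident = [k for k in range(num_classes) if p[k] >= t[k]]
        if not confident:
            continue  # the model is not confident about any class for this example
        guess = max(confident, key=lambda k: p[k])
        counts[y][guess] += 1
        if guess != y:
            suspects.append(i)
    return counts, suspects


def clean_step(examples, labels, suspects):
    """Clean step: drop the suspected label errors before retraining."""
    drop = set(suspects)
    return [(x, y) for i, (x, y) in enumerate(zip(examples, labels)) if i not in drop]


# Toy data: example "c" is labeled 0, but the model strongly predicts class 1.
examples = ["a", "b", "c", "d", "e", "f"]
labels = [0, 0, 0, 1, 1, 1]
pred_probs = [
    [0.90, 0.10],
    [0.80, 0.20],
    [0.10, 0.90],
    [0.15, 0.85],
    [0.10, 0.90],
    [0.30, 0.70],
]

counts, suspects = count_step(labels, pred_probs, num_classes=2)
kept = clean_step(examples, labels, suspects)
# Retrain step: fit the model of your choice on `kept` (omitted here).
```

On the toy data, only the example labeled 0 with a 0.90 probability for class 1 is flagged. In a review workflow, a flagged list like `suspects` would be what goes to manual audit instead of the full dataset.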