Few-Shot Learning Methods and Applications in Meituan NLP

Meituan’s NLP team leverages few‑shot learning—using data‑augmentation, semi‑supervised, ensemble/self‑training, and domain‑adaptation techniques—to cut annotation costs, achieving 1–2 percentage‑point accuracy gains on internal benchmarks and deploying high‑performing models for tasks such as topic classification, fake‑review detection, and sentiment analysis, while planning broader platform and model extensions.

Meituan Technology Team
Meituan Technology Team
Meituan Technology Team
Few-Shot Learning Methods and Applications in Meituan NLP

Meituan’s NLP services involve many tasks that require large amounts of labeled data, leading to high annotation costs. Few‑shot learning aims to achieve good performance with limited data.

We review several few‑shot techniques: data augmentation (sample and embedding augmentation, EDA, back‑translation, Mixup, adversarial training), semi‑supervised learning (consistency regularization, entropy minimization, methods such as Temporal Ensembling, Mean Teacher, VAT, MixMatch, MixText, UDA), ensemble + self‑training, and domain adaptation (MAML, Meta‑Baseline).

Experimental results on Meituan business benchmarks and public Chinese datasets show that embedding‑based augmentation improves accuracy by ~1 pp, semi‑supervised methods add 1.5‑2 pp, and ensemble/self‑training yields similar gains. Active learning reduces the amount of data needed, achieving the performance of 1000 random samples with only 500 actively selected examples.

Practical deployments include medical‑beauty topic classification, travel‑guide detection, medical‑beauty effect labeling, brand labeling, fake‑review detection, POI sentiment analysis, and internal text classification, all achieving high accuracy with a few thousand labeled instances.

Future work focuses on improving existing models, exploring more domain‑transfer techniques, extending experiments to MRC and NER, building a unified few‑shot learning platform, and training a general model for various text tasks.

Original Source

Signed-in readers can open the original source through BestHub's protected redirect.

Sign in to view source
Republication Notice

This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactadmin@besthub.devand we will review it promptly.

data augmentationNLPFew‑Shot LearningSemi-supervised Learningactive learning
Meituan Technology Team
Written by

Meituan Technology Team

Over 10,000 engineers powering China’s leading lifestyle services e‑commerce platform. Supporting hundreds of millions of consumers, millions of merchants across 2,000+ industries. This is the public channel for the tech teams behind Meituan, Dianping, Meituan Waimai, Meituan Select, and related services.

0 followers
Reader feedback

How this landed with the community

Sign in to like

Rate this article

Was this worth your time?

Sign in to rate
Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.