Short Text Understanding in the Medical Health Domain: Business Scenarios and Technical Practices

This presentation details the challenges and solutions for short‑text understanding in medical health, covering DingXiangYuan’s business scenarios, NLP pipeline components, knowledge‑graph construction, concept mining, and their impact on search, recommendation, and tagging systems.

DataFunTalk
DataFunTalk
DataFunTalk
Short Text Understanding in the Medical Health Domain: Business Scenarios and Technical Practices

The talk introduces short‑text understanding in the medical‑health domain, focusing on DingXiangYuan’s business scenarios and the challenges of processing both professional and consumer medical texts.

It describes various search applications such as the DingXiangYuan forum, drug assistant, doctor app, and e‑commerce platform, highlighting the need to handle topic‑driven, complex, and serious medical content.

The NLP pipeline is divided into five steps: text correction, noun‑phrase extraction, named‑entity recognition, mention extraction, and entity linking, with details on spell checking, candidate generation, CRF+Bi‑LSTM NER, and ranking methods.

Knowledge‑graph construction and content profiling are employed to enrich entities and generate abstract concept tags, using graph embeddings and MDL‑based filtering to ensure high‑quality concepts.

Extracted concepts enable downstream tasks such as query expansion, tag generation, and recommendation in both professional and consumer scenarios, improving relevance and coverage.

The conclusion emphasizes improving entity‑recognition accuracy, mining concepts from user logs, and building a business graph to support search and recommendation pipelines.

Original Source

Signed-in readers can open the original source through BestHub's protected redirect.

Sign in to view source
Republication Notice

This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactadmin@besthub.devand we will review it promptly.

NLPmedical-aiconcept miningShort Text Understanding
DataFunTalk
Written by

DataFunTalk

Dedicated to sharing and discussing big data and AI technology applications, aiming to empower a million data scientists. Regularly hosts live tech talks and curates articles on big data, recommendation/search algorithms, advertising algorithms, NLP, intelligent risk control, autonomous driving, and machine learning/deep learning.

0 followers
Reader feedback

How this landed with the community

Sign in to like

Rate this article

Was this worth your time?

Sign in to rate
Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.