Tagged articles
17 articles
Page 1 of 1
DataFunTalk
DataFunTalk
Jan 5, 2022 · Artificial Intelligence

Graph-Based Methods for Hot Event Discovery, Long Text Matching, and Ontology Construction in Natural Language Processing

This talk presents a series of graph‑based techniques for natural language processing, including the Story Forest system for hot event discovery, the GIANT framework for ontology creation and user interest modeling, and a divide‑and‑conquer approach to long‑text matching that leverages graph neural networks and community detection.

event detectiongraph mininggraph neural networks
0 likes · 19 min read
Graph-Based Methods for Hot Event Discovery, Long Text Matching, and Ontology Construction in Natural Language Processing
Baidu Geek Talk
Baidu Geek Talk
Sep 13, 2021 · Artificial Intelligence

Upgrading WanFang Academic Paper Retrieval System with PaddleNLP

WanFang upgraded its academic paper retrieval system by adopting PaddleNLP’s Chinese pre‑trained Sentence‑BERT models, using weakly supervised SimCSE data and Milvus vector indexing, compressing the transformer for TensorRT‑accelerated inference, achieving 70% better matching quality and 2600 QPS latency‑optimized performance.

Model DeploymentPaddleNLPSentence-BERT
0 likes · 8 min read
Upgrading WanFang Academic Paper Retrieval System with PaddleNLP
Sohu Tech Products
Sohu Tech Products
Aug 4, 2021 · Artificial Intelligence

Technical Summary of the 2021 Sohu Campus Text Matching Algorithm Competition

This article presents a comprehensive technical summary of the 2021 Sohu Campus Text Matching Algorithm Competition, detailing data characteristics, preprocessing strategies, tokenization choices, positional encoding methods, model architectures using relative encodings such as WoBERT and RoFormer, experimental results, and reflections on future improvements.

Model DesignNLPcompetition
0 likes · 9 min read
Technical Summary of the 2021 Sohu Campus Text Matching Algorithm Competition
58 Tech
58 Tech
Jul 5, 2021 · Artificial Intelligence

Construction of a Virtual Category‑Tag System for 58 Local Services Using Machine Learning

This article describes the end‑to‑end design and implementation of a virtual category‑tag framework for 58 local services, detailing data preparation, tag selection via semantic similarity models, tag mounting, synonym normalization, experimental comparisons of CDSSM, MatchPyramid, BERT, RoBERTa and other techniques, and outlines future improvements.

BERTTaggingsynonym normalization
0 likes · 16 min read
Construction of a Virtual Category‑Tag System for 58 Local Services Using Machine Learning
58 Tech
58 Tech
Jun 16, 2021 · Artificial Intelligence

Improving Text Matching Accuracy in Voice Assistants: Experiments with Siamese Networks, BERT Models, and Advanced Tricks

This article evaluates classic Siamese networks, various BERT‑based pretrained models, and several training tricks such as adversarial training, k‑fold cross‑validation, and model ensembling on both a public similarity‑sentence competition dataset and an internal voice‑assistant standard question matching dataset, ultimately raising accuracy from 97.23 % to 99.5 %.

BERTSiamese NetworkVoice Assistant
0 likes · 15 min read
Improving Text Matching Accuracy in Voice Assistants: Experiments with Siamese Networks, BERT Models, and Advanced Tricks
Meituan Technology Team
Meituan Technology Team
May 27, 2021 · Artificial Intelligence

Standardizing Food Delivery Dish Names: Knowledge Graph Construction and Applications

The paper outlines an end‑to‑end pipeline that standardizes highly personalized food‑delivery dish names by combining rule‑based and BERT‑DSSM text synonym detection with EfficientNet image classification, constructing a multi‑level taxonomy that improves aggregation, supply‑demand analysis, recall ranking and merchant tagging.

Computer VisionKnowledge GraphNLP
0 likes · 17 min read
Standardizing Food Delivery Dish Names: Knowledge Graph Construction and Applications
58 Tech
58 Tech
Aug 3, 2020 · Artificial Intelligence

Intelligent Customer Service Competition: Leveraging AI for Text Matching and Classification

This announcement describes the rise of AI‑driven intelligent customer service, highlights 58.com’s long‑standing system, and introduces a competition that provides real‑world data for participants to develop advanced text‑matching and classification models using state‑of‑the‑art NLP techniques.

DatasetNLPartificial intelligence
0 likes · 3 min read
Intelligent Customer Service Competition: Leveraging AI for Text Matching and Classification
58 Tech
58 Tech
Jul 22, 2020 · Artificial Intelligence

Intelligent Customer Service Competition: Leveraging AI for Text Matching and Classification

The announcement introduces an AI‑driven intelligent customer service competition, highlighting the importance of text matching and classification in NLP, describing 58.com’s existing system, providing a real‑world dataset, and inviting participants to develop precise models using the latest deep‑learning techniques.

AIDatasetNLP
0 likes · 4 min read
Intelligent Customer Service Competition: Leveraging AI for Text Matching and Classification
Tencent Cloud Developer
Tencent Cloud Developer
Jul 8, 2020 · Artificial Intelligence

Graph-Based Chinese Word Embedding (AlphaEmbedding) for Improved Text Matching

AlphaEmbedding builds a weighted graph linking Chinese words, sub‑words, characters and pinyin, then uses random‑walk‑based node2vec training to produce embeddings that capture orthographic and phonetic similarity, markedly improving recall and ranking for homophones, typos and OOV terms in enterprise search.

Chinese NLPgraph computingsemantic similarity
0 likes · 17 min read
Graph-Based Chinese Word Embedding (AlphaEmbedding) for Improved Text Matching
Alibaba Cloud Developer
Alibaba Cloud Developer
Aug 19, 2019 · Artificial Intelligence

How RE2 Boosts FAQ Chatbot Accuracy: A Deep Dive into Text Matching Models

This article explains the design and evaluation of RE2, a lightweight yet expressive text‑matching framework for FAQ‑style chatbots, detailing its five‑layer architecture, block‑wise residual connections, experimental results on SNLI, MultiNLI, SciTail, Quora and WikiQA datasets, and its significant performance improvements in Alibaba’s DingXiaoMi service.

Deep LearningFAQ chatbotIndustrial AI
0 likes · 13 min read
How RE2 Boosts FAQ Chatbot Accuracy: A Deep Dive into Text Matching Models
Meituan Technology Team
Meituan Technology Team
Jun 21, 2018 · Artificial Intelligence

Deep Learning for Text Matching and Ranking at Meituan

Meituan leverages deep‑learning models such as Word2Vec, DSSM, and LSTM‑based encoders within its ClickNet framework to compute text similarity and rank results, integrating rich business features like user location and merchant rating, thereby surpassing traditional TF‑IDF, BM25, and XGBoost approaches and boosting click‑through rates and revenue.

AIDeep LearningRanking Models
0 likes · 27 min read
Deep Learning for Text Matching and Ranking at Meituan
Alibaba Cloud Developer
Alibaba Cloud Developer
Jun 12, 2018 · Artificial Intelligence

Bridge Language Gaps: Join Alibaba’s Cross‑Language Short Text Matching Challenge

Alibaba’s Ali‑Xiaomi team invites researchers worldwide to the CIKM AnalyticCup, a cross‑language short text matching competition aimed at transferring rich language understanding to low‑resource languages, highlighting past successes in precipitation forecasting and encouraging innovative AI solutions for multilingual chatbot services.

AIChatbotCross-language
0 likes · 7 min read
Bridge Language Gaps: Join Alibaba’s Cross‑Language Short Text Matching Challenge