Tagged articles

text matching

17 articles · Page 1 of 1

Nov 20, 2024 · Fundamentals

Python Regular Expressions: From Basics to Advanced Usage

This tutorial explains how to use Python's re module for regular expression operations, covering basic string matching, character classes, quantifiers, grouping, greedy vs. non‑greedy matching, substitution, and a practical example of extracting email addresses from text.

programmingre moduleregex

0 likes · 10 min read

Python Regular Expressions: From Basics to Advanced Usage

DataFunTalk

Jan 5, 2022 · Artificial Intelligence

Graph-Based Methods for Hot Event Discovery, Long Text Matching, and Ontology Construction in Natural Language Processing

This talk presents a series of graph‑based techniques for natural language processing, including the Story Forest system for hot event discovery, the GIANT framework for ontology creation and user interest modeling, and a divide‑and‑conquer approach to long‑text matching that leverages graph neural networks and community detection.

Graph Neural Networksevent detectiongraph mining

0 likes · 19 min read

Graph-Based Methods for Hot Event Discovery, Long Text Matching, and Ontology Construction in Natural Language Processing

Baidu Geek Talk

Sep 13, 2021 · Artificial Intelligence

Upgrading WanFang Academic Paper Retrieval System with PaddleNLP

WanFang upgraded its academic paper retrieval system by adopting PaddleNLP’s Chinese pre‑trained Sentence‑BERT models, using weakly supervised SimCSE data and Milvus vector indexing, compressing the transformer for TensorRT‑accelerated inference, achieving 70% better matching quality and 2600 QPS latency‑optimized performance.

Model DeploymentPaddleNLPSentence-BERT

0 likes · 8 min read

Upgrading WanFang Academic Paper Retrieval System with PaddleNLP

Sohu Tech Products

Sep 1, 2021 · Artificial Intelligence

2021 Sohu Text Matching Competition: Model Design, Tricks, and Performance Analysis

This article details the authors' approach to the 2021 Sohu Text Matching competition, describing the task definition, data splits, model architectures (cross‑encoder and bi‑encoder), pretrained language models used, various training tricks, ensemble strategies, and the resulting evaluation scores.

AINLPPretrained Models

0 likes · 8 min read

2021 Sohu Text Matching Competition: Model Design, Tricks, and Performance Analysis

Sohu Tech Products

Aug 4, 2021 · Artificial Intelligence

Technical Summary of the 2021 Sohu Campus Text Matching Algorithm Competition

This article presents a comprehensive technical summary of the 2021 Sohu Campus Text Matching Algorithm Competition, detailing data characteristics, preprocessing strategies, tokenization choices, positional encoding methods, model architectures using relative encodings such as WoBERT and RoFormer, experimental results, and reflections on future improvements.

Model DesignMulti-Task LearningNLP

0 likes · 9 min read

Technical Summary of the 2021 Sohu Campus Text Matching Algorithm Competition

58 Tech

Jul 5, 2021 · Artificial Intelligence

Construction of a Virtual Category‑Tag System for 58 Local Services Using Machine Learning

This article describes the end‑to‑end design and implementation of a virtual category‑tag framework for 58 local services, detailing data preparation, tag selection via semantic similarity models, tag mounting, synonym normalization, experimental comparisons of CDSSM, MatchPyramid, BERT, RoBERTa and other techniques, and outlines future improvements.

BERTTaggingsynonym normalization

0 likes · 16 min read

Construction of a Virtual Category‑Tag System for 58 Local Services Using Machine Learning

58 Tech

Jun 16, 2021 · Artificial Intelligence

Improving Text Matching Accuracy in Voice Assistants: Experiments with Siamese Networks, BERT Models, and Advanced Tricks

This article evaluates classic Siamese networks, various BERT‑based pretrained models, and several training tricks such as adversarial training, k‑fold cross‑validation, and model ensembling on both a public similarity‑sentence competition dataset and an internal voice‑assistant standard question matching dataset, ultimately raising accuracy from 97.23 % to 99.5 %.

BERTSiamese NetworkVoice Assistant

0 likes · 15 min read

Improving Text Matching Accuracy in Voice Assistants: Experiments with Siamese Networks, BERT Models, and Advanced Tricks

Meituan Technology Team

May 27, 2021 · Artificial Intelligence

Standardizing Food Delivery Dish Names: Knowledge Graph Construction and Applications

The paper outlines an end‑to‑end pipeline that standardizes highly personalized food‑delivery dish names by combining rule‑based and BERT‑DSSM text synonym detection with EfficientNet image classification, constructing a multi‑level taxonomy that improves aggregation, supply‑demand analysis, recall ranking and merchant tagging.

Knowledge GraphNLPStandardization

0 likes · 17 min read

Standardizing Food Delivery Dish Names: Knowledge Graph Construction and Applications

TiPaiPai Technical Team

May 14, 2021 · Artificial Intelligence

Mastering Text Matching: From SentenceBERT to Contrastive Learning

This article explores the landscape of text matching in NLP, covering problem types, three model interaction levels, sentence embedding techniques, supervised and unsupervised approaches, and the role of contrastive learning with alignment and uniformity metrics.

BERTNLPcontrastive learning

0 likes · 12 min read

Mastering Text Matching: From SentenceBERT to Contrastive Learning

DataFunTalk

Aug 6, 2020 · Artificial Intelligence

Practical Applications of Pretrained Language Models (BERT, GPT, ELMo) in NetEase Yanxuan NLP Tasks

The article reviews the principles of popular pretrained language models, compares their architectures, and details how NetEase Yanxuan applied BERT, GPT and ELMo to classification, matching, sequence labeling and generation tasks, presenting experimental results and deployment insights.

BERTNLPText Classification

0 likes · 20 min read

Practical Applications of Pretrained Language Models (BERT, GPT, ELMo) in NetEase Yanxuan NLP Tasks

58 Tech

Aug 3, 2020 · Artificial Intelligence

Intelligent Customer Service Competition: Leveraging AI for Text Matching and Classification

This announcement describes the rise of AI‑driven intelligent customer service, highlights 58.com’s long‑standing system, and introduces a competition that provides real‑world data for participants to develop advanced text‑matching and classification models using state‑of‑the‑art NLP techniques.

NLPartificial-intelligencecustomer service

0 likes · 3 min read

Intelligent Customer Service Competition: Leveraging AI for Text Matching and Classification

58 Tech

Jul 22, 2020 · Artificial Intelligence

Intelligent Customer Service Competition: Leveraging AI for Text Matching and Classification

The announcement introduces an AI‑driven intelligent customer service competition, highlighting the importance of text matching and classification in NLP, describing 58.com’s existing system, providing a real‑world dataset, and inviting participants to develop precise models using the latest deep‑learning techniques.

AINLPcustomer service

0 likes · 4 min read

Tencent Cloud Developer

Jul 8, 2020 · Artificial Intelligence

Graph-Based Chinese Word Embedding (AlphaEmbedding) for Improved Text Matching

AlphaEmbedding builds a weighted graph linking Chinese words, sub‑words, characters and pinyin, then uses random‑walk‑based node2vec training to produce embeddings that capture orthographic and phonetic similarity, markedly improving recall and ranking for homophones, typos and OOV terms in enterprise search.

Chinese NLPgraph computingsemantic similarity

0 likes · 17 min read

Graph-Based Chinese Word Embedding (AlphaEmbedding) for Improved Text Matching

Yanxuan Tech Team

Dec 9, 2019 · Artificial Intelligence

How NetEase Yanxuan Leverages BERT, GPT, and ELMo for Real-World NLP Tasks

This article reviews the evolution of language models from bag‑of‑words to BERT, compares ELMo, GPT, and BERT architectures, and details how NetEase Yanxuan applies pre‑trained models to classification, text matching, sequence labeling, and generative tasks in production.

BERTELMoGPT

0 likes · 19 min read

How NetEase Yanxuan Leverages BERT, GPT, and ELMo for Real-World NLP Tasks

Alibaba Cloud Developer

Aug 19, 2019 · Artificial Intelligence

How RE2 Boosts FAQ Chatbot Accuracy: A Deep Dive into Text Matching Models

This article explains the design and evaluation of RE2, a lightweight yet expressive text‑matching framework for FAQ‑style chatbots, detailing its five‑layer architecture, block‑wise residual connections, experimental results on SNLI, MultiNLI, SciTail, Quora and WikiQA datasets, and its significant performance improvements in Alibaba’s DingXiaoMi service.

FAQ chatbotNLPdeep learning

0 likes · 13 min read

How RE2 Boosts FAQ Chatbot Accuracy: A Deep Dive into Text Matching Models

Meituan Technology Team

Jun 21, 2018 · Artificial Intelligence

Deep Learning for Text Matching and Ranking at Meituan

Meituan leverages deep‑learning models such as Word2Vec, DSSM, and LSTM‑based encoders within its ClickNet framework to compute text similarity and rank results, integrating rich business features like user location and merchant rating, thereby surpassing traditional TF‑IDF, BM25, and XGBoost approaches and boosting click‑through rates and revenue.

AIRanking Modelsdeep learning

0 likes · 27 min read

Deep Learning for Text Matching and Ranking at Meituan

Alibaba Cloud Developer

Jun 12, 2018 · Artificial Intelligence

Bridge Language Gaps: Join Alibaba’s Cross‑Language Short Text Matching Challenge

Alibaba’s Ali‑Xiaomi team invites researchers worldwide to the CIKM AnalyticCup, a cross‑language short text matching competition aimed at transferring rich language understanding to low‑resource languages, highlighting past successes in precipitation forecasting and encouraging innovative AI solutions for multilingual chatbot services.

AIChatbotCross-language

0 likes · 7 min read

Bridge Language Gaps: Join Alibaba’s Cross‑Language Short Text Matching Challenge