Tag

string matching

IT Services Circle
Dec 24, 2024 · Fundamentals

Understanding the KMP String Matching Algorithm with Implementation and Optimizations

This article explains the Knuth-Morris-Pratt (KMP) string‑matching algorithm, describes how the next array is built to capture prefix‑suffix information, shows Java implementations of the search and next‑array construction, and discusses common optimizations to reduce redundant comparisons.
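As a rough illustration of the next array and the search loop described in this summary, here is a minimal Python sketch. The linked article uses Java; the function names `build_next` and `kmp_search` are assumptions made for this example, and the next array here stores the length of the longest proper prefix that is also a suffix, which is one common convention among several.

```python
def build_next(pattern: str) -> list:
    """nxt[i] = length of the longest proper prefix of pattern[:i+1]
    that is also a suffix of it (one common 'next array' convention)."""
    nxt = [0] * len(pattern)
    k = 0  # length of the currently matched prefix
    for i in range(1, len(pattern)):
        # On mismatch, fall back to the next shorter prefix-suffix.
        while k > 0 and pattern[i] != pattern[k]:
            k = nxt[k - 1]
        if pattern[i] == pattern[k]:
            k += 1
        nxt[i] = k
    return nxt


def kmp_search(text: str, pattern: str) -> int:
    """Return the index of the first occurrence of pattern in text, or -1."""
    if not pattern:
        return 0
    nxt = build_next(pattern)
    k = 0  # number of pattern characters matched so far
    for i, ch in enumerate(text):
        # Reuse prefix-suffix information instead of re-scanning the text.
        while k > 0 and ch != pattern[k]:
            k = nxt[k - 1]
        if ch == pattern[k]:
            k += 1
        if k == len(pattern):
            return i - k + 1
    return -1
```

The text index `i` never moves backwards, which is where the linear running time comes from.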

Tags: KMP, Next Array, Optimization
9 min read
Python Programming Learning Circle
Mar 22, 2024 · Fundamentals

Using FuzzyWuzzy for Fuzzy String Matching in Python

This article introduces the FuzzyWuzzy Python library, explains its underlying Levenshtein distance algorithm, demonstrates how to install it, describes the key functions in the fuzz and process modules, and provides practical examples for matching company names and province fields with complete code snippets.
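To make the Levenshtein distance mentioned in this summary concrete, here is a small pure-Python sketch. It does not use or reproduce FuzzyWuzzy itself: `levenshtein` and `similarity` are hypothetical helper names, and the 0 to 100 score is a simple normalization chosen for illustration, not the library's own ratio formula.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character insertions, deletions,
    and substitutions needed to turn a into b."""
    prev = list(range(len(b) + 1))  # distances from "" to each prefix of b
    for i, ca in enumerate(a, start=1):
        curr = [i]  # distance from a[:i] to ""
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete ca
                            curr[j - 1] + 1,      # insert cb
                            prev[j - 1] + cost))  # substitute or keep
        prev = curr
    return prev[-1]


def similarity(a: str, b: str) -> int:
    """Illustrative 0-100 score (an assumption, not FuzzyWuzzy's formula)."""
    longest = max(len(a), len(b))
    if longest == 0:
        return 100
    return round(100 * (1 - levenshtein(a, b) / longest))
```

A score like this can be compared against a threshold when deciding whether two company or province names should be treated as the same entry.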

Tags: FuzzyWuzzy, Levenshtein, Python
10 min read
php中文网 Courses
Feb 28, 2024 · Backend Development

Using PHP preg_match for Regular Expression Matching

This article introduces PHP's preg_match function, walks through simple and complex regular expression examples (including plain string and email matching), and explains common regex patterns for flexible text matching in backend development.

Tags: backend, php, preg_match
4 min read
Python Programming Learning Circle
Feb 4, 2024 · Fundamentals

Using FuzzyWuzzy for Fuzzy String Matching in Python

This article introduces the FuzzyWuzzy Python library, explains its Levenshtein‑based matching functions (Ratio, Partial Ratio, Token Sort Ratio, Token Set Ratio) and the process module, and demonstrates practical applications for fuzzy matching of company and province names with complete code examples.

Tags: FuzzyWuzzy, Levenshtein Distance, Python
10 min read
Python Programming Learning Circle
Jul 10, 2023 · Fundamentals

Using FuzzyWuzzy for Fuzzy String Matching in Python

This article introduces the Python FuzzyWuzzy library, explains its Levenshtein‑based fuzzy string matching functions such as Ratio, Partial Ratio, Token Sort Ratio and Token Set Ratio, demonstrates how to use the process module for extracting best matches, and provides practical code examples for matching company and province names.

Tags: FuzzyWuzzy, Levenshtein, data cleaning
11 min read
Python Programming Learning Circle
Jun 7, 2023 · Fundamentals

Using FuzzyWuzzy for Fuzzy String Matching in Python

This article introduces the FuzzyWuzzy Python library, explains its Levenshtein‑based matching functions such as Ratio, Partial Ratio, Token Sort Ratio and Token Set Ratio, demonstrates how to install it, and provides practical code examples for merging company and province fields with fuzzy matching thresholds.

Tags: FuzzyWuzzy, Levenshtein, data cleaning
10 min read
Python Programming Learning Circle
Apr 29, 2022 · Fundamentals

Using FuzzyWuzzy for Fuzzy String Matching in Python

This tutorial explains how to use the Python FuzzyWuzzy library, which relies on Levenshtein distance, to perform fuzzy string matching for tasks such as normalizing province or company names, and provides complete code examples and practical applications.

Tags: FuzzyWuzzy, Levenshtein, Python
10 min read
JD Tech
Apr 1, 2022 · Fundamentals

Advanced Matching Algorithms and Graph Data Structures: KMP, Rabin‑Karp, Boyer‑Moore, Trie, Double‑Array Trie, and AC Automaton

This article introduces common graph concepts and surveys string‑matching algorithms and data structures (Brute‑Force, Rabin‑Karp, KMP, Boyer‑Moore, AC automaton, Trie, and Double‑Array Trie), explaining their principles, implementations, complexity analyses, and typical application scenarios for search systems.

Tags: Algorithms, Data Structures, KMP
20 min read
Python Programming Learning Circle
May 21, 2021 · Fundamentals

Using FuzzyWuzzy for Fuzzy String Matching in Python

This article introduces the FuzzyWuzzy Python library, explains its Levenshtein‑based matching functions, shows how to install it, and provides step‑by‑step code examples for fuzzy matching of company and province names using pandas dataframes.

Tags: FuzzyWuzzy, Levenshtein, Python
10 min read
Sohu Tech Products
Apr 15, 2020 · Fundamentals

Optimized String Matching: Boyer‑Moore Algorithm with Bad‑Character and Good‑Suffix Rules

This article explains the Boyer‑Moore string‑matching algorithm, detailing how the bad‑character and good‑suffix heuristics dramatically reduce comparisons, and provides a complete Java implementation with step‑by‑step illustrations of each matching round and the underlying shift calculations.
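The bad‑character heuristic mentioned here can be sketched in a few lines of Python. This is a simplified variant that applies only the bad‑character rule and leaves out the good‑suffix rule entirely, so it is not the full algorithm from the linked article (which is in Java); the names `bad_char_table` and `bm_search_bad_char` are assumptions for this example.

```python
def bad_char_table(pattern: str) -> dict:
    """Map each character to the index of its last occurrence in pattern."""
    return {ch: i for i, ch in enumerate(pattern)}


def bm_search_bad_char(text: str, pattern: str) -> int:
    """Boyer-Moore search using only the bad-character rule.
    Returns the index of the first match, or -1."""
    m, n = len(pattern), len(text)
    if m == 0:
        return 0
    last = bad_char_table(pattern)
    s = 0  # current alignment of pattern against text
    while s <= n - m:
        j = m - 1
        # Compare from the right end of the pattern towards the left.
        while j >= 0 and pattern[j] == text[s + j]:
            j -= 1
        if j < 0:
            return s
        # Align the mismatched text character with its last occurrence
        # in the pattern, or jump past it if it does not occur at all.
        shift = j - last.get(text[s + j], -1)
        s += max(1, shift)  # never shift backwards
    return -1
```

Without the good‑suffix rule the shift can be as small as one position, which is why full implementations take the larger of the two suggested shifts.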

Tags: Boyer-Moore, algorithm, bad character rule
6 min read
Sohu Tech Products
Apr 8, 2020 · Fundamentals

Brute‑Force and Simple Hash‑Based Substring Search (Rabin‑Karp) Explained with Examples

This article explains the brute‑force (BF) substring search algorithm, demonstrates its step‑by‑step operation with examples, then introduces a simple hash‑based sliding‑window method (a basic Rabin‑Karp approach), provides Java code, and shows how to compute and update hashes efficiently to locate a pattern in a main string.
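The sliding‑window hash update described in this summary might look roughly like the following Python sketch. The `base` and `mod` values and the function name `rabin_karp` are assumptions chosen for illustration; the linked article's Java code may use a simpler hash.

```python
def rabin_karp(text: str, pattern: str,
               base: int = 256, mod: int = 1_000_000_007) -> int:
    """Return the index of the first occurrence of pattern in text, or -1."""
    m, n = len(pattern), len(text)
    if m == 0:
        return 0
    if m > n:
        return -1
    high = pow(base, m - 1, mod)  # weight of the window's leading character
    p_hash = w_hash = 0
    for i in range(m):
        p_hash = (p_hash * base + ord(pattern[i])) % mod
        w_hash = (w_hash * base + ord(text[i])) % mod
    for s in range(n - m + 1):
        # Equal hashes only suggest a match; confirm by direct comparison.
        if w_hash == p_hash and text[s:s + m] == pattern:
            return s
        if s < n - m:
            # Roll the window: drop text[s], append text[s + m].
            w_hash = ((w_hash - ord(text[s]) * high) * base
                      + ord(text[s + m])) % mod
    return -1
```

Each window update costs constant time, so the hash never has to be recomputed from scratch as the window slides.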

Tags: Brute Force, Rabin-Karp, algorithm
7 min read
Sohu Tech Products
Mar 25, 2020 · Fundamentals

Understanding Brute‑Force, Boyer‑Moore, and KMP String‑Matching Algorithms with Java Code

This article explains the Brute‑Force (BF), Boyer‑Moore (BM), and Knuth‑Morris‑Pratt (KMP) string‑matching algorithms, illustrates their operation with step‑by‑step visual examples, discusses the inefficiencies of BF, the shift rules of BM, and the prefix‑suffix logic of KMP, and provides a complete Java implementation including the construction of the next array.

Tags: Boyer-Moore, KMP, Next Array
10 min read
360 Quality & Efficiency
Feb 12, 2020 · Fundamentals

Common Regular Expressions and Their Syntax

This article introduces the fundamentals of regular expressions, explains their components, syntax rules such as quantifiers and anchors, and provides a collection of frequently used regex patterns for tasks like email validation, ID numbers, dates, IP addresses, and URLs.

Tags: Validation, pattern syntax, regex
5 min read
Python Programming Learning Circle
Feb 9, 2020 · Fundamentals

Understanding Python's re Module and Regular Expressions

This article introduces Python's re module, explaining regular expression fundamentals, key functions such as match, search, compile, sub, findall, finditer, and split, detailing their syntax, parameters, flags, and providing numerous code examples to illustrate pattern matching, searching, replacing, and splitting strings.
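A short tour of the functions named in this summary, using only the standard library's `re` module. The sample string and variable names are made up for this example.

```python
import re

text = "Order 66 shipped on 2020-02-09; order 7 is pending."

# compile() builds a reusable pattern object.
number = re.compile(r"\d+")

# match() only matches at the start of the string.
starts_with_order = re.match(r"Order", text) is not None
starts_with_shipped = re.match(r"shipped", text) is not None

# search() scans the whole string for the first match.
date = re.search(r"\d{4}-\d{2}-\d{2}", text).group()

# findall() returns all matches as strings; finditer() yields match objects.
numbers = number.findall(text)
first_start = next(number.finditer(text)).start()

# sub() replaces every match; split() cuts the string at each match.
masked = re.sub(r"\d", "#", text)
parts = re.split(r";\s*", text)

# Flags such as re.IGNORECASE change how the pattern is interpreted.
orders = re.findall(r"order", text, flags=re.IGNORECASE)
```

Here `match` fails for `"shipped"` because the word is not at the beginning of the string, while `search` would find it.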

Tags: Python, re module, regex
17 min read
Beike Product & Technology
Aug 23, 2019 · Fundamentals

KMP String Matching Algorithm: Theory, Implementation, and Comparison with Other Search Methods

This article explains the KMP string‑matching algorithm, how it builds the prefix (next) table to avoid the O(M·N) worst‑case of naive search, provides JavaScript implementations, compares it with Boyer‑Moore and V8's indexOf strategy, and includes detailed code examples.

Tags: JavaScript, KMP, Prefix Table
10 min read
Java Captain
May 13, 2019 · Fundamentals

Implementing Sensitive Word Filtering with Trie Trees

This article explains how to use a trie (prefix tree) to efficiently filter sensitive words in a text, covering the basic concepts, construction steps, traversal algorithm, complexity analysis, and a Java implementation using HashMap.
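The trie construction and traversal outlined in this summary can be sketched in Python with nested dictionaries standing in for the HashMap nodes of the linked Java implementation. The names `build_trie` and `mask_sensitive`, the `END` marker, and the longest‑match masking policy are assumptions made for this example.

```python
END = ""  # marker key; never collides with a one-character key


def build_trie(words) -> dict:
    """Build a trie as nested dicts, one level per character."""
    root = {}
    for word in words:
        node = root
        for ch in word:
            node = node.setdefault(ch, {})
        node[END] = True  # a complete word ends at this node
    return root


def mask_sensitive(text: str, trie: dict, mask_char: str = "*") -> str:
    """Replace every longest sensitive-word match in text with mask_char."""
    out = list(text)
    i = 0
    while i < len(text):
        node, j, match_end = trie, i, -1
        # Walk the trie as far as the text allows from position i.
        while j < len(text) and text[j] in node:
            node = node[text[j]]
            j += 1
            if END in node:
                match_end = j  # remember the longest complete word so far
        if match_end != -1:
            out[i:match_end] = mask_char * (match_end - i)
            i = match_end
        else:
            i += 1
    return "".join(out)
```

For example, with the words `bad`, `badger`, and `evil`, the text `a bad evil badge` becomes `a *** **** ***ge`: only the complete word `bad` is masked inside `badge`, because `badger` never finishes matching.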

Tags: Data Structure, Sensitive Word Filtering, Trie
9 min read