Tagged articles
12 articles
Page 1 of 1
JavaGuide
JavaGuide
Apr 7, 2026 · Information Security

Why Brute‑Force Won’t Cut It for Sensitive‑Word Filtering (And What Actually Works)

The article walks through the evolution of sensitive‑word filtering—from naïve brute‑force scanning to Trie, Aho‑Corasick automaton, Double‑Array Trie, and DFA implementations—detailing their algorithms, time/space complexities, concrete Java code examples, performance trade‑offs, high‑concurrency optimizations, and practical production advice for building a robust content‑moderation system.

Aho-CorasickDFADouble-Array Trie
0 likes · 26 min read
Why Brute‑Force Won’t Cut It for Sensitive‑Word Filtering (And What Actually Works)
dbaplus Community
dbaplus Community
Jan 2, 2026 · Information Security

How We Built a High‑Performance, Low‑Cost Content Moderation System with Trie + Aho‑Corasick

Faced with minutes‑long posting delays and exploding review costs in a fast‑growing social app, the team introduced 24‑hour shift staffing, a local blacklist stored in MySQL, an in‑memory Trie + Aho‑Corasick matcher, Redis‑driven hot updates and a machine‑audit fallback with a feedback loop, dramatically cutting latency, cost and false‑positives.

Aho-CorasickGoRedis
0 likes · 33 min read
How We Built a High‑Performance, Low‑Cost Content Moderation System with Trie + Aho‑Corasick
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Oct 12, 2025 · Backend Development

Building a High‑Performance Content Moderation System with Trie, Aho‑Corasick, Redis, and Go

This article details how to design and implement a scalable, low‑cost content moderation pipeline that combines a local Trie + Aho‑Corasick engine, Redis‑based hot‑updates, MySQL persistence, and third‑party machine‑review fallback to achieve millisecond‑level response, high accuracy, and controllable costs.

Aho-CorasickBackendGo
0 likes · 34 min read
Building a High‑Performance Content Moderation System with Trie, Aho‑Corasick, Redis, and Go
Tech Freedom Circle
Tech Freedom Circle
May 28, 2025 · Backend Development

Designing a 100k QPS Sensitive‑Word Filter with Real‑Time Updates

This article analyzes high‑throughput sensitive‑word filtering by comparing brute‑force, KMP, Trie, double‑array Trie and Aho‑Corasick algorithms, presents their time and space complexities, shows Java implementations for Trie and AC automata, evaluates Netty deployment options, and offers practical optimizations such as asynchronous detection, hot‑reloading, tiered responses, logging and fuzzy matching.

Aho-CorasickAlgorithm OptimizationJava
0 likes · 37 min read
Designing a 100k QPS Sensitive‑Word Filter with Real‑Time Updates
JD Tech Talk
JD Tech Talk
Apr 8, 2025 · Fundamentals

Performance Comparison of String Replacement Algorithms in Java

The article analyzes various Java string‑replacement techniques—including simple String.replace, compiled regular expressions, Aho‑Corasick automaton, and custom Trie implementations—by presenting their designs, object sizes, and benchmark results to guide developers in choosing the most efficient solution for large keyword sets.

Aho-CorasickJavaTrie
0 likes · 13 min read
Performance Comparison of String Replacement Algorithms in Java
JD Cloud Developers
JD Cloud Developers
Apr 8, 2025 · Fundamentals

Which String Replacement Method Is Fastest? A Java Performance Comparison

This article examines various Java string‑replacement techniques—including simple replace, regex, Aho‑Corasick, and custom Trie implementations—by presenting their design, code samples, and detailed performance benchmarks to help developers choose the most efficient solution for large keyword sets.

Aho-CorasickJava performanceTrie
0 likes · 13 min read
Which String Replacement Method Is Fastest? A Java Performance Comparison
Sohu Tech Products
Sohu Tech Products
Nov 22, 2023 · Backend Development

Optimizing a Real‑Time Keyword Matching Service with Aho‑Corasick and Double‑Array Trie

By replacing the naïve double‑loop matcher with a Double‑Array Trie‑based Aho‑Corasick automaton and refactoring the system into a layered name‑and‑data microservice architecture that shards the keyword dictionary and rebuilds the automaton only on version changes, the real‑time keyword‑matching service reduced latency from seconds to milliseconds even at thousands of QPS.

Aho-CorasickJavaMicroservices
0 likes · 17 min read
Optimizing a Real‑Time Keyword Matching Service with Aho‑Corasick and Double‑Array Trie
Architecture Digest
Architecture Digest
Jul 8, 2022 · Fundamentals

Sensitive Word Matching in Vivo's Content Review System: Algorithm Selection and Practical Implementations

The article describes how Vivo's content moderation platform, DiTing, uses algorithm selection—including Aho‑Corasick automaton, combination word matching, and pinyin‑based matching—to efficiently detect sensitive terms in large‑scale text streams, while addressing challenges such as homophones, multi‑character patterns, and performance constraints.

Aho-CorasickAlgorithm OptimizationPinyin Matching
0 likes · 14 min read
Sensitive Word Matching in Vivo's Content Review System: Algorithm Selection and Practical Implementations
MaGe Linux Operations
MaGe Linux Operations
Jan 13, 2018 · Artificial Intelligence

How FlashText Cuts Keyword Search from Days to Minutes

FlashText is an open‑source Python library that dramatically speeds up keyword search and replacement in large text corpora, turning multi‑day regex operations into a fifteen‑minute task by leveraging the Aho‑Corasick algorithm and a Trie‑based dictionary.

Aho-CorasickFlashTextPython
0 likes · 8 min read
How FlashText Cuts Keyword Search from Days to Minutes