Tagged articles
98 articles
Page 1 of 1
Data Party THU
Data Party THU
May 17, 2026 · Artificial Intelligence

Personalizing AI Agents: Memory, Rolling Context, and Advanced Retrieval Techniques

The article explains how AI agents use memory to retain conversation context, why sending the full history to large language models is inefficient, and presents rolling context windows, inverted‑index pruning, semantic embedding retrieval, and GraphRAG as complementary strategies to build more accurate and personalized agents.

AI memoryGraphRAGLLM optimization
0 likes · 10 min read
Personalizing AI Agents: Memory, Rolling Context, and Advanced Retrieval Techniques
DeepHub IMBA
DeepHub IMBA
May 1, 2026 · Artificial Intelligence

How to Build Intelligent Contextual Memory for AI Agents

The article examines why naïvely feeding all dialogue history to large language models is costly and unreliable, and it walks through rolling context windows, inverted‑index pruning, semantic vector search, and GraphRAG as complementary techniques for creating efficient, reasoning‑capable AI agent memory.

AIAgent MemoryContext Window
0 likes · 11 min read
How to Build Intelligent Contextual Memory for AI Agents
macrozheng
macrozheng
Jan 15, 2026 · Databases

Master MySQL Full-Text Search: Inverted Index, Queries, and Best Practices

This guide explains why InnoDB fuzzy queries lose indexes, introduces MySQL full‑text search with inverted indexes, shows how to create and use full‑text indexes via CREATE TABLE and ALTER statements, and demonstrates natural language, boolean, and query‑expansion modes with practical SQL examples.

Boolean ModeFull‑Text Searchdatabase
0 likes · 12 min read
Master MySQL Full-Text Search: Inverted Index, Queries, and Best Practices
Mike Chen's Internet Architecture
Mike Chen's Internet Architecture
Dec 21, 2025 · Backend Development

How Elasticsearch Scales to Billions of Queries: Sharding, Inverted Index, Distributed Execution, and Replication

Elasticsearch achieves billion‑scale search performance by combining horizontal sharding, immutable inverted‑index segments, a two‑stage distributed Query/FETCH model, and multiple replicas with a coordinator node to ensure high concurrency, scalability, and availability.

Distributed QueryElasticsearchReplication
0 likes · 4 min read
How Elasticsearch Scales to Billions of Queries: Sharding, Inverted Index, Distributed Execution, and Replication
Java Tech Enthusiast
Java Tech Enthusiast
Dec 1, 2025 · Databases

Master MySQL Full‑Text Search: Inverted Index, Query Modes, and Practical Examples

This guide explains how InnoDB implements full‑text search with inverted indexes, shows how to create and drop full‑text indexes, demonstrates MATCH() AGAINST() syntax across natural language, boolean, and query‑expansion modes, and covers relevance scoring, stopwords, token size limits, and real‑world query examples.

Full‑Text SearchInnoDBinverted index
0 likes · 14 min read
Master MySQL Full‑Text Search: Inverted Index, Query Modes, and Practical Examples
Mike Chen's Internet Architecture
Mike Chen's Internet Architecture
Jul 10, 2025 · Big Data

Master ElasticSearch: Core Concepts, Architecture, and Search Process Explained

This article provides a comprehensive overview of ElasticSearch, covering its role as a distributed full‑text search engine built on Lucene, key concepts such as index, type, document, field, shard and replica, the analysis pipeline, inverted index mechanics, and the two‑phase query‑fetch search workflow.

AnalysisElasticsearchFull‑Text Search
0 likes · 7 min read
Master ElasticSearch: Core Concepts, Architecture, and Search Process Explained
Architect's Tech Stack
Architect's Tech Stack
Jul 8, 2025 · Databases

Master MySQL Full-Text Search: Indexes, Queries, and Advanced Techniques

This article explains MySQL's InnoDB full‑text search, covering the theory of inverted indexes, how to create and use full‑text indexes with various MATCH…AGAINST modes, Boolean operators, query expansion, relevance calculation, and index removal, illustrated with practical SQL examples and diagrams.

Full‑Text Searchdatabaseinverted index
0 likes · 12 min read
Master MySQL Full-Text Search: Indexes, Queries, and Advanced Techniques
Mike Chen's Internet Architecture
Mike Chen's Internet Architecture
Jul 1, 2025 · Big Data

Master ElasticSearch: Core Concepts, Architecture, and Search Workflow Explained

This article provides a comprehensive overview of ElasticSearch, covering its definition, core components such as indexes, shards and replicas, the analysis pipeline, inverted index mechanics, and the two‑stage search process that enables scalable, fault‑tolerant full‑text search in big‑data environments.

AnalyzersBig DataDistributed Search
0 likes · 7 min read
Master ElasticSearch: Core Concepts, Architecture, and Search Workflow Explained
Selected Java Interview Questions
Selected Java Interview Questions
Aug 27, 2024 · Databases

MySQL Full‑Text Search: Inverted Index, Query Modes, and Usage

This article explains how MySQL InnoDB implements full‑text search using inverted indexes, shows how to create and drop full‑text indexes, and demonstrates the three query modes—Natural Language, Boolean, and Query Expansion—along with their syntax, operators, relevance calculation, and practical examples.

Boolean Modedatabaseinverted index
0 likes · 11 min read
MySQL Full‑Text Search: Inverted Index, Query Modes, and Usage
IT Services Circle
IT Services Circle
Jun 24, 2024 · Databases

Understanding Elasticsearch Architecture: Inverted Index, Term Dictionary, Segments, and Distributed Search

This article explains how Elasticsearch transforms simple keyword matching into a high‑performance, scalable search engine by using inverted indexes, term dictionaries, posting lists, term indexes, stored fields, doc values, segments, and distributed node architectures to achieve fast, reliable full‑text search on massive data sets.

ElasticsearchSegmentTerm Dictionary
0 likes · 16 min read
Understanding Elasticsearch Architecture: Inverted Index, Term Dictionary, Segments, and Distributed Search
Top Architect
Top Architect
Apr 18, 2024 · Big Data

Understanding ElasticSearch Architecture and Its Underlying Lucene Mechanics

This article provides a comprehensive, top‑down and bottom‑up explanation of ElasticSearch’s core architecture, detailing nodes, shards, Lucene segments, inverted indexes, stored fields, document values, caching, query processing, routing, and scaling considerations for efficient search operations.

inverted indexlucenesearch engine
0 likes · 10 min read
Understanding ElasticSearch Architecture and Its Underlying Lucene Mechanics
Code Ape Tech Column
Code Ape Tech Column
Apr 12, 2024 · Databases

Understanding Full-Text Search and Inverted Indexes in MySQL InnoDB

This article explains MySQL InnoDB full-text search, covering inverted index structures, creation and usage of full-text indexes, query modes such as natural language, boolean, and query expansion, and provides practical SQL examples and considerations for effective text searching.

Full‑Text SearchSearchdatabase
0 likes · 11 min read
Understanding Full-Text Search and Inverted Indexes in MySQL InnoDB
Liangxu Linux
Liangxu Linux
Feb 28, 2024 · Databases

Master MySQL InnoDB Full-Text Search: Indexes, Queries, and Advanced Techniques

This article explains how MySQL InnoDB implements full‑text search, covering the underlying inverted index structures, how to create and drop full‑text indexes, the MATCH…AGAINST syntax, and detailed examples of natural language, boolean, and query‑expansion search modes with practical SQL demos.

Boolean ModeFull‑Text Searchinverted index
0 likes · 12 min read
Master MySQL InnoDB Full-Text Search: Indexes, Queries, and Advanced Techniques
Laravel Tech Community
Laravel Tech Community
Feb 4, 2024 · Databases

Understanding Full-Text Search and Inverted Indexes in MySQL InnoDB

This article explains how MySQL InnoDB implements full‑text search using inverted indexes, covers the creation and usage of FULLTEXT indexes, demonstrates various MATCH…AGAINST query modes such as natural language, boolean, and query expansion, and shows how to manage and delete full‑text indexes with practical SQL examples.

Boolean Modeinverted indexmysql
0 likes · 10 min read
Understanding Full-Text Search and Inverted Indexes in MySQL InnoDB
JD Tech
JD Tech
Jan 22, 2024 · Big Data

Efficient High‑Concurrency Data Retrieval Using Inverted Index and Bitmap Techniques

This article explores how to achieve fast, scalable data retrieval in million‑level high‑concurrency scenarios by replacing naïve full‑combination rule matching with column‑wise inverted indexes and bitmap operations, dramatically reducing time complexity and improving stability while leveraging RoaringBitmap compression for space efficiency.

BitmapRoaringBitmaphigh concurrency
0 likes · 12 min read
Efficient High‑Concurrency Data Retrieval Using Inverted Index and Bitmap Techniques
ITPUB
ITPUB
Dec 28, 2023 · Databases

How ByConity 0.3.0 Boosts Text Search and Cold‑Read Performance with Inverted Indexes

ByConity 0.3.0 introduces a new inverted index supporting Chinese tokenization, a shared‑storage leader election mechanism, enhanced cold‑read performance via Prefetch and adaptive mark allocation, and an upgraded ELT pipeline with BSP mode, all detailed with implementation steps, code examples, and future roadmap.

ByConityCold ReadELT
0 likes · 13 min read
How ByConity 0.3.0 Boosts Text Search and Cold‑Read Performance with Inverted Indexes
政采云技术
政采云技术
Dec 19, 2023 · Backend Development

Principles and Simple Implementation of a Search Engine in Go

This article explains the fundamental concepts of search engine technology—including forward and inverted indexes, tokenizers, stop words, synonym handling, ranking algorithms, and NLP integration—and provides a concise Go implementation with code examples and performance testing.

GoNLPTokenizer
0 likes · 21 min read
Principles and Simple Implementation of a Search Engine in Go
Python Programming Learning Circle
Python Programming Learning Circle
Nov 17, 2023 · Big Data

Building a Simple Search Engine with Bloom Filter, Tokenization, and Inverted Index in Python

This article demonstrates how to implement a basic big‑data search engine in Python by creating a Bloom filter for fast existence checks, designing tokenization functions for major and minor segmentation, building an inverted index, and supporting AND/OR queries with example code and execution results.

Big DataSearchbloom-filter
0 likes · 12 min read
Building a Simple Search Engine with Bloom Filter, Tokenization, and Inverted Index in Python
Ximalaya Technology Team
Ximalaya Technology Team
Sep 6, 2023 · Backend Development

Design Analysis of Lucene and In-Memory Inverted Index Service for Advertising Retrieval

The team analyzed Lucene’s disk‑based inverted index and built a custom in‑memory inverted‑index service for Himalaya’s ad engine, encoding terms as 64‑bit keys, supporting real‑time updates and BooleanQuery‑style and custom expression filtering, which cut query latency from ~50 ms to under 5 ms and enabled massive scaling.

Data StructuresMemory Searchinverted index
0 likes · 27 min read
Design Analysis of Lucene and In-Memory Inverted Index Service for Advertising Retrieval
Senior Tony
Senior Tony
Aug 20, 2023 · Fundamentals

Why Elasticsearch Is Called Near‑Real‑Time and How It Works Under the Hood

This article explains Elasticsearch’s near‑real‑time nature, its core mechanisms such as inverted indexes and tokenizers, common interview scenarios, search types, refresh strategies, and the difference between query and filter, helping readers understand when and why to choose ES for full‑text and complex queries.

Query vs FilterSearch Typesinverted index
0 likes · 15 min read
Why Elasticsearch Is Called Near‑Real‑Time and How It Works Under the Hood
DataFunTalk
DataFunTalk
May 9, 2023 · Databases

High‑Performance Inverted Index in Apache Doris for Log Data Storage and Analysis

This article explains how Apache Doris implements a high‑performance, column‑oriented inverted index to address the challenges of massive, real‑time log data storage and analysis, delivering dramatically higher write throughput, lower storage costs, and faster query performance than traditional Elasticsearch and Loki solutions.

Apache DorisBig DataLog Analytics
0 likes · 19 min read
High‑Performance Inverted Index in Apache Doris for Log Data Storage and Analysis
21CTO
21CTO
Feb 11, 2023 · Big Data

Mastering Elasticsearch: Core Concepts, Architecture, and Performance Tips

This comprehensive guide explains what Elasticsearch does, its underlying Lucene technology, core concepts such as clusters, shards, replicas, mapping, indexing and storage mechanisms, and provides practical performance‑tuning advice for building and operating a robust distributed search engine.

ClusterElasticsearchMapping
0 likes · 35 min read
Mastering Elasticsearch: Core Concepts, Architecture, and Performance Tips
Architect
Architect
Jan 26, 2023 · Backend Development

Optimizing Elasticsearch for High‑Concurrency LBS Search with an RLE‑Based Inverted Index

This article details Meituan's search‑engine optimization for its food‑delivery platform, describing the performance bottlenecks of Elasticsearch's inverted‑list query and merge phases, the design of a run‑length‑encoding (RLE) index, custom hash‑map term look‑ups, sparse RoaringBitmap structures, integration steps, and the resulting 84% latency reduction.

ElasticsearchRLEinverted index
0 likes · 27 min read
Optimizing Elasticsearch for High‑Concurrency LBS Search with an RLE‑Based Inverted Index
Top Architect
Top Architect
Dec 24, 2022 · Databases

Elasticsearch Architecture: Inverted Index, Sharding, and Data Operations

This article explains the core concepts of Elasticsearch, including how its inverted index works, the structure of term dictionaries and posting lists, shard and replica configuration, cluster node roles, the detailed write, refresh, flush, and merge processes, as well as how search queries are executed across distributed shards.

distributed architectureinverted indexsearch engine
0 likes · 9 min read
Elasticsearch Architecture: Inverted Index, Sharding, and Data Operations
Baidu Geek Talk
Baidu Geek Talk
Dec 14, 2022 · Databases

How Baidu’s HybridIndexTable Redefined Inverted Index Performance

This article examines Baidu's Limitless ad‑funnel optimization, detailing how a cache‑friendly HybridIndexTable and adaptive memory containers dramatically improve inverted‑list scan speed, reduce memory overhead, and enable lock‑free updates for high‑throughput retrieval systems.

HybridIndexTableLearned IndexRC container
0 likes · 17 min read
How Baidu’s HybridIndexTable Redefined Inverted Index Performance
dbaplus Community
dbaplus Community
Dec 6, 2022 · Backend Development

How Meituan Cut Elasticsearch Search Latency by 84% with an RLE‑Based Inverted Index

This article details Meituan's search‑engine team optimization of Elasticsearch for a high‑traffic LBS scenario, describing the performance bottlenecks in term‑posting retrieval and merging, the design of a run‑length‑encoding (RLE) inverted index, its integration as a plugin, extensive benchmarking, and the resulting 84% reduction in TP99 query latency.

Backend SearchElasticsearchRoaringBitmap
0 likes · 25 min read
How Meituan Cut Elasticsearch Search Latency by 84% with an RLE‑Based Inverted Index
Meituan Technology Team
Meituan Technology Team
Nov 17, 2022 · Backend Development

Elasticsearch Query and Merge Optimization Using Run-Length Encoding for Meituan Takeaway Search

Meituan's food‑delivery search team identified heavy CPU and latency hotspots in Elasticsearch's posting‑list query and merge phases, then redesigned the inverted index using Run‑Length Encoding, hash‑based term lookup, index sorting and a custom SparseRoaringDocIdSet, ultimately reducing TP99 search latency by 84% and cutting CPU usage dramatically.

ElasticsearchIndex SortingRoaringBitmap
0 likes · 26 min read
Elasticsearch Query and Merge Optimization Using Run-Length Encoding for Meituan Takeaway Search
Selected Java Interview Questions
Selected Java Interview Questions
Aug 24, 2022 · Backend Development

Understanding ElasticSearch: Distributed Search, Full‑Text Retrieval, and Inverted Index

This article explains what search is, why traditional databases struggle with full‑text queries, introduces the concepts of inverted indexes and Lucene, and shows how ElasticSearch combines distributed architecture, real‑time analytics, and powerful search features to solve these problems.

Distributed SystemsFull‑Text Searchinverted index
0 likes · 8 min read
Understanding ElasticSearch: Distributed Search, Full‑Text Retrieval, and Inverted Index
Open Source Linux
Open Source Linux
Aug 21, 2022 · Backend Development

How Elasticsearch Achieves Lightning‑Fast Search with Inverted Indexes

This article explains how Elasticsearch leverages inverted indexes, term dictionaries, and compression techniques such as Frame‑of‑Reference and Roaring Bitmaps to enable rapid full‑text search, detailing the underlying data structures, query processing, and practical indexing tips for efficient backend search implementations.

ElasticsearchPostings Listbackend-development
0 likes · 17 min read
How Elasticsearch Achieves Lightning‑Fast Search with Inverted Indexes
Efficient Ops
Efficient Ops
Aug 16, 2022 · Backend Development

How Elasticsearch Delivers Lightning‑Fast Search with Inverted Indexes and Compression

This article explains how Elasticsearch uses inverted indexes, term dictionaries, and advanced compression techniques such as Frame‑of‑Reference and Roaring Bitmaps to achieve rapid search performance while minimizing memory and disk usage, and it also covers practical indexing tips for production use.

ElasticsearchPostings Listbackend-development
0 likes · 15 min read
How Elasticsearch Delivers Lightning‑Fast Search with Inverted Indexes and Compression
Top Architect
Top Architect
Mar 17, 2022 · Big Data

Elasticsearch Overview: Architecture, Core Concepts, and Performance Optimization

This comprehensive guide explains Elasticsearch fundamentals, including data types, Lucene and inverted indexes, cluster and node roles, shard and replica mechanisms, mapping, installation steps, health monitoring, write and storage processes, segment merging, and practical performance tuning tips for large‑scale search deployments.

ClusterElasticsearchinverted index
0 likes · 35 min read
Elasticsearch Overview: Architecture, Core Concepts, and Performance Optimization
Architect's Tech Stack
Architect's Tech Stack
Mar 3, 2022 · Databases

Understanding MySQL InnoDB Full-Text Search and Inverted Index

This article explains why traditional B‑Tree indexes fail for keyword searches, introduces the concept of inverted indexes, shows how to create and use MySQL InnoDB full‑text indexes with MATCH‑AGAINST in various modes, and covers index maintenance and query‑expansion techniques.

Boolean ModeFull‑Text Searchdatabase
0 likes · 12 min read
Understanding MySQL InnoDB Full-Text Search and Inverted Index
Architect
Architect
Feb 6, 2022 · Big Data

Elasticsearch Overview: Architecture, Core Concepts, and Performance Optimization

This article provides a comprehensive introduction to Elasticsearch, covering data types, Lucene fundamentals, inverted indexes, cluster components, node roles, shard and replica mechanisms, mapping, installation, health monitoring, write path, storage strategies, segment management, refresh and translog processes, as well as practical performance and JVM tuning tips.

Cluster ManagementDistributed SearchElasticsearch
0 likes · 37 min read
Elasticsearch Overview: Architecture, Core Concepts, and Performance Optimization
Top Architect
Top Architect
Feb 6, 2022 · Databases

Understanding MySQL InnoDB Full‑Text Search and Inverted Index

This article explains why InnoDB full‑text search is needed for keyword‑based queries, introduces inverted index structures, shows how to create, use, and delete full‑text indexes in MySQL, and demonstrates natural language, boolean, and query‑expansion search modes with practical SQL examples.

Full‑Text Searchdatabaseinverted index
0 likes · 15 min read
Understanding MySQL InnoDB Full‑Text Search and Inverted Index
Java Backend Technology
Java Backend Technology
Jan 18, 2022 · Databases

Master MySQL Full-Text Search: Inverted Indexes, Query Modes, and Optimization

This article explains how InnoDB implements full-text search using inverted indexes, shows how to create and use full-text indexes with various query modes—including natural language, boolean, and query expansion—covers relevance calculation, stopwords, token size limits, and demonstrates how to delete indexes, all illustrated with SQL examples and diagrams.

Boolean ModeFull‑Text Searchinverted index
0 likes · 13 min read
Master MySQL Full-Text Search: Inverted Indexes, Query Modes, and Optimization
Open Source Linux
Open Source Linux
Dec 29, 2021 · Backend Development

How Elasticsearch Achieves Lightning‑Fast Search with Inverted Indexes

This article explains how Elasticsearch uses inverted indexes, term dictionaries, and compression techniques like FOR and Roaring Bitmaps to enable rapid full‑text search, contrasting its approach with traditional relational databases and offering practical indexing tips for large‑scale applications.

ElasticsearchPostings Listcompression
0 likes · 15 min read
How Elasticsearch Achieves Lightning‑Fast Search with Inverted Indexes
政采云技术
政采云技术
Dec 9, 2021 · Fundamentals

Understanding Elasticsearch Inverted Index and Its Optimization Techniques

This article explains how Elasticsearch leverages Lucene's inverted index, term dictionaries, and posting list compression methods such as FST and Frame‑of‑Reference to achieve fast, accurate, and tolerant search performance, while also discussing practical implementation details and examples.

ElasticsearchFSTPosting List Compression
0 likes · 11 min read
Understanding Elasticsearch Inverted Index and Its Optimization Techniques
Open Source Linux
Open Source Linux
Dec 8, 2021 · Backend Development

How Elasticsearch Uses Lucene’s Inverted Index for Lightning‑Fast Search

This article explains how Elasticsearch leverages Lucene’s inverted index, detailing the structure of term dictionaries, postings lists, compression techniques like Frame‑of‑Reference and Roaring Bitmaps, and query optimizations such as filter caches and skip‑list intersections to achieve fast, memory‑efficient search.

Elasticsearchcompressioninverted index
0 likes · 19 min read
How Elasticsearch Uses Lucene’s Inverted Index for Lightning‑Fast Search
Efficient Ops
Efficient Ops
Dec 2, 2021 · Backend Development

How Elasticsearch Achieves Lightning‑Fast Search with Inverted Indexes

This article explains how Elasticsearch uses inverted indexes, term dictionaries, and compression techniques such as Frame‑of‑Reference and Roaring Bitmaps to deliver rapid full‑text search, efficient storage, and fast union queries, while also offering practical indexing tips for production use.

Postings ListRoaring Bitmapcompression
0 likes · 15 min read
How Elasticsearch Achieves Lightning‑Fast Search with Inverted Indexes
Java Interview Crash Guide
Java Interview Crash Guide
Nov 11, 2021 · Big Data

How Elasticsearch Achieves Lightning‑Fast Search with Inverted Indexes and Compression

This article explains how Elasticsearch uses inverted indexes, term dictionaries, and advanced compression techniques like Frame of Reference and Roaring Bitmaps to enable rapid, scalable search over massive datasets, contrasting its approach with traditional relational database queries and detailing practical optimization tips.

ElasticsearchPostings Listcompression
0 likes · 16 min read
How Elasticsearch Achieves Lightning‑Fast Search with Inverted Indexes and Compression
Programmer DD
Programmer DD
Nov 4, 2021 · Backend Development

How Elasticsearch Achieves Near Real-Time Search: Core Mechanisms Explained

This article explains how Elasticsearch implements near real-time search by using immutable inverted indexes, segment merging, shard distribution, and a translog for durability, detailing the challenges of persistence, disk I/O, and data recovery in a distributed environment.

Data PersistenceElasticsearchNear Real-Time Search
0 likes · 9 min read
How Elasticsearch Achieves Near Real-Time Search: Core Mechanisms Explained
21CTO
21CTO
Nov 3, 2021 · Databases

Master MySQL Full‑Text Search: Index Creation, Modes, and Internals

This tutorial explains how MySQL implements full‑text search, covering the creation of full‑text indexes (including Chinese ngram support), the three query modes (natural language, boolean, and query expansion), relevance ranking, underlying inverted‑index structures, cache handling, and common DML operations.

Boolean ModeFull‑Text Searchdatabase
0 likes · 14 min read
Master MySQL Full‑Text Search: Index Creation, Modes, and Internals
21CTO
21CTO
Oct 26, 2021 · Databases

How ElasticSearch Delivers Near Real-Time Search with Immutable Indexes

ElasticSearch achieves near real-time search by building immutable inverted indexes (segments), using incremental indexing, logical deletions, background segment merging, and a write-ahead translog to ensure durability, while distributing shards across nodes to balance load and maintain data consistency.

Near Real-Time SearchSegment Merginginverted index
0 likes · 8 min read
How ElasticSearch Delivers Near Real-Time Search with Immutable Indexes
21CTO
21CTO
Oct 9, 2021 · Backend Development

ElasticSearch Near Real-Time Search: Immutable Indexes, Segments, and Translog

This article explores how ElasticSearch delivers near real‑time search by leveraging immutable inverted indexes, segment merging, shard distribution, and a write‑ahead translog, detailing the challenges of persistence, disk I/O, and data loss prevention in a distributed environment.

Distributed SystemsNear Real-Time SearchSegment Merging
0 likes · 9 min read
ElasticSearch Near Real-Time Search: Immutable Indexes, Segments, and Translog
IT Architects Alliance
IT Architects Alliance
Oct 6, 2021 · Big Data

Understanding Elasticsearch Inverted Index and Efficient Search Retrieval

This article explains how Elasticsearch uses inverted indexes, term dictionaries, and postings lists—along with compression techniques like Frame of Reference and Roaring Bitmaps—to achieve fast, memory‑efficient search queries, and provides practical tips for optimizing indexing and query performance.

ElasticsearchPostings Listcompression
0 likes · 14 min read
Understanding Elasticsearch Inverted Index and Efficient Search Retrieval
IT Architects Alliance
IT Architects Alliance
Sep 29, 2021 · Databases

Understanding Elasticsearch Inverted Index: Fast Retrieval, Compression, and Query Techniques

This article explains how Elasticsearch uses inverted index structures—including term dictionaries, term indexes, and postings lists—combined with compression methods like Frame‑of‑Reference and Roaring Bitmaps to achieve fast search, efficient storage, and effective union queries compared to traditional relational databases.

ElasticsearchPostings ListRoaring Bitmap
0 likes · 14 min read
Understanding Elasticsearch Inverted Index: Fast Retrieval, Compression, and Query Techniques
IT Architects Alliance
IT Architects Alliance
Sep 5, 2021 · Databases

Understanding Elasticsearch Fast Retrieval: Inverted Index, Postings List, and Compression Techniques

This article explains how Elasticsearch achieves rapid search by using inverted indexes, detailing the structure of posting lists, term dictionaries, compression methods like Frame‑of‑Reference and Roaring Bitmaps, and how these techniques enable efficient union queries and filter caching.

ElasticsearchPostings ListRoaring Bitmap
0 likes · 14 min read
Understanding Elasticsearch Fast Retrieval: Inverted Index, Postings List, and Compression Techniques
Architect
Architect
Sep 4, 2021 · Databases

Understanding Elasticsearch Fast Retrieval: Inverted Index, Term Dictionary, and Compression Techniques

This article explains how Elasticsearch achieves fast data retrieval by comparing it with traditional relational databases, detailing search engine fundamentals, the structure of Lucene's inverted index—including term dictionaries, postings lists, and term indexes—and the compression techniques such as Frame of Reference and Roaring Bitmaps that optimize storage and query performance.

ElasticsearchPostings ListRoaring Bitmap
0 likes · 14 min read
Understanding Elasticsearch Fast Retrieval: Inverted Index, Term Dictionary, and Compression Techniques
Java Interview Crash Guide
Java Interview Crash Guide
Jul 2, 2021 · Databases

How Elasticsearch Achieves Lightning‑Fast Search with Inverted Indexes

This article explains how Elasticsearch leverages inverted indexes, term dictionaries, and advanced compression techniques like Frame of Reference and Roaring Bitmaps to enable rapid full‑text search, covering the underlying concepts, data structures, and query optimizations essential for high‑performance search applications.

ElasticsearchPostings Listcompression
0 likes · 17 min read
How Elasticsearch Achieves Lightning‑Fast Search with Inverted Indexes
Selected Java Interview Questions
Selected Java Interview Questions
Jul 1, 2021 · Fundamentals

Understanding Elasticsearch Inverted Index: Posting Lists, Term Dictionary, and Compression Techniques

This article explains how Elasticsearch achieves fast search by using inverted indexes, detailing the structure of posting lists, term dictionaries, term indexes, and compression methods such as Frame of Reference and Roaring Bitmaps, as well as techniques for efficient union and intersection queries.

ElasticsearchPostings ListTerm Dictionary
0 likes · 16 min read
Understanding Elasticsearch Inverted Index: Posting Lists, Term Dictionary, and Compression Techniques
Efficient Ops
Efficient Ops
Jun 23, 2021 · Backend Development

Why Can’t Elasticsearch Find My Logs? Uncovering Full‑Text Search Pitfalls and Tokenizer Tweaks

This article explains why large‑scale Elasticsearch clusters may miss log entries during keyword searches, dives into the fundamentals of inverted indexes and tokenization, and demonstrates practical index‑time and query‑time tokenizer optimizations—including custom analyzers for English and Chinese—to dramatically improve search recall and precision.

ElasticsearchFull‑Text SearchTokenizer
0 likes · 13 min read
Why Can’t Elasticsearch Find My Logs? Uncovering Full‑Text Search Pitfalls and Tokenizer Tweaks
58 Tech
58 Tech
Mar 8, 2021 · Fundamentals

Real‑Time Inverted Index Update Techniques in the 58 Search Engine

This article explains how the 58 search engine achieves millisecond‑level real‑time inverted‑index updates by redesigning the underlying data structures, combining static and dynamic indexing, using a chain‑array hybrid, segment merging strategies, and lock‑free read‑write concurrency while maintaining search performance.

Data StructuresReal-Time UpdateSegment Merging
0 likes · 18 min read
Real‑Time Inverted Index Update Techniques in the 58 Search Engine
Architecture Digest
Architecture Digest
Feb 18, 2021 · Big Data

Elasticsearch Write, Read, and Search Processes: Underlying Mechanisms and Lucene Inverted Index

This article explains how Elasticsearch handles data ingestion, retrieval, and full‑text search by describing the roles of coordinating, primary, and replica nodes, the refresh‑commit‑flush cycle, segment files, translog, and the Lucene‑based inverted index that powers its near‑real‑time capabilities.

ElasticsearchRead ProcessSearch
0 likes · 11 min read
Elasticsearch Write, Read, and Search Processes: Underlying Mechanisms and Lucene Inverted Index
Qunar Tech Salon
Qunar Tech Salon
Feb 4, 2021 · Fundamentals

Understanding Lucene Inverted Index: Principles and Implementation

This article explains the concept of inverted indexes, their role in full‑text search, and provides a detailed overview of how Apache Lucene implements inverted indexing, including term dictionaries, posting lists, query processing, and numeric handling with BKDTree.

BKDTreePosting ListTerm Dictionary
0 likes · 15 min read
Understanding Lucene Inverted Index: Principles and Implementation
Programmer DD
Programmer DD
Jan 28, 2021 · Databases

How Elasticsearch Writes, Reads, and Searches Data: Inside the Engine

This article explains Elasticsearch's internal mechanisms for writing, reading, and searching data, covering the roles of coordinating nodes, primary and replica shards, buffers, translog, segment files, refresh cycles, commit and flush operations, as well as Lucene's inverted index and how deletions and updates are handled.

ElasticsearchSegmentinverted index
0 likes · 10 min read
How Elasticsearch Writes, Reads, and Searches Data: Inside the Engine
Big Data Technology & Architecture
Big Data Technology & Architecture
Dec 24, 2020 · Big Data

Common Techniques for Processing Massive Data Sets

This article summarizes a range of practical methods—including Bloom filters, hashing, bit‑maps, heaps, bucket partitioning, database indexes, inverted indexes, external sorting, trie trees, and MapReduce—that are commonly used to handle, deduplicate, and query extremely large data volumes in big‑data applications.

Big DataHashingHeap
0 likes · 11 min read
Common Techniques for Processing Massive Data Sets
DeWu Technology
DeWu Technology
Dec 4, 2020 · Fundamentals

Introduction to Search Engine Technology and Information Retrieval

The article surveys core search‑engine technology—document hierarchy, flat and vertical inverted indexes, query operators for building and merging score lists, and ranking models from Boolean and BM25 to language‑model approaches like Indri—providing a foundational overview of information retrieval.

BM25information retrievalinverted index
0 likes · 14 min read
Introduction to Search Engine Technology and Information Retrieval
vivo Internet Technology
vivo Internet Technology
Nov 25, 2020 · Databases

Understanding Elasticsearch Architecture, Indexing, and Storage Mechanisms

Elasticsearch combines Lucene’s inverted index with a distributed cluster of master‑eligible, data, and coordinating nodes, using Zen discovery for node election and split‑brain prevention, while indexing writes to primary shards, replicating to replicas, storing immutable segments that are periodically merged for efficient search.

ClusterElasticsearchdistributed architecture
0 likes · 19 min read
Understanding Elasticsearch Architecture, Indexing, and Storage Mechanisms
Architecture Digest
Architecture Digest
Oct 1, 2020 · Big Data

Elasticsearch Overview: Architecture, Core Concepts, and Performance Optimization

This article provides a comprehensive introduction to Elasticsearch, covering data types, the role of Lucene, cluster architecture, node roles, discovery mechanisms, shard and replica management, mapping, installation, health monitoring, indexing workflow, storage internals, refresh and translog processes, segment merging, and practical performance and JVM tuning tips.

ElasticsearchShardinverted index
0 likes · 35 min read
Elasticsearch Overview: Architecture, Core Concepts, and Performance Optimization
Swan Home Tech Team
Swan Home Tech Team
Jul 13, 2020 · Backend Development

Design and Evolution of the DaJia App Search System

This article explains the motivations, requirements, and technical design of the DaJia app's search system, compares relational databases with Lucene‑based solutions, describes the inverted index mechanism, outlines common search workflows, and details the system's three iterative development phases and future improvement plans.

BackendElasticsearchSearch
0 likes · 12 min read
Design and Evolution of the DaJia App Search System
Programmer DD
Programmer DD
Jul 10, 2020 · Fundamentals

How Search Engines Work: Inside Document and Query Processing

This article explains the core components of a search engine—document processing, query processing, and matching—detailing each step from indexing to ranking, and discusses the document features that influence relevance, providing a comprehensive overview of information retrieval fundamentals.

Document ProcessingQuery Processinginformation retrieval
0 likes · 20 min read
How Search Engines Work: Inside Document and Query Processing
Tencent Cloud Developer
Tencent Cloud Developer
Nov 26, 2019 · Backend Development

TurboSearch: Tencent AI Lab's Next-Generation Large-Scale Search System

TurboSearch is Tencent AI Lab's next-generation large-scale search system, delivering distributed massive indexing, high-performance parallel retrieval, multi-granularity and multi-modal vector indexing, private Docker deployment, integrated NLP query analysis, extensible plugins, and robust operations for massive data and diverse search scenarios.

NLPTencent AI LabTurboSearch
0 likes · 14 min read
TurboSearch: Tencent AI Lab's Next-Generation Large-Scale Search System
Big Data Technology Architecture
Big Data Technology Architecture
Aug 9, 2019 · Databases

Understanding Elasticsearch: Architecture, Core Concepts, and Performance Optimization

This article provides a comprehensive overview of Elasticsearch, covering its role in handling structured and unstructured data, core concepts such as Lucene, inverted indexes, clusters, shards, replicas, mapping, indexing processes, storage mechanisms, and practical performance tuning tips for deployment.

ElasticsearchReplicationinverted index
0 likes · 35 min read
Understanding Elasticsearch: Architecture, Core Concepts, and Performance Optimization
Efficient Ops
Efficient Ops
Apr 21, 2019 · Backend Development

Mastering Elasticsearch: From Inverted Index to Distributed Search

This article walks through the fundamentals of search engines, explaining inverted indexes, the explosion of index size, core Elasticsearch concepts, its distributed architecture, and how it powers the ELK stack for log analysis, all illustrated with clear diagrams and examples.

BackendDistributed SystemsELK
0 likes · 6 min read
Mastering Elasticsearch: From Inverted Index to Distributed Search
58 Tech
58 Tech
Mar 7, 2019 · Big Data

In-Memory Inverted Index Compression Algorithms: Overview and MILC Optimization for High‑Performance Search

This article reviews major in‑memory inverted index compression techniques such as PForDelta, PEF, and MILC, explains their principles and trade‑offs, and details practical optimizations applied at 58.com to achieve query performance comparable to uncompressed indexes while reducing memory usage by about 35 percent.

Big DataMILCalgorithm
0 likes · 17 min read
In-Memory Inverted Index Compression Algorithms: Overview and MILC Optimization for High‑Performance Search
MaGe Linux Operations
MaGe Linux Operations
Jan 23, 2019 · Big Data

How Bloom Filters Power Fast Big Data Searches with Python

This tutorial walks through building a simple Python search engine for big data, covering Bloom filter basics, tokenization with major and minor segmentation, inverted index creation, and implementing both simple and complex (AND/OR) queries, complete with code examples and visual illustrations.

AND/OR queriesBig DataPython
0 likes · 15 min read
How Bloom Filters Power Fast Big Data Searches with Python
Dada Group Technology
Dada Group Technology
Dec 27, 2018 · Backend Development

Evolution and Architecture of JD Daojia Location Service

This article explains how JD Daojia’s location system has evolved from simple distance calculations to a high‑performance, index‑driven backend service, detailing its core responsibilities, architectural stages, challenges, and the optimizations that improved accuracy, scalability, and resource usage.

GISdistance-calculationinverted index
0 likes · 10 min read
Evolution and Architecture of JD Daojia Location Service
MaGe Linux Operations
MaGe Linux Operations
Nov 27, 2018 · Big Data

How a Simple Python Bloom Filter Powers Fast Big Data Search

This article demonstrates how to implement a basic Bloom filter, tokenization, and inverted index in Python to illustrate the core principles of big‑data search, including fast negative lookups, term segmentation, and support for AND/OR queries.

AND/OR queriesbig data searchbloom-filter
0 likes · 13 min read
How a Simple Python Bloom Filter Powers Fast Big Data Search
Alibaba Cloud Developer
Alibaba Cloud Developer
Apr 19, 2018 · Databases

How HiTSDB’s New Streaming Aggregation Engine Boosts Query Speed 10×

This article examines the architectural redesign of Alibaba's High‑Performance Time Series Database (HiTSDB), covering storage model changes, inverted‑index enhancements, a pipelined streaming aggregation engine, data‑migration strategies, and performance benchmarks that together deliver over tenfold query speed improvements.

Data MigrationHiTSDBStreaming Aggregation
0 likes · 24 min read
How HiTSDB’s New Streaming Aggregation Engine Boosts Query Speed 10×
Meituan Technology Team
Meituan Technology Team
Mar 22, 2018 · Big Data

High-Performance User Behavior Analysis Solution for Massive Data

The paper describes a high‑performance user‑behavior analysis system that processes hundreds of billions of daily logs for Meituan‑Dianping, using an inverted‑index structure with bitmap UUID sets and timestamp sequences, combined with Spark, Spring and Alluxio optimizations to cut query times from hours to under five seconds.

Big DataOLAP analysisdistributed computing
0 likes · 14 min read
High-Performance User Behavior Analysis Solution for Massive Data
Architecture Digest
Architecture Digest
Feb 1, 2018 · Fundamentals

How Search Engines Work: Building Inverted Indexes

This article explains the core of search engine technology by describing what an inverted index is, how it is built using single‑pass memory and multi‑way merge methods, how indexes can be partitioned and incrementally updated, and how Hadoop can be used for large‑scale indexing.

Big DataHadoopindexing
0 likes · 10 min read
How Search Engines Work: Building Inverted Indexes