Tagged articles

inverted index

101 articles · Page 1 of 2

Jul 2, 2026 · Big Data

How Litefuse’s New Single‑Process Mode Lets an Agent Observability Platform Run in 25 seconds

Litefuse open‑sources a single‑process, sub‑400 MB binary that deploys an Agent observability and evaluation platform in about 25 seconds, explains why Docker‑free deployment matters, and details how Apache Doris’s inverted index, VARIANT JSON type, and compute‑storage separation address the massive, long‑text, semi‑structured traces that differentiate Agent monitoring from traditional observability.

Agent observabilityApache DorisLitefuse

0 likes · 12 min read

How Litefuse’s New Single‑Process Mode Lets an Agent Observability Platform Run in 25 seconds

Top Architect

Jun 5, 2026 · Databases

Eliminate LIKE% in MySQL: Use Full‑Text Search for Efficient Fuzzy Queries

This article explains why using LIKE% for fuzzy searches in MySQL is inefficient, introduces InnoDB full‑text search (available since MySQL 5.6), describes inverted indexes, shows how to create and query full‑text indexes with natural language, boolean, and query‑expansion modes, discusses relevance calculation, stopwords, token‑size parameters, and provides the syntax for dropping full‑text indexes.

Boolean ModeFull-Text SearchInnoDB

0 likes · 12 min read

Eliminate LIKE% in MySQL: Use Full‑Text Search for Efficient Fuzzy Queries

Data Party THU

May 17, 2026 · Artificial Intelligence

Personalizing AI Agents: Memory, Rolling Context, and Advanced Retrieval Techniques

The article explains how AI agents use memory to retain conversation context, why sending the full history to large language models is inefficient, and presents rolling context windows, inverted‑index pruning, semantic embedding retrieval, and GraphRAG as complementary strategies to build more accurate and personalized agents.

AI memoryGraphRAGLLM Optimization

0 likes · 10 min read

Personalizing AI Agents: Memory, Rolling Context, and Advanced Retrieval Techniques

DeepHub IMBA

May 1, 2026 · Artificial Intelligence

How to Build Intelligent Contextual Memory for AI Agents

The article examines why naïvely feeding all dialogue history to large language models is costly and unreliable, and it walks through rolling context windows, inverted‑index pruning, semantic vector search, and GraphRAG as complementary techniques for creating efficient, reasoning‑capable AI agent memory.

AIAgent MemoryGraphRAG

0 likes · 11 min read

How to Build Intelligent Contextual Memory for AI Agents

macrozheng

Jan 15, 2026 · Databases

Master MySQL Full-Text Search: Inverted Index, Queries, and Best Practices

This guide explains why InnoDB fuzzy queries lose indexes, introduces MySQL full‑text search with inverted indexes, shows how to create and use full‑text indexes via CREATE TABLE and ALTER statements, and demonstrates natural language, boolean, and query‑expansion modes with practical SQL examples.

Boolean ModeDatabaseFull-Text Search

0 likes · 12 min read

Master MySQL Full-Text Search: Inverted Index, Queries, and Best Practices

Mike Chen's Internet Architecture

Dec 21, 2025 · Backend Development

How Elasticsearch Scales to Billions of Queries: Sharding, Inverted Index, Distributed Execution, and Replication

Elasticsearch achieves billion‑scale search performance by combining horizontal sharding, immutable inverted‑index segments, a two‑stage distributed Query/FETCH model, and multiple replicas with a coordinator node to ensure high concurrency, scalability, and availability.

Distributed QueryElasticsearchSharding

0 likes · 4 min read

How Elasticsearch Scales to Billions of Queries: Sharding, Inverted Index, Distributed Execution, and Replication

Java Tech Enthusiast

Dec 1, 2025 · Databases

Master MySQL Full‑Text Search: Inverted Index, Query Modes, and Practical Examples

This guide explains how InnoDB implements full‑text search with inverted indexes, shows how to create and drop full‑text indexes, demonstrates MATCH() AGAINST() syntax across natural language, boolean, and query‑expansion modes, and covers relevance scoring, stopwords, token size limits, and real‑world query examples.

Full-Text SearchInnoDBMySQL

0 likes · 14 min read

Master MySQL Full‑Text Search: Inverted Index, Query Modes, and Practical Examples

Java Backend Technology

Dec 1, 2025 · Databases

Mastering MySQL Full‑Text Search: Index Types, Queries, and Best Practices

This guide explains how MySQL InnoDB full‑text search works, covering inverted indexes, index creation, query syntax for natural language, boolean, and query‑expansion modes, relevance calculation, and index maintenance.

Boolean ModeFull-Text SearchMySQL

0 likes · 11 min read

Mastering MySQL Full‑Text Search: Index Types, Queries, and Best Practices

Mike Chen's Internet Architecture

Jul 10, 2025 · Big Data

Master ElasticSearch: Core Concepts, Architecture, and Search Process Explained

This article provides a comprehensive overview of ElasticSearch, covering its role as a distributed full‑text search engine built on Lucene, key concepts such as index, type, document, field, shard and replica, the analysis pipeline, inverted index mechanics, and the two‑phase query‑fetch search workflow.

AnalysisElasticsearchFull-Text Search

0 likes · 7 min read

Master ElasticSearch: Core Concepts, Architecture, and Search Process Explained

Architect's Tech Stack

Jul 8, 2025 · Databases

Master MySQL Full-Text Search: Indexes, Queries, and Advanced Techniques

This article explains MySQL's InnoDB full‑text search, covering the theory of inverted indexes, how to create and use full‑text indexes with various MATCH…AGAINST modes, Boolean operators, query expansion, relevance calculation, and index removal, illustrated with practical SQL examples and diagrams.

DatabaseFull-Text SearchMySQL

0 likes · 12 min read

Master MySQL Full-Text Search: Indexes, Queries, and Advanced Techniques

Mike Chen's Internet Architecture

Jul 1, 2025 · Big Data

Master ElasticSearch: Core Concepts, Architecture, and Search Workflow Explained

This article provides a comprehensive overview of ElasticSearch, covering its definition, core components such as indexes, shards and replicas, the analysis pipeline, inverted index mechanics, and the two‑stage search process that enables scalable, fault‑tolerant full‑text search in big‑data environments.

AnalyzersBig DataDistributed Search

0 likes · 7 min read

Master ElasticSearch: Core Concepts, Architecture, and Search Workflow Explained

Architect's Tech Stack

Jun 6, 2025 · Databases

Master MySQL Full-Text Search: Inverted Indexes, Queries, and Best Practices

This article explains how MySQL's InnoDB engine implements full‑text search using inverted indexes, shows how to create and use full‑text indexes with various query modes, demonstrates Boolean operators and query expansion, and covers index removal techniques.

DatabaseFull-Text SearchMySQL

0 likes · 12 min read

Master MySQL Full-Text Search: Inverted Indexes, Queries, and Best Practices

Java Architect Essentials

Mar 14, 2025 · Databases

Mastering MySQL Full-Text Search: Inverted Index, Query Modes, and Best Practices

This article explains how MySQL InnoDB full‑text search works, covering inverted index structures, creation and deletion of full‑text indexes, query syntax for natural language, boolean, and query‑expansion modes, relevance calculation, and practical examples with SQL code.

DatabaseFull-Text SearchMySQL

0 likes · 12 min read

Mastering MySQL Full-Text Search: Inverted Index, Query Modes, and Best Practices

JavaEdge

Dec 22, 2024 · Backend Development

How LinkedIn Powers Lightning‑Fast Message Search with RocksDB, Lucene, and In‑Memory Indexing

LinkedIn’s message search system stores messages in RocksDB, builds Lucene inverted indexes on demand, partitions them by user, keeps indexes in memory, and uses a coordinator with D2/Zookeeper for node routing, enabling rapid, cost‑effective searches while minimizing write overhead.

LinkedInLuceneRocksDB

0 likes · 9 min read

How LinkedIn Powers Lightning‑Fast Message Search with RocksDB, Lucene, and In‑Memory Indexing

Selected Java Interview Questions

Aug 27, 2024 · Databases

MySQL Full‑Text Search: Inverted Index, Query Modes, and Usage

This article explains how MySQL InnoDB implements full‑text search using inverted indexes, shows how to create and drop full‑text indexes, and demonstrates the three query modes—Natural Language, Boolean, and Query Expansion—along with their syntax, operators, relevance calculation, and practical examples.

Boolean ModeDatabaseMySQL

0 likes · 11 min read

MySQL Full‑Text Search: Inverted Index, Query Modes, and Usage

IT Services Circle

Jun 24, 2024 · Databases

Understanding Elasticsearch Architecture: Inverted Index, Term Dictionary, Segments, and Distributed Search

This article explains how Elasticsearch transforms simple keyword matching into a high‑performance, scalable search engine by using inverted indexes, term dictionaries, posting lists, term indexes, stored fields, doc values, segments, and distributed node architectures to achieve fast, reliable full‑text search on massive data sets.

ElasticsearchLuceneSearch Engine

0 likes · 16 min read

Understanding Elasticsearch Architecture: Inverted Index, Term Dictionary, Segments, and Distributed Search

360 Smart Cloud

May 23, 2024 · Big Data

Archer Engine: Integrating Inverted Index with Iceberg for Scalable Big Data Log Analytics

The article introduces Archer, a new big‑data warehouse engine built on Iceberg that adds an inverted‑index mechanism using Tantivy to provide full‑text and JSON search, storage‑compute separation, and significant performance gains over traditional Elasticsearch and Iceberg connectors.

Archer EngineBig DataParquet

0 likes · 9 min read

Archer Engine: Integrating Inverted Index with Iceberg for Scalable Big Data Log Analytics

Programmer XiaoFu

May 16, 2024 · Databases

MySQL Fuzzy Search Without LIKE%: Using Full‑Text Indexes

This article explains how InnoDB full‑text search can replace LIKE% fuzzy queries by using inverted indexes, covering index structures, creation syntax, MATCH() AGAINST() modes, practical demos, query expansion, and index removal, while noting when to prefer external search engines.

Boolean ModeFull-Text SearchMySQL

0 likes · 12 min read

MySQL Fuzzy Search Without LIKE%: Using Full‑Text Indexes

Top Architect

Apr 18, 2024 · Big Data

Understanding ElasticSearch Architecture and Its Underlying Lucene Mechanics

This article provides a comprehensive, top‑down and bottom‑up explanation of ElasticSearch’s core architecture, detailing nodes, shards, Lucene segments, inverted indexes, stored fields, document values, caching, query processing, routing, and scaling considerations for efficient search operations.

LuceneSearch EngineSharding

0 likes · 10 min read

Understanding ElasticSearch Architecture and Its Underlying Lucene Mechanics

Code Ape Tech Column

Apr 12, 2024 · Databases

Understanding Full-Text Search and Inverted Indexes in MySQL InnoDB

This article explains MySQL InnoDB full-text search, covering inverted index structures, creation and usage of full-text indexes, query modes such as natural language, boolean, and query expansion, and provides practical SQL examples and considerations for effective text searching.

DatabaseFull-Text SearchMySQL

0 likes · 11 min read

Understanding Full-Text Search and Inverted Indexes in MySQL InnoDB

Liangxu Linux

Feb 28, 2024 · Databases

Master MySQL InnoDB Full-Text Search: Indexes, Queries, and Advanced Techniques

This article explains how MySQL InnoDB implements full‑text search, covering the underlying inverted index structures, how to create and drop full‑text indexes, the MATCH…AGAINST syntax, and detailed examples of natural language, boolean, and query‑expansion search modes with practical SQL demos.

Boolean ModeFull-Text SearchMySQL

0 likes · 12 min read

Master MySQL InnoDB Full-Text Search: Indexes, Queries, and Advanced Techniques

Laravel Tech Community

Feb 4, 2024 · Databases

Understanding Full-Text Search and Inverted Indexes in MySQL InnoDB

This article explains how MySQL InnoDB implements full‑text search using inverted indexes, covers the creation and usage of FULLTEXT indexes, demonstrates various MATCH…AGAINST query modes such as natural language, boolean, and query expansion, and shows how to manage and delete full‑text indexes with practical SQL examples.

Boolean ModeMySQLQuery Expansion

0 likes · 10 min read

Architecture Digest

Jan 23, 2024 · Databases

MySQL InnoDB Full‑Text Search: Inverted Index, Query Modes, and Practical Examples

This article explains how MySQL InnoDB implements full‑text search using inverted indexes, shows how to create and drop FULLTEXT indexes, and demonstrates natural language, boolean, and query‑expansion search modes with concrete SQL examples and operator details.

Boolean ModeDatabaseFull-Text Search

0 likes · 10 min read

MySQL InnoDB Full‑Text Search: Inverted Index, Query Modes, and Practical Examples

JD Tech

Jan 22, 2024 · Big Data

Efficient High‑Concurrency Data Retrieval Using Inverted Index and Bitmap Techniques

This article explores how to achieve fast, scalable data retrieval in million‑level high‑concurrency scenarios by replacing naïve full‑combination rule matching with column‑wise inverted indexes and bitmap operations, dramatically reducing time complexity and improving stability while leveraging RoaringBitmap compression for space efficiency.

High concurrencyRoaringBitmapRule Engine

0 likes · 12 min read

Efficient High‑Concurrency Data Retrieval Using Inverted Index and Bitmap Techniques

ITPUB

Dec 28, 2023 · Databases

How ByConity 0.3.0 Boosts Text Search and Cold‑Read Performance with Inverted Indexes

ByConity 0.3.0 introduces a new inverted index supporting Chinese tokenization, a shared‑storage leader election mechanism, enhanced cold‑read performance via Prefetch and adaptive mark allocation, and an upgraded ELT pipeline with BSP mode, all detailed with implementation steps, code examples, and future roadmap.

ByConityCold ReadDatabase

0 likes · 13 min read

How ByConity 0.3.0 Boosts Text Search and Cold‑Read Performance with Inverted Indexes

政采云技术

Dec 19, 2023 · Backend Development

Principles and Simple Implementation of a Search Engine in Go

This article explains the fundamental concepts of search engine technology—including forward and inverted indexes, tokenizers, stop words, synonym handling, ranking algorithms, and NLP integration—and provides a concise Go implementation with code examples and performance testing.

GoInformation RetrievalNLP

0 likes · 21 min read

Principles and Simple Implementation of a Search Engine in Go

Python Programming Learning Circle

Nov 17, 2023 · Big Data

Building a Simple Search Engine with Bloom Filter, Tokenization, and Inverted Index in Python

This article demonstrates how to implement a basic big‑data search engine in Python by creating a Bloom filter for fast existence checks, designing tokenization functions for major and minor segmentation, building an inverted index, and supporting AND/OR queries with example code and execution results.

Big DataSearchTokenization

0 likes · 12 min read

Building a Simple Search Engine with Bloom Filter, Tokenization, and Inverted Index in Python

Java Architect Essentials

Nov 6, 2023 · Databases

Understanding MySQL InnoDB Full-Text Search and Inverted Indexes

This article explains why full‑text search is needed, how InnoDB implements it with inverted indexes, the syntax for creating and using full‑text indexes, the different query modes (natural language, boolean, query expansion), and how to delete full‑text indexes in MySQL.

DatabaseMySQLQuery Optimization

0 likes · 10 min read

Understanding MySQL InnoDB Full-Text Search and Inverted Indexes

Ximalaya Technology Team

Sep 6, 2023 · Backend Development

Design Analysis of Lucene and In-Memory Inverted Index Service for Advertising Retrieval

The team analyzed Lucene’s disk‑based inverted index and built a custom in‑memory inverted‑index service for Himalaya’s ad engine, encoding terms as 64‑bit keys, supporting real‑time updates and BooleanQuery‑style and custom expression filtering, which cut query latency from ~50 ms to under 5 ms and enabled massive scaling.

Data StructuresJavaLucene

0 likes · 27 min read

Design Analysis of Lucene and In-Memory Inverted Index Service for Advertising Retrieval

Senior Tony

Aug 20, 2023 · Fundamentals

Why Elasticsearch Is Called Near‑Real‑Time and How It Works Under the Hood

This article explains Elasticsearch’s near‑real‑time nature, its core mechanisms such as inverted indexes and tokenizers, common interview scenarios, search types, refresh strategies, and the difference between query and filter, helping readers understand when and why to choose ES for full‑text and complex queries.

Query vs FilterSearch Typesinverted index

0 likes · 15 min read

Why Elasticsearch Is Called Near‑Real‑Time and How It Works Under the Hood

JD Cloud Developers

Jul 25, 2023 · Backend Development

Cutting Rule Matching Latency: From Exponential to Linear with Inverted Index & Bitmaps

This article explains how the Promise time‑control system replaces a naïve exponential rule‑matching approach with a column‑wise inverted index and bitmap bitwise operations, reducing computational complexity from 2ⁿ to n while also addressing space efficiency through RoaringBitmap compression.

Rule Enginebackendbitmap

0 likes · 9 min read

Cutting Rule Matching Latency: From Exponential to Linear with Inverted Index & Bitmaps

DataFunTalk

May 9, 2023 · Databases

High‑Performance Inverted Index in Apache Doris for Log Data Storage and Analysis

This article explains how Apache Doris implements a high‑performance, column‑oriented inverted index to address the challenges of massive, real‑time log data storage and analysis, delivering dramatically higher write throughput, lower storage costs, and faster query performance than traditional Elasticsearch and Loki solutions.

Apache DorisBig DataLog Analytics

0 likes · 19 min read

High‑Performance Inverted Index in Apache Doris for Log Data Storage and Analysis

IT Architects Alliance

Mar 2, 2023 · Cloud Native

How Prometheus V2 Stores Time‑Series Data: Disk Formats and Query Mechanics

This article provides an in‑depth analysis of Prometheus V2's storage architecture, detailing the on‑disk block layout, chunk and index formats, the inverted index structure, memory representations, and the step‑by‑step query process that locates matching time‑series data.

Cloud NativeGoPrometheus

0 likes · 13 min read

How Prometheus V2 Stores Time‑Series Data: Disk Formats and Query Mechanics

21CTO

Feb 11, 2023 · Big Data

Mastering Elasticsearch: Core Concepts, Architecture, and Performance Tips

This comprehensive guide explains what Elasticsearch does, its underlying Lucene technology, core concepts such as clusters, shards, replicas, mapping, indexing and storage mechanisms, and provides practical performance‑tuning advice for building and operating a robust distributed search engine.

ElasticsearchMappingPerformance Optimization

0 likes · 35 min read

Architect

Jan 26, 2023 · Backend Development

Optimizing Elasticsearch for High‑Concurrency LBS Search with an RLE‑Based Inverted Index

This article details Meituan's search‑engine optimization for its food‑delivery platform, describing the performance bottlenecks of Elasticsearch's inverted‑list query and merge phases, the design of a run‑length‑encoding (RLE) index, custom hash‑map term look‑ups, sparse RoaringBitmap structures, integration steps, and the resulting 84% latency reduction.

ElasticsearchJavaPerformance Optimization

0 likes · 27 min read

Optimizing Elasticsearch for High‑Concurrency LBS Search with an RLE‑Based Inverted Index

Java High-Performance Architecture

Dec 30, 2022 · Backend Development

How Elasticsearch Powers Real-Time Search: Inverted Index, Sharding, and Write Mechanics

This article explains Elasticsearch’s core concepts—including inverted indexes, shard architecture, node roles, and the detailed write‑read‑search workflow—so readers can grasp how the system achieves near‑real‑time search and reliable data storage.

ElasticsearchSearch EngineSharding

0 likes · 10 min read

How Elasticsearch Powers Real-Time Search: Inverted Index, Sharding, and Write Mechanics

IT Architects Alliance

Dec 25, 2022 · Backend Development

How Elasticsearch Inverted Index Works: From Sharding to Real‑Time Search

This article explains Elasticsearch's inverted index structure, sharding and replica concepts, cluster node roles, and the detailed write, read, search, delete, and update processes, illustrating how data moves from memory to disk and becomes searchable in near real‑time.

ElasticsearchRead ProcessSearch

0 likes · 9 min read

How Elasticsearch Inverted Index Works: From Sharding to Real‑Time Search

Top Architect

Dec 24, 2022 · Databases

Elasticsearch Architecture: Inverted Index, Sharding, and Data Operations

This article explains the core concepts of Elasticsearch, including how its inverted index works, the structure of term dictionaries and posting lists, shard and replica configuration, cluster node roles, the detailed write, refresh, flush, and merge processes, as well as how search queries are executed across distributed shards.

Search EngineShardingdistributed architecture

0 likes · 9 min read

Elasticsearch Architecture: Inverted Index, Sharding, and Data Operations

Efficient Ops

Dec 21, 2022 · Big Data

How Elasticsearch Leverages Lucene’s Inverted Index for Real‑Time Distributed Search

This article explains the fundamentals of structured and unstructured data, introduces Lucene’s inverted index, and details how Elasticsearch builds on Lucene to provide distributed, near‑real‑time search with concepts such as clusters, shards, replicas, routing, and performance optimizations.

Distributed SearchElasticsearchLucene

0 likes · 36 min read

How Elasticsearch Leverages Lucene’s Inverted Index for Real‑Time Distributed Search

Baidu Geek Talk

Dec 14, 2022 · Databases

How Baidu’s HybridIndexTable Redefined Inverted Index Performance

This article examines Baidu's Limitless ad‑funnel optimization, detailing how a cache‑friendly HybridIndexTable and adaptive memory containers dramatically improve inverted‑list scan speed, reduce memory overhead, and enable lock‑free updates for high‑throughput retrieval systems.

Cache OptimizationHybridIndexTableLearned Index

0 likes · 17 min read

How Baidu’s HybridIndexTable Redefined Inverted Index Performance

dbaplus Community

Dec 6, 2022 · Backend Development

How Meituan Cut Elasticsearch Search Latency by 84% with an RLE‑Based Inverted Index

This article details Meituan's search‑engine team optimization of Elasticsearch for a high‑traffic LBS scenario, describing the performance bottlenecks in term‑posting retrieval and merging, the design of a run‑length‑encoding (RLE) inverted index, its integration as a plugin, extensive benchmarking, and the resulting 84% reduction in TP99 query latency.

Backend SearchElasticsearchRoaringBitmap

0 likes · 25 min read

How Meituan Cut Elasticsearch Search Latency by 84% with an RLE‑Based Inverted Index

Meituan Technology Team

Nov 17, 2022 · Backend Development

Elasticsearch Query and Merge Optimization Using Run-Length Encoding for Meituan Takeaway Search

Meituan's food‑delivery search team identified heavy CPU and latency hotspots in Elasticsearch's posting‑list query and merge phases, then redesigned the inverted index using Run‑Length Encoding, hash‑based term lookup, index sorting and a custom SparseRoaringDocIdSet, ultimately reducing TP99 search latency by 84% and cutting CPU usage dramatically.

ElasticsearchIndex SortingJava

0 likes · 26 min read

Elasticsearch Query and Merge Optimization Using Run-Length Encoding for Meituan Takeaway Search

Top Architect

Sep 16, 2022 · Big Data

Understanding ElasticSearch: Distributed Search, Full‑Text Retrieval, and Inverted Index

This article explains the fundamentals of search, why traditional databases struggle with large‑scale text queries, introduces full‑text search and inverted indexes, describes Lucene as the core library, and details ElasticSearch's distributed architecture, features, and common use cases.

Big DataFull-Text SearchSearch Engine

0 likes · 7 min read

Understanding ElasticSearch: Distributed Search, Full‑Text Retrieval, and Inverted Index

Selected Java Interview Questions

Aug 24, 2022 · Backend Development

Understanding ElasticSearch: Distributed Search, Full‑Text Retrieval, and Inverted Index

This article explains what search is, why traditional databases struggle with full‑text queries, introduces the concepts of inverted indexes and Lucene, and shows how ElasticSearch combines distributed architecture, real‑time analytics, and powerful search features to solve these problems.

Full-Text SearchSearch Enginedistributed systems

0 likes · 8 min read

Open Source Linux

Aug 21, 2022 · Backend Development

How Elasticsearch Achieves Lightning‑Fast Search with Inverted Indexes

This article explains how Elasticsearch leverages inverted indexes, term dictionaries, and compression techniques such as Frame‑of‑Reference and Roaring Bitmaps to enable rapid full‑text search, detailing the underlying data structures, query processing, and practical indexing tips for efficient backend search implementations.

Backend DevelopmentElasticsearchPostings List

0 likes · 17 min read

How Elasticsearch Achieves Lightning‑Fast Search with Inverted Indexes

Efficient Ops

Aug 16, 2022 · Backend Development

How Elasticsearch Delivers Lightning‑Fast Search with Inverted Indexes and Compression

This article explains how Elasticsearch uses inverted indexes, term dictionaries, and advanced compression techniques such as Frame‑of‑Reference and Roaring Bitmaps to achieve rapid search performance while minimizing memory and disk usage, and it also covers practical indexing tips for production use.

Backend DevelopmentElasticsearchPostings List

0 likes · 15 min read

How Elasticsearch Delivers Lightning‑Fast Search with Inverted Indexes and Compression

Senior Brother's Insights

May 10, 2022 · Backend Development

Mastering Elasticsearch: Core Concepts, Architecture, and Performance Tuning

This comprehensive guide explains what Elasticsearch does, its underlying Lucene engine, core concepts like clusters, shards, replicas, mappings, and provides practical steps for installation, configuration, indexing, storage mechanics, and performance optimization.

LuceneShardingcluster management

0 likes · 36 min read

Mastering Elasticsearch: Core Concepts, Architecture, and Performance Tuning

Top Architect

Mar 17, 2022 · Big Data

Elasticsearch Overview: Architecture, Core Concepts, and Performance Optimization

This comprehensive guide explains Elasticsearch fundamentals, including data types, Lucene and inverted indexes, cluster and node roles, shard and replica mechanisms, mapping, installation steps, health monitoring, write and storage processes, segment merging, and practical performance tuning tips for large‑scale search deployments.

ElasticsearchPerformance OptimizationSearch Engine

0 likes · 35 min read

Elasticsearch Overview: Architecture, Core Concepts, and Performance Optimization

Architect's Tech Stack

Mar 3, 2022 · Databases

Understanding MySQL InnoDB Full-Text Search and Inverted Index

This article explains why traditional B‑Tree indexes fail for keyword searches, introduces the concept of inverted indexes, shows how to create and use MySQL InnoDB full‑text indexes with MATCH‑AGAINST in various modes, and covers index maintenance and query‑expansion techniques.

Boolean ModeDatabaseFull-Text Search

0 likes · 12 min read

Understanding MySQL InnoDB Full-Text Search and Inverted Index

Liangxu Linux

Feb 8, 2022 · Databases

Master MySQL InnoDB Full-Text Search: Indexes, Queries, and Advanced Tips

This guide explains why InnoDB full‑text search is needed, how inverted indexes work, how to create and use full‑text indexes with MATCH AGAINST in natural language, boolean, and query‑expansion modes, and how to drop them safely.

DatabaseFull-Text SearchInnoDB

0 likes · 11 min read

Master MySQL InnoDB Full-Text Search: Indexes, Queries, and Advanced Tips

Architect

Feb 6, 2022 · Big Data

Elasticsearch Overview: Architecture, Core Concepts, and Performance Optimization

This article provides a comprehensive introduction to Elasticsearch, covering data types, Lucene fundamentals, inverted indexes, cluster components, node roles, shard and replica mechanisms, mapping, installation, health monitoring, write path, storage strategies, segment management, refresh and translog processes, as well as practical performance and JVM tuning tips.

Distributed SearchElasticsearchPerformance Optimization

0 likes · 37 min read

Top Architect

Feb 6, 2022 · Databases

Understanding MySQL InnoDB Full‑Text Search and Inverted Index

This article explains why InnoDB full‑text search is needed for keyword‑based queries, introduces inverted index structures, shows how to create, use, and delete full‑text indexes in MySQL, and demonstrates natural language, boolean, and query‑expansion search modes with practical SQL examples.

DatabaseFull-Text SearchMySQL

0 likes · 15 min read

Java Backend Technology

Jan 18, 2022 · Databases

Master MySQL Full-Text Search: Inverted Indexes, Query Modes, and Optimization

This article explains how InnoDB implements full-text search using inverted indexes, shows how to create and use full-text indexes with various query modes—including natural language, boolean, and query expansion—covers relevance calculation, stopwords, token size limits, and demonstrates how to delete indexes, all illustrated with SQL examples and diagrams.

Boolean ModeFull-Text SearchMySQL

0 likes · 13 min read

Master MySQL Full-Text Search: Inverted Indexes, Query Modes, and Optimization

Open Source Linux

Dec 29, 2021 · Backend Development

How Elasticsearch Achieves Lightning‑Fast Search with Inverted Indexes

This article explains how Elasticsearch uses inverted indexes, term dictionaries, and compression techniques like FOR and Roaring Bitmaps to enable rapid full‑text search, contrasting its approach with traditional relational databases and offering practical indexing tips for large‑scale applications.

ElasticsearchLucenePostings List

0 likes · 15 min read

政采云技术

Dec 9, 2021 · Fundamentals

Understanding Elasticsearch Inverted Index and Its Optimization Techniques

This article explains how Elasticsearch leverages Lucene's inverted index, term dictionaries, and posting list compression methods such as FST and Frame‑of‑Reference to achieve fast, accurate, and tolerant search performance, while also discussing practical implementation details and examples.

ElasticsearchFSTPosting List Compression

0 likes · 11 min read

Understanding Elasticsearch Inverted Index and Its Optimization Techniques

Open Source Linux

Dec 8, 2021 · Backend Development

How Elasticsearch Uses Lucene’s Inverted Index for Lightning‑Fast Search

This article explains how Elasticsearch leverages Lucene’s inverted index, detailing the structure of term dictionaries, postings lists, compression techniques like Frame‑of‑Reference and Roaring Bitmaps, and query optimizations such as filter caches and skip‑list intersections to achieve fast, memory‑efficient search.

ElasticsearchLuceneSearch Engine

0 likes · 19 min read

How Elasticsearch Uses Lucene’s Inverted Index for Lightning‑Fast Search

Efficient Ops

Dec 2, 2021 · Backend Development

How Elasticsearch Achieves Lightning‑Fast Search with Inverted Indexes

This article explains how Elasticsearch uses inverted indexes, term dictionaries, and compression techniques such as Frame‑of‑Reference and Roaring Bitmaps to deliver rapid full‑text search, efficient storage, and fast union queries, while also offering practical indexing tips for production use.

LucenePostings ListRoaring Bitmap

0 likes · 15 min read

Java Interview Crash Guide

Nov 11, 2021 · Big Data

How Elasticsearch Achieves Lightning‑Fast Search with Inverted Indexes and Compression

This article explains how Elasticsearch uses inverted indexes, term dictionaries, and advanced compression techniques like Frame of Reference and Roaring Bitmaps to enable rapid, scalable search over massive datasets, contrasting its approach with traditional relational database queries and detailing practical optimization tips.

ElasticsearchLucenePostings List

0 likes · 16 min read

How Elasticsearch Achieves Lightning‑Fast Search with Inverted Indexes and Compression

Top Architect

Nov 7, 2021 · Backend Development

Understanding Near Real‑Time Search and Core Architecture of ElasticSearch

This article explains how ElasticSearch achieves near real‑time search by using immutable inverted indexes, segment merging, shard distribution, and a translog for durability, while also offering practical guidance on how to study the system effectively.

Distributed SearchSegmentinverted index

0 likes · 8 min read

Understanding Near Real‑Time Search and Core Architecture of ElasticSearch

Programmer DD

Nov 4, 2021 · Backend Development

How Elasticsearch Achieves Near Real-Time Search: Core Mechanisms Explained

This article explains how Elasticsearch implements near real-time search by using immutable inverted indexes, segment merging, shard distribution, and a translog for durability, detailing the challenges of persistence, disk I/O, and data recovery in a distributed environment.

Data persistenceElasticsearchNear Real-Time Search

0 likes · 9 min read

How Elasticsearch Achieves Near Real-Time Search: Core Mechanisms Explained

21CTO

Nov 3, 2021 · Databases

Master MySQL Full‑Text Search: Index Creation, Modes, and Internals

This tutorial explains how MySQL implements full‑text search, covering the creation of full‑text indexes (including Chinese ngram support), the three query modes (natural language, boolean, and query expansion), relevance ranking, underlying inverted‑index structures, cache handling, and common DML operations.

Boolean ModeDatabaseFull-Text Search

0 likes · 14 min read

Master MySQL Full‑Text Search: Index Creation, Modes, and Internals

21CTO

Oct 26, 2021 · Databases

How ElasticSearch Delivers Near Real-Time Search with Immutable Indexes

ElasticSearch achieves near real-time search by building immutable inverted indexes (segments), using incremental indexing, logical deletions, background segment merging, and a write-ahead translog to ensure durability, while distributing shards across nodes to balance load and maintain data consistency.

Near Real-Time SearchSegment Merginginverted index

0 likes · 8 min read

How ElasticSearch Delivers Near Real-Time Search with Immutable Indexes

MaGe Linux Operations

Oct 13, 2021 · Backend Development

How Elasticsearch Achieves Near Real-Time Search: Core Techniques Explained

This article explains how Elasticsearch implements near real-time search by using immutable inverted indexes, segment merging, sharding, and a translog for durability, detailing the challenges and solutions behind its distributed full‑text search architecture.

ElasticsearchLuceneNear Real-Time Search

0 likes · 9 min read

How Elasticsearch Achieves Near Real-Time Search: Core Techniques Explained

21CTO

Oct 9, 2021 · Backend Development

ElasticSearch Near Real-Time Search: Immutable Indexes, Segments, and Translog

This article explores how ElasticSearch delivers near real‑time search by leveraging immutable inverted indexes, segment merging, shard distribution, and a write‑ahead translog, detailing the challenges of persistence, disk I/O, and data loss prevention in a distributed environment.

Near Real-Time SearchSegment Mergingdistributed systems

0 likes · 9 min read

ElasticSearch Near Real-Time Search: Immutable Indexes, Segments, and Translog

IT Architects Alliance

Oct 6, 2021 · Big Data

Understanding Elasticsearch Inverted Index and Efficient Search Retrieval

This article explains how Elasticsearch uses inverted indexes, term dictionaries, and postings lists—along with compression techniques like Frame of Reference and Roaring Bitmaps—to achieve fast, memory‑efficient search queries, and provides practical tips for optimizing indexing and query performance.

ElasticsearchLucenePostings List

0 likes · 14 min read

Understanding Elasticsearch Inverted Index and Efficient Search Retrieval

IT Architects Alliance

Sep 29, 2021 · Databases

Understanding Elasticsearch Inverted Index: Fast Retrieval, Compression, and Query Techniques

This article explains how Elasticsearch uses inverted index structures—including term dictionaries, term indexes, and postings lists—combined with compression methods like Frame‑of‑Reference and Roaring Bitmaps to achieve fast search, efficient storage, and effective union queries compared to traditional relational databases.

ElasticsearchLucenePostings List

0 likes · 14 min read

Understanding Elasticsearch Inverted Index: Fast Retrieval, Compression, and Query Techniques

Java Interview Crash Guide

Sep 23, 2021 · Fundamentals

How Elasticsearch Writes, Reads, and Searches Data: Deep Dive into ES Internals

This article explains Elasticsearch's core mechanisms for indexing, querying, and searching data, covering the roles of coordinating nodes, primary and replica shards, refresh cycles, translog, commit/flush processes, and the underlying Lucene inverted index.

ElasticsearchLuceneSearch Engine

0 likes · 13 min read

How Elasticsearch Writes, Reads, and Searches Data: Deep Dive into ES Internals

Efficient Ops

Sep 21, 2021 · Backend Development

How Elasticsearch Achieves Lightning‑Fast Search with Inverted Indexes and Compression

This article explains how Elasticsearch uses inverted indexes, term dictionaries, postings lists, and compression techniques like Frame‑of‑Reference and Roaring Bitmaps to deliver rapid, memory‑efficient search, and provides practical tips for indexing and query optimization.

Postings ListRoaring BitmapSearch Engine

0 likes · 15 min read

21CTO

Sep 12, 2021 · Big Data

How Elasticsearch Achieves Lightning‑Fast Search with Inverted Indexes and Compression

This article explains how Elasticsearch uses inverted indexes, term dictionaries, FST structures, and compression techniques like FOR and Roaring Bitmaps to dramatically speed up search queries over massive datasets while minimizing memory and disk usage.

LuceneSearch Enginecompression

0 likes · 14 min read

IT Architects Alliance

Sep 5, 2021 · Databases

Understanding Elasticsearch Fast Retrieval: Inverted Index, Postings List, and Compression Techniques

This article explains how Elasticsearch achieves rapid search by using inverted indexes, detailing the structure of posting lists, term dictionaries, compression methods like Frame‑of‑Reference and Roaring Bitmaps, and how these techniques enable efficient union queries and filter caching.

ElasticsearchLucenePostings List

0 likes · 14 min read

Understanding Elasticsearch Fast Retrieval: Inverted Index, Postings List, and Compression Techniques

Architecture Digest

Sep 5, 2021 · Databases

How Elasticsearch Achieves Fast Retrieval: Inverted Index, Term Dictionary, and Compression Techniques

This article explains how Elasticsearch leverages Lucene's inverted index, term dictionary, term index, and compression methods such as Frame‑of‑Reference and Roaring Bitmaps to enable rapid search, efficient storage, and fast set operations for large‑scale data retrieval.

ElasticsearchLuceneRoaring Bitmap

0 likes · 16 min read

How Elasticsearch Achieves Fast Retrieval: Inverted Index, Term Dictionary, and Compression Techniques

Architect

Sep 4, 2021 · Databases

Understanding Elasticsearch Fast Retrieval: Inverted Index, Term Dictionary, and Compression Techniques

This article explains how Elasticsearch achieves fast data retrieval by comparing it with traditional relational databases, detailing search engine fundamentals, the structure of Lucene's inverted index—including term dictionaries, postings lists, and term indexes—and the compression techniques such as Frame of Reference and Roaring Bitmaps that optimize storage and query performance.

ElasticsearchLucenePostings List

0 likes · 14 min read

Understanding Elasticsearch Fast Retrieval: Inverted Index, Term Dictionary, and Compression Techniques

Java Interview Crash Guide

Jul 2, 2021 · Databases

How Elasticsearch Achieves Lightning‑Fast Search with Inverted Indexes

This article explains how Elasticsearch leverages inverted indexes, term dictionaries, and advanced compression techniques like Frame of Reference and Roaring Bitmaps to enable rapid full‑text search, covering the underlying concepts, data structures, and query optimizations essential for high‑performance search applications.

ElasticsearchLucenePostings List

0 likes · 17 min read

Selected Java Interview Questions

Jul 1, 2021 · Fundamentals

Understanding Elasticsearch Inverted Index: Posting Lists, Term Dictionary, and Compression Techniques

This article explains how Elasticsearch achieves fast search by using inverted indexes, detailing the structure of posting lists, term dictionaries, term indexes, and compression methods such as Frame of Reference and Roaring Bitmaps, as well as techniques for efficient union and intersection queries.

ElasticsearchPostings ListSearch Engine

0 likes · 16 min read

Understanding Elasticsearch Inverted Index: Posting Lists, Term Dictionary, and Compression Techniques

Efficient Ops

Jun 23, 2021 · Backend Development

Why Can’t Elasticsearch Find My Logs? Uncovering Full‑Text Search Pitfalls and Tokenizer Tweaks

This article explains why large‑scale Elasticsearch clusters may miss log entries during keyword searches, dives into the fundamentals of inverted indexes and tokenization, and demonstrates practical index‑time and query‑time tokenizer optimizations—including custom analyzers for English and Chinese—to dramatically improve search recall and precision.

ElasticsearchFull-Text Searchinverted index

0 likes · 13 min read

Why Can’t Elasticsearch Find My Logs? Uncovering Full‑Text Search Pitfalls and Tokenizer Tweaks

Java High-Performance Architecture

Jun 8, 2021 · Big Data

How Elasticsearch Writes, Reads, and Searches Data: Deep Dive into ES Internals

This article explains Elasticsearch's write, read, and search mechanisms, the role of coordinating nodes, primary and replica shards, refresh and commit cycles, Lucene's inverted index, and how data becomes searchable in near‑real‑time.

ElasticsearchLuceneRead Process

0 likes · 12 min read

dbaplus Community

Apr 4, 2021 · Databases

Elasticsearch vs MySQL: How Inverted Indexes Enable Faster Complex Queries

This article explains why Elasticsearch handles complex conditional queries more efficiently than MySQL by using inverted indexes, term dictionaries, skip‑list and roaring bitmap structures, while also discussing the trade‑offs such as slower write performance.

Complex QueriesElasticsearchMySQL

0 likes · 12 min read

Elasticsearch vs MySQL: How Inverted Indexes Enable Faster Complex Queries

58 Tech

Mar 8, 2021 · Fundamentals

Real‑Time Inverted Index Update Techniques in the 58 Search Engine

This article explains how the 58 search engine achieves millisecond‑level real‑time inverted‑index updates by redesigning the underlying data structures, combining static and dynamic indexing, using a chain‑array hybrid, segment merging strategies, and lock‑free read‑write concurrency while maintaining search performance.

Data StructuresReal-Time UpdateSearch Engine

0 likes · 18 min read

Real‑Time Inverted Index Update Techniques in the 58 Search Engine

Architecture Digest

Feb 18, 2021 · Big Data

Elasticsearch Write, Read, and Search Processes: Underlying Mechanisms and Lucene Inverted Index

This article explains how Elasticsearch handles data ingestion, retrieval, and full‑text search by describing the roles of coordinating, primary, and replica nodes, the refresh‑commit‑flush cycle, segment files, translog, and the Lucene‑based inverted index that powers its near‑real‑time capabilities.

ElasticsearchLuceneRead Process

0 likes · 11 min read

Elasticsearch Write, Read, and Search Processes: Underlying Mechanisms and Lucene Inverted Index

Qunar Tech Salon

Feb 4, 2021 · Fundamentals

Understanding Lucene Inverted Index: Principles and Implementation

This article explains the concept of inverted indexes, their role in full‑text search, and provides a detailed overview of how Apache Lucene implements inverted indexing, including term dictionaries, posting lists, query processing, and numeric handling with BKDTree.

BKDTreeLucenePosting List

0 likes · 15 min read

Understanding Lucene Inverted Index: Principles and Implementation

Programmer DD

Jan 28, 2021 · Databases

How Elasticsearch Writes, Reads, and Searches Data: Inside the Engine

This article explains Elasticsearch's internal mechanisms for writing, reading, and searching data, covering the roles of coordinating nodes, primary and replica shards, buffers, translog, segment files, refresh cycles, commit and flush operations, as well as Lucene's inverted index and how deletions and updates are handled.

ElasticsearchLuceneSegment

0 likes · 10 min read

How Elasticsearch Writes, Reads, and Searches Data: Inside the Engine

Big Data Technology & Architecture

Dec 24, 2020 · Big Data

Common Techniques for Processing Massive Data Sets

This article summarizes a range of practical methods—including Bloom filters, hashing, bit‑maps, heaps, bucket partitioning, database indexes, inverted indexes, external sorting, trie trees, and MapReduce—that are commonly used to handle, deduplicate, and query extremely large data volumes in big‑data applications.

Big DataHashingexternal sort

0 likes · 11 min read

Common Techniques for Processing Massive Data Sets

DeWu Technology

Dec 4, 2020 · Fundamentals

Introduction to Search Engine Technology and Information Retrieval

The article surveys core search‑engine technology—document hierarchy, flat and vertical inverted indexes, query operators for building and merging score lists, and ranking models from Boolean and BM25 to language‑model approaches like Indri—providing a foundational overview of information retrieval.

BM25Information RetrievalSearch Engine

0 likes · 14 min read

Introduction to Search Engine Technology and Information Retrieval

vivo Internet Technology

Nov 25, 2020 · Databases

Understanding Elasticsearch Architecture, Indexing, and Storage Mechanisms

Elasticsearch combines Lucene’s inverted index with a distributed cluster of master‑eligible, data, and coordinating nodes, using Zen discovery for node election and split‑brain prevention, while indexing writes to primary shards, replicating to replicas, storing immutable segments that are periodically merged for efficient search.

Elasticsearchclusterdistributed architecture

0 likes · 19 min read

Understanding Elasticsearch Architecture, Indexing, and Storage Mechanisms

Architecture Digest

Oct 1, 2020 · Big Data

Elasticsearch Overview: Architecture, Core Concepts, and Performance Optimization

This article provides a comprehensive introduction to Elasticsearch, covering data types, the role of Lucene, cluster architecture, node roles, discovery mechanisms, shard and replica management, mapping, installation, health monitoring, indexing workflow, storage internals, refresh and translog processes, segment merging, and practical performance and JVM tuning tips.

ElasticsearchPerformance OptimizationSearch Engine

0 likes · 35 min read

Selected Java Interview Questions

Aug 20, 2020 · Big Data

Elasticsearch Write, Read, and Search Processes: Underlying Mechanisms and Lucene Inverted Index

This article explains Elasticsearch’s write, read, and search workflows, detailing the roles of coordinating nodes, primary and replica shards, refresh and commit cycles, translog handling, and the underlying Lucene inverted index mechanism.

ElasticsearchLuceneSearch Engine

0 likes · 11 min read

Swan Home Tech Team

Jul 13, 2020 · Backend Development

Design and Evolution of the DaJia App Search System

This article explains the motivations, requirements, and technical design of the DaJia app's search system, compares relational databases with Lucene‑based solutions, describes the inverted index mechanism, outlines common search workflows, and details the system's three iterative development phases and future improvement plans.

ElasticsearchInformation RetrievalLucene

0 likes · 12 min read

Design and Evolution of the DaJia App Search System

Programmer DD

Jul 10, 2020 · Fundamentals

How Search Engines Work: Inside Document and Query Processing

This article explains the core components of a search engine—document processing, query processing, and matching—detailing each step from indexing to ranking, and discusses the document features that influence relevance, providing a comprehensive overview of information retrieval fundamentals.

Document processingInformation RetrievalQuery Processing

0 likes · 20 min read

How Search Engines Work: Inside Document and Query Processing

Java High-Performance Architecture

Jan 20, 2020 · Fundamentals

How Inverted Indexes Power Fast Full-Text Search

This article explains what an inverted index is, why it’s essential for full‑text search, how it is built and queried, and common token transformations such as stop‑word removal, lemmatization, and stemming.

Full-Text SearchInformation RetrievalSearch Engine

0 likes · 4 min read

How Inverted Indexes Power Fast Full-Text Search

Tencent Cloud Developer

Nov 26, 2019 · Backend Development

TurboSearch: Tencent AI Lab's Next-Generation Large-Scale Search System

TurboSearch is Tencent AI Lab's next-generation large-scale search system, delivering distributed massive indexing, high-performance parallel retrieval, multi-granularity and multi-modal vector indexing, private Docker deployment, integrated NLP query analysis, extensible plugins, and robust operations for massive data and diverse search scenarios.

NLPSearch EngineTencent AI Lab

0 likes · 14 min read

TurboSearch: Tencent AI Lab's Next-Generation Large-Scale Search System

Big Data Technology Architecture

Aug 9, 2019 · Databases

Understanding Elasticsearch: Architecture, Core Concepts, and Performance Optimization

This article provides a comprehensive overview of Elasticsearch, covering its role in handling structured and unstructured data, core concepts such as Lucene, inverted indexes, clusters, shards, replicas, mapping, indexing processes, storage mechanisms, and practical performance tuning tips for deployment.

ElasticsearchPerformance TuningSearch Engine

0 likes · 35 min read

Understanding Elasticsearch: Architecture, Core Concepts, and Performance Optimization

Efficient Ops

Apr 21, 2019 · Backend Development

Mastering Elasticsearch: From Inverted Index to Distributed Search

This article walks through the fundamentals of search engines, explaining inverted indexes, the explosion of index size, core Elasticsearch concepts, its distributed architecture, and how it powers the ELK stack for log analysis, all illustrated with clear diagrams and examples.

ELKElasticsearchSearch Engine

0 likes · 6 min read

Mastering Elasticsearch: From Inverted Index to Distributed Search

58 Tech

Mar 7, 2019 · Big Data

In-Memory Inverted Index Compression Algorithms: Overview and MILC Optimization for High‑Performance Search

This article reviews major in‑memory inverted index compression techniques such as PForDelta, PEF, and MILC, explains their principles and trade‑offs, and details practical optimizations applied at 58.com to achieve query performance comparable to uncompressed indexes while reducing memory usage by about 35 percent.

Big DataMILCSearch Engine

0 likes · 17 min read

In-Memory Inverted Index Compression Algorithms: Overview and MILC Optimization for High‑Performance Search

MaGe Linux Operations

Jan 23, 2019 · Big Data

How Bloom Filters Power Fast Big Data Searches with Python

This tutorial walks through building a simple Python search engine for big data, covering Bloom filter basics, tokenization with major and minor segmentation, inverted index creation, and implementing both simple and complex (AND/OR) queries, complete with code examples and visual illustrations.

AND/OR queriesBig DataPython

0 likes · 15 min read

How Bloom Filters Power Fast Big Data Searches with Python

Dada Group Technology

Dec 27, 2018 · Backend Development

Evolution and Architecture of JD Daojia Location Service

This article explains how JD Daojia’s location system has evolved from simple distance calculations to a high‑performance, index‑driven backend service, detailing its core responsibilities, architectural stages, challenges, and the optimizations that improved accuracy, scalability, and resource usage.

GISPerformance Optimizationdistance-calculation

0 likes · 10 min read

Evolution and Architecture of JD Daojia Location Service

MaGe Linux Operations

Nov 27, 2018 · Big Data

How a Simple Python Bloom Filter Powers Fast Big Data Search

This article demonstrates how to implement a basic Bloom filter, tokenization, and inverted index in Python to illustrate the core principles of big‑data search, including fast negative lookups, term segmentation, and support for AND/OR queries.

AND/OR queriesTokenizationbig data search

0 likes · 13 min read

How a Simple Python Bloom Filter Powers Fast Big Data Search

Alibaba Cloud Developer

Apr 19, 2018 · Databases

How HiTSDB’s New Streaming Aggregation Engine Boosts Query Speed 10×

This article examines the architectural redesign of Alibaba's High‑Performance Time Series Database (HiTSDB), covering storage model changes, inverted‑index enhancements, a pipelined streaming aggregation engine, data‑migration strategies, and performance benchmarks that together deliver over tenfold query speed improvements.

Data MigrationHiTSDBPerformance Optimization

0 likes · 24 min read

How HiTSDB’s New Streaming Aggregation Engine Boosts Query Speed 10×

Meituan Technology Team

Mar 22, 2018 · Big Data

High-Performance User Behavior Analysis Solution for Massive Data

The paper describes a high‑performance user‑behavior analysis system that processes hundreds of billions of daily logs for Meituan‑Dianping, using an inverted‑index structure with bitmap UUID sets and timestamp sequences, combined with Spark, Spring and Alluxio optimizations to cut query times from hours to under five seconds.

Big DataDistributed ComputingOLAP analysis

0 likes · 14 min read

High-Performance User Behavior Analysis Solution for Massive Data

MaGe Linux Operations

Mar 5, 2018 · Big Data

How a Simple Bloom Filter Powers Fast Big Data Searches in Python

This article demonstrates how to build a basic big‑data search engine in Python using a Bloom filter, tokenisation, and an inverted index, showing step‑by‑step code examples that illustrate the core principles behind efficient data lookup.

Pythonbig data searchbloom filter

0 likes · 14 min read

How a Simple Bloom Filter Powers Fast Big Data Searches in Python

Architecture Digest

Feb 1, 2018 · Fundamentals

How Search Engines Work: Building Inverted Indexes

This article explains the core of search engine technology by describing what an inverted index is, how it is built using single‑pass memory and multi‑way merge methods, how indexes can be partitioned and incrementally updated, and how Hadoop can be used for large‑scale indexing.

Big DataHadoopIndexing

0 likes · 10 min read