Tagged articles
297 articles
Page 2 of 3
Top Architect
Top Architect
Oct 19, 2022 · Big Data

Elasticsearch Architecture Overview and Core Concepts

This article provides a comprehensive overview of Elasticsearch, covering data types, Lucene fundamentals, cluster architecture, shard allocation, indexing mechanisms, storage strategies, refresh and translog processes, segment merging, performance tuning, and JVM optimization for building scalable, near‑real‑time search solutions.

Big DataClusterElasticsearch
0 likes · 37 min read
Elasticsearch Architecture Overview and Core Concepts
DataFunTalk
DataFunTalk
Oct 16, 2022 · Artificial Intelligence

Query Understanding and Processing in E‑commerce Search Systems

This article explains the end‑to‑end pipeline of query understanding for e‑commerce search, covering preprocessing, segmentation, spell correction, normalization, and expansion, and discusses both academic research and industry implementations with examples and references.

Query ProcessingQuery Rewritingnatural language processing
0 likes · 13 min read
Query Understanding and Processing in E‑commerce Search Systems
Top Architect
Top Architect
Oct 14, 2022 · Databases

Elasticsearch Overview: Architecture, Core Concepts, and Performance Optimization

This article provides a comprehensive introduction to Elasticsearch, covering data types, Lucene fundamentals, cluster architecture, node roles, shard and replica mechanisms, mapping, basic usage, installation steps, health monitoring, indexing workflow, storage strategies, refresh and translog handling, segment merging, and practical performance tuning tips.

ClusterElasticsearchindexing
0 likes · 36 min read
Elasticsearch Overview: Architecture, Core Concepts, and Performance Optimization
MaGe Linux Operations
MaGe Linux Operations
Oct 12, 2022 · Databases

Mastering Elasticsearch Index Lifecycle Management with Docker Compose

This guide explains how to deploy an Elasticsearch cluster with hot, warm, and cold nodes, configure Index Lifecycle Management policies, adjust polling intervals, create indices with rollover aliases, add documents, and observe automatic phase transitions and deletions, all using Docker Compose.

Docker ComposeILMIndex Lifecycle Management
0 likes · 9 min read
Mastering Elasticsearch Index Lifecycle Management with Docker Compose
Xianyu Technology
Xianyu Technology
Sep 27, 2022 · Backend Development

Design and Real-Time Optimization of Xianyu E‑commerce Search System

The article details Xianyu’s end‑to‑end product‑search architecture—covering tokenization, indexing, online request flow, offline index building, multi‑datacenter active‑active deployment, and supporting ad and debugging systems—and explains how expanding searcher capacity, separating query engines, grading updates, and diffusing auxiliary‑table writes together reduced latency from hours to near‑zero, enabling real‑time search.

ScalabilitySystem Architecturee‑commerce
0 likes · 11 min read
Design and Real-Time Optimization of Xianyu E‑commerce Search System
Youzan Coder
Youzan Coder
Sep 5, 2022 · Artificial Intelligence

Inside Youzan’s Query Parser: Architecture, Plugins, and Real‑World Impact

This article explains the role of Youzan’s Query Parser (QP) in search, walks through its overall and layered architecture, details each algorithmic plugin—from preprocessing to synonym handling—and shows concrete code examples and results that improve search relevance across multiple retail scenarios.

NLPSystem ArchitectureYouzan
0 likes · 12 min read
Inside Youzan’s Query Parser: Architecture, Plugins, and Real‑World Impact
Selected Java Interview Questions
Selected Java Interview Questions
Aug 24, 2022 · Backend Development

Understanding ElasticSearch: Distributed Search, Full‑Text Retrieval, and Inverted Index

This article explains what search is, why traditional databases struggle with full‑text queries, introduces the concepts of inverted indexes and Lucene, and shows how ElasticSearch combines distributed architecture, real‑time analytics, and powerful search features to solve these problems.

Distributed SystemsFull‑Text Searchinverted index
0 likes · 8 min read
Understanding ElasticSearch: Distributed Search, Full‑Text Retrieval, and Inverted Index
Open Source Linux
Open Source Linux
Aug 21, 2022 · Backend Development

How Elasticsearch Achieves Lightning‑Fast Search with Inverted Indexes

This article explains how Elasticsearch leverages inverted indexes, term dictionaries, and compression techniques such as Frame‑of‑Reference and Roaring Bitmaps to enable rapid full‑text search, detailing the underlying data structures, query processing, and practical indexing tips for efficient backend search implementations.

ElasticsearchPostings Listbackend-development
0 likes · 17 min read
How Elasticsearch Achieves Lightning‑Fast Search with Inverted Indexes
Efficient Ops
Efficient Ops
Aug 16, 2022 · Backend Development

How Elasticsearch Delivers Lightning‑Fast Search with Inverted Indexes and Compression

This article explains how Elasticsearch uses inverted indexes, term dictionaries, and advanced compression techniques such as Frame‑of‑Reference and Roaring Bitmaps to achieve rapid search performance while minimizing memory and disk usage, and it also covers practical indexing tips for production use.

ElasticsearchPostings Listbackend-development
0 likes · 15 min read
How Elasticsearch Delivers Lightning‑Fast Search with Inverted Indexes and Compression
dbaplus Community
dbaplus Community
Jul 26, 2022 · Backend Development

Unlocking Elasticsearch: Core Concepts, Architecture, and Performance Tips

This comprehensive guide explains Elasticsearch’s role in searching structured and unstructured data, covers Lucene’s inverted index, details cluster components, shard and replica mechanics, mapping types, installation steps, indexing workflow, storage strategies, and practical performance optimizations for real‑world deployments.

Distributed SystemsElasticsearchindexing
0 likes · 39 min read
Unlocking Elasticsearch: Core Concepts, Architecture, and Performance Tips
21CTO
21CTO
Jul 20, 2022 · Backend Development

Boost Your Python Projects with Whoosh: A Lightweight Search Engine Tutorial

This article introduces the lightweight pure‑Python search library Whoosh, outlines its key features, demonstrates how to define a schema, build an index from a CSV of Chinese poems, and perform full‑text queries with example code, making it ideal for small search projects.

Full‑Text Searchexampleindexing
0 likes · 7 min read
Boost Your Python Projects with Whoosh: A Lightweight Search Engine Tutorial
IT Architects Alliance
IT Architects Alliance
Jul 14, 2022 · Big Data

Elasticsearch Overview: Core Concepts, Architecture, and Practical Usage

This article provides a comprehensive introduction to Elasticsearch, covering data types, Lucene fundamentals, cluster architecture, node roles, shard and replica mechanisms, mapping, installation, health monitoring, indexing principles, storage strategies, refresh and translog handling, segment merging, performance tuning, and JVM optimization for large‑scale search applications.

Big DataElasticsearchindexing
0 likes · 35 min read
Elasticsearch Overview: Core Concepts, Architecture, and Practical Usage
Top Architect
Top Architect
Jul 14, 2022 · Big Data

A Comprehensive Introduction to Elasticsearch: Architecture, Core Concepts, and Practical Usage

This article provides a detailed overview of Elasticsearch, covering its data model, Lucene foundation, cluster architecture, shard and replica mechanisms, index mapping, installation steps, health monitoring, write and storage processes, segment management, and performance tuning techniques for large‑scale search applications.

Big DataElasticsearchindexing
0 likes · 35 min read
A Comprehensive Introduction to Elasticsearch: Architecture, Core Concepts, and Practical Usage
Hulu Beijing
Hulu Beijing
May 26, 2022 · Artificial Intelligence

Why Vector Retrieval Outperforms Keyword Search for Personalized Video Discovery

This article explains how modern video platforms combine traditional keyword retrieval with deep‑learning‑based vector retrieval, detailing model architectures, attention mechanisms, personalization features, offline experiments, and online A/B results that show significant improvements in recall, relevance, and user experience.

Deep LearningVector Retrievalinformation retrieval
0 likes · 18 min read
Why Vector Retrieval Outperforms Keyword Search for Personalized Video Discovery
Top Architect
Top Architect
May 20, 2022 · Big Data

Step-by-Step Guide to Deploying an Elasticsearch Cluster with Docker on CentOS

This article provides a comprehensive tutorial on why to use Elasticsearch, how to pull the Docker image, set up data directories, configure cluster and node settings, adjust system limits, launch three Elasticsearch containers, and verify the cluster using REST APIs and the elasticsearch‑head UI.

CentOSClusterElasticsearch
0 likes · 13 min read
Step-by-Step Guide to Deploying an Elasticsearch Cluster with Docker on CentOS
Top Architect
Top Architect
May 4, 2022 · Big Data

Elasticsearch Overview: Architecture, Core Concepts, and Performance Optimization

This article provides a comprehensive introduction to Elasticsearch, covering data types, Lucene fundamentals, cluster architecture, node roles, shard and replica mechanisms, mapping, basic usage, health monitoring, indexing workflow, storage strategies, and practical performance tuning techniques.

Elasticsearchindexingperformance optimization
0 likes · 36 min read
Elasticsearch Overview: Architecture, Core Concepts, and Performance Optimization
HelloTech
HelloTech
Apr 25, 2022 · Big Data

Analyzing and Optimizing Slow Elasticsearch Queries in a Shared Cluster

In a shared Elasticsearch cluster, the team used slow‑log analysis to pinpoint costly queries caused by unnecessary fuzzy matches and integer‑mapped low‑cardinality fields, then optimized them by converting matches to filters and remapping those fields to keyword, re‑indexing, which cut latency from over 100 ms to under 10 ms and eliminated slow‑query alerts.

BackendElasticsearchperformance optimization
0 likes · 10 min read
Analyzing and Optimizing Slow Elasticsearch Queries in a Shared Cluster
Su San Talks Tech
Su San Talks Tech
Apr 17, 2022 · Backend Development

How Elasticsearch Powers Real-Time Search: Core Concepts and Best Practices

This article provides a comprehensive overview of Elasticsearch, explaining its underlying Lucene technology, data modeling, cluster architecture, shard and replica mechanisms, indexing workflow, storage strategies, refresh and translog processes, as well as practical performance and JVM tuning tips for building scalable, near‑real‑time search solutions.

Elasticsearchlucenesearch engine
0 likes · 37 min read
How Elasticsearch Powers Real-Time Search: Core Concepts and Best Practices
Ctrip Technology
Ctrip Technology
Apr 14, 2022 · Cloud Computing

Migrating Ctrip Hotel Search Engine to AWS: Challenges and Practices

This article details Ctrip's experience migrating its high‑traffic hotel search service to AWS, covering microservice architecture challenges, API‑level deployment, persistence strategies with Redis, MySQL, XPipe and DRC, file sharing via S3, and lessons learned for future cloud‑native designs.

AWSPersistencesearch engine
0 likes · 9 min read
Migrating Ctrip Hotel Search Engine to AWS: Challenges and Practices
Top Architect
Top Architect
Apr 12, 2022 · Databases

Key New Features and Changes in Elasticsearch 8.0 Release

Elasticsearch 8.0 introduces major updates including 7.x REST API compatibility headers, default-enabled security with registration tokens, system index protection, a preview KNN search API using dense_vector, storage‑saving field mappings, faster geo indexing, PyTorch model support, and numerous deprecations and configuration changes across aggregations, allocation, analysis, authentication, and core infrastructure.

ElasticsearchRelease Notessearch engine
0 likes · 9 min read
Key New Features and Changes in Elasticsearch 8.0 Release
IT Architects Alliance
IT Architects Alliance
Apr 10, 2022 · Backend Development

Understanding Elasticsearch: Core Concepts, Architecture, and Performance Tips

This article provides a comprehensive overview of Elasticsearch, covering data types, Lucene fundamentals, cluster discovery, node roles, shard and replica management, mapping, installation, health monitoring, indexing mechanics, storage strategies, refresh and translog processes, segment merging, and practical performance optimizations for production deployments.

Distributed SystemsElasticsearchindexing
0 likes · 39 min read
Understanding Elasticsearch: Core Concepts, Architecture, and Performance Tips
Top Architect
Top Architect
Apr 9, 2022 · Big Data

Elasticsearch Overview: Architecture, Core Concepts, Indexing Mechanics, and Performance Optimization

This comprehensive article explains what Elasticsearch is, how it builds on Lucene to provide distributed real‑time search and analytics, covering data types, cluster components, shard routing, indexing pipelines, storage formats, segment merging, and practical performance‑tuning tips for production deployments.

Elasticsearchindexinglucene
0 likes · 36 min read
Elasticsearch Overview: Architecture, Core Concepts, Indexing Mechanics, and Performance Optimization
Java Captain
Java Captain
Apr 2, 2022 · Backend Development

Building a Site Search Engine with Java Indexing and File Parsing

This article explains how to build a site‑wide search engine using Java, covering crawling concepts, forward and inverted indexing, module design, tokenization methods, and detailed code examples for file enumeration, HTML parsing, and index generation.

Site Searchindexingparsing
0 likes · 14 min read
Building a Site Search Engine with Java Indexing and File Parsing
Top Architect
Top Architect
Mar 17, 2022 · Big Data

Elasticsearch Overview: Architecture, Core Concepts, and Performance Optimization

This comprehensive guide explains Elasticsearch fundamentals, including data types, Lucene and inverted indexes, cluster and node roles, shard and replica mechanisms, mapping, installation steps, health monitoring, write and storage processes, segment merging, and practical performance tuning tips for large‑scale search deployments.

ClusterElasticsearchinverted index
0 likes · 35 min read
Elasticsearch Overview: Architecture, Core Concepts, and Performance Optimization
macrozheng
macrozheng
Mar 2, 2022 · Backend Development

Boost Your Search Capabilities with RediSearch and RedisJSON – A Hands‑On Guide

This guide introduces RedisMod’s enhanced modules, shows how to install Redis with Docker, demonstrates native JSON storage using RedisJSON, walks through building and querying a full‑text search index with RediSearch—including Chinese support—and compares its performance against Elasticsearch.

DockerRediSearchRedisJSON
0 likes · 8 min read
Boost Your Search Capabilities with RediSearch and RedisJSON – A Hands‑On Guide
Ops Development Stories
Ops Development Stories
Feb 24, 2022 · Big Data

Master Elasticsearch: Core Concepts, APIs, Mapping, and Performance Tuning

This comprehensive guide explains Elasticsearch fundamentals—including documents, indices, nodes, clusters, REST and Document APIs, query DSL, relevance scoring, distributed architecture, real‑time indexing, search execution, pagination, scroll, aggregations, data modeling, mapping options, parent/child relationships, reindexing, and practical cluster and write/read performance optimizations.

Cluster TuningElasticsearchaggregation
0 likes · 58 min read
Master Elasticsearch: Core Concepts, APIs, Mapping, and Performance Tuning
Top Architect
Top Architect
Feb 21, 2022 · Databases

Key New Features in Elasticsearch 8.0

Elasticsearch 8.0 introduces major updates including 7.x REST API compatibility headers, default-enabled security with registration tokens, known issues on ARM/macOS, a preview KNN search API using dense_vector, storage reductions for keyword and text fields, faster geo indexing, PyTorch model support, and numerous other enhancements across aggregations, allocation, analysis, authentication, and core infrastructure.

ElasticsearchPyTorchVersion 8
0 likes · 10 min read
Key New Features in Elasticsearch 8.0
Laravel Tech Community
Laravel Tech Community
Feb 17, 2022 · Backend Development

Key New Features and Changes in Elasticsearch 8.0 Release

Elasticsearch 8.0 introduces major updates including 7.x REST API compatibility headers, default‑enabled security with registration tokens, protected system indices, a preview KNN search API, storage‑efficient keyword/match_only_text/text fields, faster indexing for geo_point and geo_shape, PyTorch model support, and numerous deprecations and enhancements across aggregations, allocation, analysis, authentication, cluster coordination, and engine components.

BackendElasticsearchindexing
0 likes · 8 min read
Key New Features and Changes in Elasticsearch 8.0 Release
MaGe Linux Operations
MaGe Linux Operations
Feb 15, 2022 · Backend Development

What’s New in Elasticsearch 8.0? Key Features, Security, and API Changes

Elasticsearch 8.0 introduces major updates including 7.x REST API compatibility headers, default-enabled security with enrollment tokens, protected system indices, a preview KNN search API, storage‑saving field encodings, faster geo indexing, and numerous deprecations and enhancements across aggregations, authentication, cluster coordination, and packaging.

API compatibilityElasticsearchindexing
0 likes · 10 min read
What’s New in Elasticsearch 8.0? Key Features, Security, and API Changes
Java High-Performance Architecture
Java High-Performance Architecture
Feb 15, 2022 · Backend Development

What’s New in Elasticsearch 8.0? Key Features, Security Enhancements, and Performance Boosts

Elasticsearch 8.0 introduces 7.x REST API compatibility headers, default‑on security features with automatic enrollment tokens, tighter protection of system indices, a preview KNN search API, storage‑saving field encodings, faster geo‑point and geo‑shape indexing, PyTorch model support, and a long list of deprecations and internal improvements.

BackendElasticsearchsearch engine
0 likes · 10 min read
What’s New in Elasticsearch 8.0? Key Features, Security Enhancements, and Performance Boosts
Top Architect
Top Architect
Feb 14, 2022 · Databases

Elasticsearch Fundamentals: Architecture, Indexing, Sharding, and Performance Optimization

This comprehensive guide explains Elasticsearch’s core concepts—including its distributed architecture, indexing process, shard routing, refresh and translog mechanisms, segment merging, and performance tuning—while providing practical examples and configuration tips for building scalable, near‑real‑time search solutions.

Elasticsearchindexingperformance tuning
0 likes · 35 min read
Elasticsearch Fundamentals: Architecture, Indexing, Sharding, and Performance Optimization
Big Data Technology & Architecture
Big Data Technology & Architecture
Feb 13, 2022 · Big Data

What's New in Elasticsearch 8.0 – Key Features and Changes

The article provides a comprehensive overview of Elasticsearch 8.0, highlighting major updates such as 7.x REST API compatibility headers, default-enabled security, system‑index protection, a new KNN search API, storage and indexing optimizations, PyTorch model support, and numerous deprecations and feature removals across the stack.

8.0APIBig Data
0 likes · 10 min read
What's New in Elasticsearch 8.0 – Key Features and Changes
Programmer DD
Programmer DD
Feb 12, 2022 · Databases

What’s New in Elasticsearch 8.0? Key Features and Migration Tips

Elasticsearch 8.0 introduces major changes such as 7.x REST API compatibility headers, default‑enabled security with registration tokens, protected system indices, a technical preview of KNN search, storage‑saving field encodings, faster geo‑point indexing, PyTorch model support for NLP, and numerous deprecations and improvements across aggregations, allocation, analysis, authentication, cluster coordination, and packaging.

APIBig DataElasticsearch
0 likes · 10 min read
What’s New in Elasticsearch 8.0? Key Features and Migration Tips
dbaplus Community
dbaplus Community
Jan 26, 2022 · Big Data

Why Does Elasticsearch Aggregate Faster with Fewer Terms? Uncover the Secrets

This article examines a real‑world Elasticsearch cluster handling hundreds of terabytes, explains why high‑cardinality aggregations can be slower, and shows how setting execution_hint=map and tuning doc_values dramatically improves aggregation performance for ultra‑high‑concurrency workloads.

Big DataData AnalyticsElasticsearch
0 likes · 12 min read
Why Does Elasticsearch Aggregate Faster with Fewer Terms? Uncover the Secrets
Java Interview Crash Guide
Java Interview Crash Guide
Jan 18, 2022 · Databases

Why We Switched from SQL Server to Elasticsearch: A Real‑World Scaling Journey

This article shares a practical experience of replacing a massive SQL Server data store with Elasticsearch, covering the motivations, architectural design, performance advantages, indexing mechanisms, data synchronization strategies, implementation details, and operational considerations for handling tens of billions of records.

ElasticsearchNoSQLdatabase scaling
0 likes · 38 min read
Why We Switched from SQL Server to Elasticsearch: A Real‑World Scaling Journey
Architecture Digest
Architecture Digest
Jan 15, 2022 · Databases

Designing and Implementing Elasticsearch for Large‑Scale Data Search and Storage

This article details the business background, technical advantages, architecture, indexing mechanisms, clustering, data synchronization strategies, API design, and performance monitoring of Elasticsearch, illustrating how it replaces costly SQL LIKE queries with a scalable, high‑performance search solution for massive user activity data.

ElasticsearchNoSQLdata synchronization
0 likes · 29 min read
Designing and Implementing Elasticsearch for Large‑Scale Data Search and Storage
Tencent Music Tech Team
Tencent Music Tech Team
Jan 4, 2022 · Big Data

Elasticsearch Fundamentals: Architecture, Indexing, Query DSL and Search Mechanics

Elasticsearch is a distributed, schemaless search engine built on Lucene that stores JSON documents in sharded indexes, uses immutable segments and merges, provides a flexible Query DSL with aggregations and relevance scoring, and executes distributed query‑then‑fetch searches with features like scrolling, optimistic locking, and zero‑downtime reindexing.

AnalyzersElasticsearchQuery DSL
0 likes · 26 min read
Elasticsearch Fundamentals: Architecture, Indexing, Query DSL and Search Mechanics
Open Source Linux
Open Source Linux
Dec 29, 2021 · Backend Development

How Elasticsearch Achieves Lightning‑Fast Search with Inverted Indexes

This article explains how Elasticsearch uses inverted indexes, term dictionaries, and compression techniques like FOR and Roaring Bitmaps to enable rapid full‑text search, contrasting its approach with traditional relational databases and offering practical indexing tips for large‑scale applications.

ElasticsearchPostings Listcompression
0 likes · 15 min read
How Elasticsearch Achieves Lightning‑Fast Search with Inverted Indexes
Programmer DD
Programmer DD
Dec 28, 2021 · Databases

RedisJSON vs MongoDB & ElasticSearch: Why It Outperforms by Up to 500×

Benchmark tests reveal that RedisJSON, powered by RediSearch, dramatically outperforms MongoDB and ElasticSearch across isolated reads and writes, mixed workloads, and latency metrics, delivering up to 500‑fold speed gains, lower latency, and higher throughput, making it a compelling choice for modern data‑intensive applications.

NoSQLRedisJSONYCSB
0 likes · 14 min read
RedisJSON vs MongoDB & ElasticSearch: Why It Outperforms by Up to 500×
Top Architect
Top Architect
Dec 12, 2021 · Databases

Elasticsearch Cluster and Data Layer Architecture Overview

This article explains Elasticsearch’s cluster and data layer architecture, covering nodes, indices, shards, replicas, mixed and tiered deployment models, storage options, and the trade‑offs of different distributed system designs for scalable systems.

Cluster ArchitectureElasticsearchreplica
0 likes · 14 min read
Elasticsearch Cluster and Data Layer Architecture Overview
Laravel Tech Community
Laravel Tech Community
Dec 9, 2021 · Backend Development

Apache Lucene 9.0 Released – New Features and Improvements

Apache Lucene 9.0, a high‑performance Java full‑text search library, introduces high‑dimensional vector indexing, new language analyzers, faster faceting and sorting, updated file formats, and several performance optimizations, providing developers with a richer, more efficient search toolkit.

Apache LuceneFull‑Text SearchVector Search
0 likes · 3 min read
Apache Lucene 9.0 Released – New Features and Improvements
政采云技术
政采云技术
Dec 9, 2021 · Fundamentals

Understanding Elasticsearch Inverted Index and Its Optimization Techniques

This article explains how Elasticsearch leverages Lucene's inverted index, term dictionaries, and posting list compression methods such as FST and Frame‑of‑Reference to achieve fast, accurate, and tolerant search performance, while also discussing practical implementation details and examples.

ElasticsearchFSTPosting List Compression
0 likes · 11 min read
Understanding Elasticsearch Inverted Index and Its Optimization Techniques
Open Source Linux
Open Source Linux
Dec 8, 2021 · Backend Development

How Elasticsearch Uses Lucene’s Inverted Index for Lightning‑Fast Search

This article explains how Elasticsearch leverages Lucene’s inverted index, detailing the structure of term dictionaries, postings lists, compression techniques like Frame‑of‑Reference and Roaring Bitmaps, and query optimizations such as filter caches and skip‑list intersections to achieve fast, memory‑efficient search.

Elasticsearchcompressioninverted index
0 likes · 19 min read
How Elasticsearch Uses Lucene’s Inverted Index for Lightning‑Fast Search
Efficient Ops
Efficient Ops
Dec 2, 2021 · Backend Development

How Elasticsearch Achieves Lightning‑Fast Search with Inverted Indexes

This article explains how Elasticsearch uses inverted indexes, term dictionaries, and compression techniques such as Frame‑of‑Reference and Roaring Bitmaps to deliver rapid full‑text search, efficient storage, and fast union queries, while also offering practical indexing tips for production use.

Postings ListRoaring Bitmapcompression
0 likes · 15 min read
How Elasticsearch Achieves Lightning‑Fast Search with Inverted Indexes
DataFunSummit
DataFunSummit
Dec 2, 2021 · Backend Development

Integrating Elasticsearch with Spring Boot: A Step‑by‑Step Guide

This article explains how to integrate Elasticsearch into a Spring Boot project, covering version compatibility, supported access methods, dependency setup, configuration, entity mapping with annotations, repository creation, and unit testing to verify successful indexing and retrieval.

Elasticsearchbackend-developmentjava
0 likes · 9 min read
Integrating Elasticsearch with Spring Boot: A Step‑by‑Step Guide
Qunar Tech Salon
Qunar Tech Salon
Nov 24, 2021 · Databases

Comprehensive Guide to Elasticsearch Index Design, Settings, and Mapping

This article provides a detailed guide on Elasticsearch index design, covering index settings, shard and replica planning, mapping strategies, complex types, lifecycle management, template usage, and practical best‑practice recommendations for large‑scale log data clusters.

Big DataElasticsearchMapping
0 likes · 27 min read
Comprehensive Guide to Elasticsearch Index Design, Settings, and Mapping
Java Interview Crash Guide
Java Interview Crash Guide
Nov 11, 2021 · Big Data

How Elasticsearch Achieves Lightning‑Fast Search with Inverted Indexes and Compression

This article explains how Elasticsearch uses inverted indexes, term dictionaries, and advanced compression techniques like Frame of Reference and Roaring Bitmaps to enable rapid, scalable search over massive datasets, contrasting its approach with traditional relational database queries and detailing practical optimization tips.

ElasticsearchPostings Listcompression
0 likes · 16 min read
How Elasticsearch Achieves Lightning‑Fast Search with Inverted Indexes and Compression
Dada Group Technology
Dada Group Technology
Oct 29, 2021 · Artificial Intelligence

Query Understanding in JD Daojia E‑commerce Search: Architecture, Core Algorithms, and Experimental Results

This article presents a comprehensive overview of JD Daojia's query understanding system for e‑commerce search, detailing its overall architecture, core modules such as tokenization, term weighting, query rewriting, intent detection, the algorithms employed, experimental evaluations, and future directions.

Query Understandingnatural language processingsearch engine
0 likes · 27 min read
Query Understanding in JD Daojia E‑commerce Search: Architecture, Core Algorithms, and Experimental Results
IT Architects Alliance
IT Architects Alliance
Oct 6, 2021 · Big Data

Understanding Elasticsearch Inverted Index and Efficient Search Retrieval

This article explains how Elasticsearch uses inverted indexes, term dictionaries, and postings lists—along with compression techniques like Frame of Reference and Roaring Bitmaps—to achieve fast, memory‑efficient search queries, and provides practical tips for optimizing indexing and query performance.

ElasticsearchPostings Listcompression
0 likes · 14 min read
Understanding Elasticsearch Inverted Index and Efficient Search Retrieval
IT Architects Alliance
IT Architects Alliance
Sep 29, 2021 · Databases

Understanding Elasticsearch Inverted Index: Fast Retrieval, Compression, and Query Techniques

This article explains how Elasticsearch uses inverted index structures—including term dictionaries, term indexes, and postings lists—combined with compression methods like Frame‑of‑Reference and Roaring Bitmaps to achieve fast search, efficient storage, and effective union queries compared to traditional relational databases.

ElasticsearchPostings ListRoaring Bitmap
0 likes · 14 min read
Understanding Elasticsearch Inverted Index: Fast Retrieval, Compression, and Query Techniques
DataFunTalk
DataFunTalk
Sep 24, 2021 · Artificial Intelligence

Intelligent Question Answering in QQ Browser Search Engine: KBQA, DeepQA, and IRQA

This article presents the architecture, techniques, and practical solutions behind intelligent question answering in QQ Browser's search engine, covering knowledge‑graph based QA (KBQA), machine‑reading‑comprehension QA (DeepQA), and information‑retrieval QA (IRQA), and discusses system design, model optimization, and future directions.

AIinformation retrievalknowledge graph
0 likes · 23 min read
Intelligent Question Answering in QQ Browser Search Engine: KBQA, DeepQA, and IRQA
DevOps Cloud Academy
DevOps Cloud Academy
Sep 21, 2021 · Operations

Practical Elasticsearch Operations and Performance Tuning Guide

This article extends previous Elasticsearch cheat sheets with practical commands and step‑by‑step instructions for shard allocation, replica adjustment, cluster settings, slow‑log configuration, mapping routing, force merge, bulk writes, refresh intervals, translog durability, heap sizing, disk‑space monitoring, and troubleshooting strategies.

Cluster ManagementElasticsearchOperations
0 likes · 7 min read
Practical Elasticsearch Operations and Performance Tuning Guide
Top Architect
Top Architect
Sep 21, 2021 · Databases

Elasticsearch Cluster Architecture and Deployment Strategies

This article explains Elasticsearch's core concepts—nodes, indices, shards, and replicas—describes the indexing flow, compares mixed and layered deployment models, and discusses data‑layer architectures, helping readers choose the appropriate cluster design for reliability and scalability.

Cluster DeploymentElasticsearchShard
0 likes · 13 min read
Elasticsearch Cluster Architecture and Deployment Strategies
Selected Java Interview Questions
Selected Java Interview Questions
Sep 15, 2021 · Big Data

Performance and Feature Comparison between Elasticsearch and ClickHouse for Log Analytics

This article compares Elasticsearch and ClickHouse in terms of architecture, query capabilities, and performance for log analytics, presenting test setups, Docker‑compose configurations, query examples, and benchmark results that show ClickHouse generally outperforms Elasticsearch in most basic query scenarios.

ElasticsearchLog Analyticsclickhouse
0 likes · 12 min read
Performance and Feature Comparison between Elasticsearch and ClickHouse for Log Analytics
Selected Java Interview Questions
Selected Java Interview Questions
Sep 7, 2021 · Big Data

Elasticsearch Basics: Core Concepts, Indexing, Write and Search Processes, Cluster Management and Performance Tips

This article provides a comprehensive overview of Elasticsearch, covering its fundamental architecture, key concepts such as indices, shards and replicas, the complete write and search workflows, consistency mechanisms, master node election, and practical performance‑tuning recommendations for large‑scale deployments.

Big DataCluster ManagementElasticsearch
0 likes · 15 min read
Elasticsearch Basics: Core Concepts, Indexing, Write and Search Processes, Cluster Management and Performance Tips
IT Architects Alliance
IT Architects Alliance
Sep 5, 2021 · Databases

Understanding Elasticsearch Fast Retrieval: Inverted Index, Postings List, and Compression Techniques

This article explains how Elasticsearch achieves rapid search by using inverted indexes, detailing the structure of posting lists, term dictionaries, compression methods like Frame‑of‑Reference and Roaring Bitmaps, and how these techniques enable efficient union queries and filter caching.

ElasticsearchPostings ListRoaring Bitmap
0 likes · 14 min read
Understanding Elasticsearch Fast Retrieval: Inverted Index, Postings List, and Compression Techniques
Architect
Architect
Sep 4, 2021 · Databases

Understanding Elasticsearch Fast Retrieval: Inverted Index, Term Dictionary, and Compression Techniques

This article explains how Elasticsearch achieves fast data retrieval by comparing it with traditional relational databases, detailing search engine fundamentals, the structure of Lucene's inverted index—including term dictionaries, postings lists, and term indexes—and the compression techniques such as Frame of Reference and Roaring Bitmaps that optimize storage and query performance.

ElasticsearchPostings ListRoaring Bitmap
0 likes · 14 min read
Understanding Elasticsearch Fast Retrieval: Inverted Index, Term Dictionary, and Compression Techniques
Top Architect
Top Architect
Aug 14, 2021 · Big Data

Elasticsearch Overview: Architecture, Core Concepts, and Performance Optimization

This article provides a comprehensive introduction to Elasticsearch, covering its underlying Lucene-based inverted index, data types, shard routing, cluster roles, discovery mechanisms, refresh and translog handling, segment merging, and practical performance and JVM tuning tips for building scalable, near‑real‑time search systems.

Elasticsearchindexingperformance tuning
0 likes · 35 min read
Elasticsearch Overview: Architecture, Core Concepts, and Performance Optimization
Python Programming Learning Circle
Python Programming Learning Circle
Aug 12, 2021 · Databases

Understanding MySQL Slow Queries, Index Optimization, ElasticSearch Basics, and HBase Overview

This article explains why MySQL queries become slow, how proper indexing—including B+‑tree, left‑most prefix, index push‑down, and covering indexes—can improve performance, outlines common causes of index failure, and then introduces ElasticSearch search capabilities and HBase column‑family storage as complementary solutions for large‑scale data handling.

Database OptimizationElasticsearchHBase
0 likes · 16 min read
Understanding MySQL Slow Queries, Index Optimization, ElasticSearch Basics, and HBase Overview
MaGe Linux Operations
MaGe Linux Operations
Aug 7, 2021 · Backend Development

Quickly Deploy a Search Engine with Docker and Searx

This guide shows how to set up the open‑source Searx search engine using a Docker image, walk through the essential Docker commands, explore its Python source code, and explains how to customize the response handling for building your own lightweight search engine.

BackendDockerSearx
0 likes · 6 min read
Quickly Deploy a Search Engine with Docker and Searx
vivo Internet Technology
vivo Internet Technology
Jul 14, 2021 · Databases

An Overview of Lucene: Architecture, Indexing Workflow, and Code Implementation

The article introduces Apache Lucene 7.3.1, explains its core architecture and index hierarchy, details the two‑phase indexing and search workflow with code examples for document addition, deletion, merging, and query execution, and highlights its suitability for small‑to‑medium projects versus distributed alternatives.

Code ExampleFull‑Text Searchindexing
0 likes · 20 min read
An Overview of Lucene: Architecture, Indexing Workflow, and Code Implementation
Java Interview Crash Guide
Java Interview Crash Guide
Jul 2, 2021 · Databases

How Elasticsearch Achieves Lightning‑Fast Search with Inverted Indexes

This article explains how Elasticsearch leverages inverted indexes, term dictionaries, and advanced compression techniques like Frame of Reference and Roaring Bitmaps to enable rapid full‑text search, covering the underlying concepts, data structures, and query optimizations essential for high‑performance search applications.

ElasticsearchPostings Listcompression
0 likes · 17 min read
How Elasticsearch Achieves Lightning‑Fast Search with Inverted Indexes
Selected Java Interview Questions
Selected Java Interview Questions
Jul 1, 2021 · Fundamentals

Understanding Elasticsearch Inverted Index: Posting Lists, Term Dictionary, and Compression Techniques

This article explains how Elasticsearch achieves fast search by using inverted indexes, detailing the structure of posting lists, term dictionaries, term indexes, and compression methods such as Frame of Reference and Roaring Bitmaps, as well as techniques for efficient union and intersection queries.

ElasticsearchPostings ListTerm Dictionary
0 likes · 16 min read
Understanding Elasticsearch Inverted Index: Posting Lists, Term Dictionary, and Compression Techniques
Java Architect Essentials
Java Architect Essentials
Jun 21, 2021 · Databases

Understanding MySQL Slow Queries, Index Optimization, and Integration with Elasticsearch and HBase

This article explains why MySQL queries become slow, how index design and common pitfalls affect performance, introduces MDL locks and large‑table strategies, then compares Elasticsearch and HBase as complementary storage and search solutions, providing practical code examples and best‑practice recommendations.

Database OptimizationHBaseindexing
0 likes · 16 min read
Understanding MySQL Slow Queries, Index Optimization, and Integration with Elasticsearch and HBase
Wukong Talks Architecture
Wukong Talks Architecture
May 25, 2021 · Backend Development

Elasticsearch Performance Pitfalls and Optimization Strategies

This article examines common performance pitfalls in Elasticsearch—including slow queries, cluster architecture bottlenecks, and business‑scenario challenges—and provides practical guidance such as caching key fields, data pre‑heating, hot‑cold separation, avoiding joins, and using tribe nodes to improve accuracy and response time.

BackendClusterElasticsearch
0 likes · 9 min read
Elasticsearch Performance Pitfalls and Optimization Strategies
Java Interview Crash Guide
Java Interview Crash Guide
May 21, 2021 · Backend Development

Mastering Elasticsearch: Core Concepts, Indexing, and Real‑Time Search Explained

This comprehensive guide walks through Elasticsearch fundamentals, including its architecture, core concepts like indices, shards, and replicas, the write and update processes, search workflow, consistency mechanisms, master election, performance tuning, and strategies for deep pagination and scroll searches.

Elasticsearchindexingreal-time search
0 likes · 17 min read
Mastering Elasticsearch: Core Concepts, Indexing, and Real‑Time Search Explained
Intelligent Backend & Architecture
Intelligent Backend & Architecture
Apr 23, 2021 · Big Data

Mastering Elasticsearch: Core Concepts, Architecture, and Performance Tips

This comprehensive guide explains Elasticsearch’s fundamentals, including its distributed architecture, indexing process, shard and replica mechanisms, query execution, near‑real‑time search, segment management, and practical optimization techniques, providing developers and engineers with the knowledge needed to design, operate, and troubleshoot large‑scale search clusters.

Distributed Systemsindexingnear real-time
0 likes · 71 min read
Mastering Elasticsearch: Core Concepts, Architecture, and Performance Tips
Baidu Geek Talk
Baidu Geek Talk
Mar 17, 2021 · Artificial Intelligence

Overview of Baidu's Wànxiàng System for Large‑Scale Rich Media Processing

Baidu’s Wànxiàng system processes billions of images and videos daily by extracting low‑ and high‑level features, linking related media, and aggregating semantic attributes in a scalable, timely architecture that leverages thousands of CPU, GPU, and FPGA cores to power accurate, low‑latency rich‑media search and recommendation.

Artificial IntelligenceBaiduImage Analysis
0 likes · 14 min read
Overview of Baidu's Wànxiàng System for Large‑Scale Rich Media Processing
58 Tech
58 Tech
Mar 8, 2021 · Fundamentals

Real‑Time Inverted Index Update Techniques in the 58 Search Engine

This article explains how the 58 search engine achieves millisecond‑level real‑time inverted‑index updates by redesigning the underlying data structures, combining static and dynamic indexing, using a chain‑array hybrid, segment merging strategies, and lock‑free read‑write concurrency while maintaining search performance.

Data StructuresReal-Time UpdateSegment Merging
0 likes · 18 min read
Real‑Time Inverted Index Update Techniques in the 58 Search Engine
Top Architect
Top Architect
Mar 5, 2021 · Big Data

Elasticsearch Indexing and Search Optimization: Principles, Lucene Internals, and Performance Tuning

This article explains the architecture and core concepts of Elasticsearch and Lucene, outlines the requirements for cross‑month and high‑speed queries on massive datasets, and provides detailed index and search performance tuning techniques—including bulk writes, shard routing, doc‑values management, and pagination strategies—to achieve sub‑second response times on billions of records.

Big DataElasticsearchIndex Optimization
0 likes · 13 min read
Elasticsearch Indexing and Search Optimization: Principles, Lucene Internals, and Performance Tuning
Laravel Tech Community
Laravel Tech Community
Feb 25, 2021 · Backend Development

Apache Lucene 8.8.1 Released – New Features and Fixes

Apache Lucene 8.8.1, a high‑performance Java full‑text search engine library, has been released, fixing bugs from 8.8.0 and including optimizations, while the previous 8.8.0 version introduced features such as LatLonPoint spatial queries, configurable Doc value compression, and FeatureField’s newLinearQuery scoring.

Apache LuceneBackendFull‑Text Search
0 likes · 2 min read
Apache Lucene 8.8.1 Released – New Features and Fixes
21CTO
21CTO
Feb 6, 2021 · Backend Development

From a Cooking App to Elasticsearch: The Story Behind the Search Engine

This article recounts how Shay Banon’s early cooking‑app project led to the creation of Compass, the evolution into Elasticsearch, and the pivotal role of Apache Lucene, distributed testing, and visionary design in building today’s powerful search platform.

Apache LuceneDistributed TestingElasticsearch
0 likes · 9 min read
From a Cooking App to Elasticsearch: The Story Behind the Search Engine
Qunar Tech Salon
Qunar Tech Salon
Feb 4, 2021 · Fundamentals

Understanding Lucene Inverted Index: Principles and Implementation

This article explains the concept of inverted indexes, their role in full‑text search, and provides a detailed overview of how Apache Lucene implements inverted indexing, including term dictionaries, posting lists, query processing, and numeric handling with BKDTree.

BKDTreePosting ListTerm Dictionary
0 likes · 15 min read
Understanding Lucene Inverted Index: Principles and Implementation
MaGe Linux Operations
MaGe Linux Operations
Dec 28, 2020 · Backend Development

Mastering Elasticsearch: Core Concepts and Indexing Workflow Explained

This article introduces Elasticsearch’s core concepts—including clusters, node roles, documents, mappings, and shards—and walks through the complete indexing workflow from client request to replica synchronization, highlighting key settings, routing calculations, and the role of refresh and flush operations.

ClusterDistributed SystemsElasticsearch
0 likes · 13 min read
Mastering Elasticsearch: Core Concepts and Indexing Workflow Explained
Tencent Cloud Developer
Tencent Cloud Developer
Dec 24, 2020 · Big Data

Distributed Search Engine Design and Index Management in WeChat Search

The article details WeChat Search’s practical distributed architecture—using a Chubby‑elected leader for shard‑to‑node mapping, hash‑based sharding with dynamic rebalancing, a Lambda‑style batch and near‑real‑time indexing pipeline, relaxed monotonic consistency, and group‑based searcher scaling—to illustrate trade‑offs and lessons for building scalable, reliable search services.

Distributed SystemsIndex ManagementLSM
0 likes · 28 min read
Distributed Search Engine Design and Index Management in WeChat Search
DeWu Technology
DeWu Technology
Dec 4, 2020 · Fundamentals

Introduction to Search Engine Technology and Information Retrieval

The article surveys core search‑engine technology—document hierarchy, flat and vertical inverted indexes, query operators for building and merging score lists, and ranking models from Boolean and BM25 to language‑model approaches like Indri—providing a foundational overview of information retrieval.

BM25information retrievalinverted index
0 likes · 14 min read
Introduction to Search Engine Technology and Information Retrieval