Tagged articles
1178 articles
Page 6 of 12
High Availability Architecture
High Availability Architecture
Nov 7, 2022 · Backend Development

Design and Implementation of Meituan's Logan Real-Time Log System

This article describes how Meituan built Logan, a high‑performance, end‑to‑end real‑time logging platform for mobile, web, mini‑programs and IoT, covering its background, architecture, data collection, processing, consumption, monitoring, deployment strategies, achieved results and future roadmap.

Backend ArchitectureElasticsearchFlink
0 likes · 15 min read
Design and Implementation of Meituan's Logan Real-Time Log System
Meituan Technology Team
Meituan Technology Team
Nov 3, 2022 · Backend Development

Design and Implementation of Logan Real-Time Log System at Meituan

The article details Meituan’s end‑to‑end design and implementation of Logan, a high‑performance real‑time logging service for mobile apps, web, mini‑programs and IoT, covering background challenges, architecture layers, technology choices such as Flink and Elasticsearch, stability measures, deployment practices, achieved results and future plans.

Blue‑Green deploymentElasticsearchFlink
0 likes · 21 min read
Design and Implementation of Logan Real-Time Log System at Meituan
Su San Talks Tech
Su San Talks Tech
Nov 3, 2022 · Backend Development

Mastering Elasticsearch Pagination: From From/Size to Scroll and Search After

Elasticsearch offers several pagination strategies—simple from/size, scroll, scroll‑scan, sliced scroll, and the newer search_after with point‑in‑time—each with distinct performance trade‑offs and use‑case suitability, and this guide explains their mechanics, limitations, and best‑practice recommendations for handling deep pagination.

ElasticsearchSearchperformance
0 likes · 18 min read
Mastering Elasticsearch Pagination: From From/Size to Scroll and Search After
Architecture Digest
Architecture Digest
Oct 30, 2022 · Backend Development

High‑Availability Architecture for a Large‑Scale Membership System

This article details the design and implementation of a high‑availability, high‑performance membership system that serves billions of users across multiple platforms, covering Elasticsearch dual‑center clusters, traffic‑isolated three‑cluster setups, Redis caching strategies, MySQL dual‑center partitioning, and advanced flow‑control and degradation mechanisms.

ElasticsearchPartitioningSystem Architecture
0 likes · 18 min read
High‑Availability Architecture for a Large‑Scale Membership System
Architect's Guide
Architect's Guide
Oct 27, 2022 · Big Data

Elasticsearch Overview: Data Types, Lucene Foundations, Core Concepts, Cluster Architecture, Indexing, Storage, and Performance Optimization

This article provides a comprehensive introduction to Elasticsearch, covering the distinction between structured and unstructured data, Lucene’s inverted index, ES core concepts such as clusters, nodes, shards and replicas, mapping, basic usage, storage mechanisms, and practical performance‑tuning tips for large‑scale search deployments.

Distributed SystemsElasticsearchindexing
0 likes · 39 min read
Elasticsearch Overview: Data Types, Lucene Foundations, Core Concepts, Cluster Architecture, Indexing, Storage, and Performance Optimization
Selected Java Interview Questions
Selected Java Interview Questions
Oct 23, 2022 · Big Data

Building a Cost‑Effective Data Analysis Platform: ClickHouse vs Elasticsearch and Deployment Guide for Zookeeper, Kafka, Filebeat, and ClickHouse

This article compares Elasticsearch and ClickHouse for log analytics, presents cost‑benefit calculations, and provides a step‑by‑step deployment guide for Zookeeper, Kafka, Filebeat, and ClickHouse to build a scalable, low‑cost data analysis platform for SaaS services.

Big DataDeploymentElasticsearch
0 likes · 12 min read
Building a Cost‑Effective Data Analysis Platform: ClickHouse vs Elasticsearch and Deployment Guide for Zookeeper, Kafka, Filebeat, and ClickHouse
Architecture Digest
Architecture Digest
Oct 21, 2022 · Operations

Benchmarking and Sizing Your Elasticsearch Cluster for Logs and Metrics

This article explains how to assess hardware resources, calculate required Elasticsearch cluster size based on data volume, and perform indexing and search benchmark tests to ensure stable performance and optimal throughput for log and metric workloads in production environments.

BenchmarkingCluster SizingElasticsearch
0 likes · 10 min read
Benchmarking and Sizing Your Elasticsearch Cluster for Logs and Metrics
Top Architect
Top Architect
Oct 19, 2022 · Big Data

Elasticsearch Architecture Overview and Core Concepts

This article provides a comprehensive overview of Elasticsearch, covering data types, Lucene fundamentals, cluster architecture, shard allocation, indexing mechanisms, storage strategies, refresh and translog processes, segment merging, performance tuning, and JVM optimization for building scalable, near‑real‑time search solutions.

Big DataClusterElasticsearch
0 likes · 37 min read
Elasticsearch Architecture Overview and Core Concepts
Top Architect
Top Architect
Oct 14, 2022 · Databases

Elasticsearch Overview: Architecture, Core Concepts, and Performance Optimization

This article provides a comprehensive introduction to Elasticsearch, covering data types, Lucene fundamentals, cluster architecture, node roles, shard and replica mechanisms, mapping, basic usage, installation steps, health monitoring, indexing workflow, storage strategies, refresh and translog handling, segment merging, and practical performance tuning tips.

ClusterElasticsearchindexing
0 likes · 36 min read
Elasticsearch Overview: Architecture, Core Concepts, and Performance Optimization
dbaplus Community
dbaplus Community
Oct 9, 2022 · Operations

How Ping An Health Scaled SkyWalking to Billions of Traces: A Full‑Link Monitoring Journey

This article recounts the end‑to‑end design, implementation, and iterative optimization of a billion‑scale full‑link tracing system at Ping An Health using SkyWalking, covering why full‑link monitoring is needed, the selection of SkyWalking, architecture choices, performance bottlenecks, and the roadmap for future enhancements.

APMElasticsearchFull‑Link Tracing
0 likes · 21 min read
How Ping An Health Scaled SkyWalking to Billions of Traces: A Full‑Link Monitoring Journey
DataFunSummit
DataFunSummit
Sep 28, 2022 · Big Data

Elasticsearch Time Series Engine: Practices, Challenges, and Alibaba Cloud TimeStream

This article presents a comprehensive overview of using Elasticsearch as a time series engine, covering its motivations, challenges, key features, Alibaba Cloud TimeStream optimizations such as columnar storage, LSM structures, downsampling, and integration with Prometheus and Grafana, while also discussing performance and cost considerations.

Big DataDownsamplingElasticsearch
0 likes · 15 min read
Elasticsearch Time Series Engine: Practices, Challenges, and Alibaba Cloud TimeStream
Baidu Geek Talk
Baidu Geek Talk
Sep 28, 2022 · Backend Development

How to Optimize Elasticsearch Queries for Precise Enterprise Search Results

This article walks through the practical steps of improving Elasticsearch relevance for an enterprise search platform, covering user requirements, index creation, analysis, scoring models, boost and filter techniques, function_score customizations, and post‑query interventions to deliver more accurate and business‑aligned results.

ElasticsearchQuery DSLRelevance Scoring
0 likes · 11 min read
How to Optimize Elasticsearch Queries for Precise Enterprise Search Results
Code Ape Tech Column
Code Ape Tech Column
Sep 15, 2022 · Databases

Understanding Data Heterogeneity: Scenarios, Methods, and Implementation with Binlog, Canal, and MQ

This article explains the concept of data heterogeneity, outlines common use cases such as sharding and multi‑dimensional queries, and details practical implementation methods including full cloning, marked sync, binlog‑based replication with Canal, and MQ‑driven approaches, while providing deployment tips and references.

CanalElasticsearchMessage Queue
0 likes · 12 min read
Understanding Data Heterogeneity: Scenarios, Methods, and Implementation with Binlog, Canal, and MQ
Selected Java Interview Questions
Selected Java Interview Questions
Sep 13, 2022 · Big Data

Java API for Elasticsearch: Configuration, CRUD, DSL Queries, Pagination, Sorting, and Highlighting

This article demonstrates how to integrate Elasticsearch 6.2.1 with a Spring Boot project using the high‑level REST client, covering Maven dependencies, bean configuration, index creation and deletion, various DSL queries, pagination, sorting, boosting, boolean filters, and result highlighting with complete Java code examples.

DSLElasticsearchSearch
0 likes · 17 min read
Java API for Elasticsearch: Configuration, CRUD, DSL Queries, Pagination, Sorting, and Highlighting
Selected Java Interview Questions
Selected Java Interview Questions
Sep 9, 2022 · Databases

Performance Testing and Optimization of ClickHouse and Elasticsearch for High-Concurrency Scenarios

This technical report details the requirement analysis, environment setup, monitoring tools, load‑test scripts, data design, execution results, and optimization recommendations for stress‑testing ClickHouse and Elasticsearch to ensure they can handle high‑concurrency business peaks.

Big DataDatabase OptimizationElasticsearch
0 likes · 11 min read
Performance Testing and Optimization of ClickHouse and Elasticsearch for High-Concurrency Scenarios
DataFunTalk
DataFunTalk
Sep 8, 2022 · Databases

Elasticsearch as a Time Series Engine: Practices, Challenges, and Alibaba Cloud TimeStream Solutions

This article explains why Elasticsearch is being adapted as a time‑series engine, outlines its unique characteristics and challenges such as high query complexity and storage cost, and introduces Alibaba Cloud’s TimeStream solution with optimizations like index settings, compression, down‑sampling, and Prometheus integration.

DataStreamElasticsearchTime Series
0 likes · 13 min read
Elasticsearch as a Time Series Engine: Practices, Challenges, and Alibaba Cloud TimeStream Solutions
Tencent Cloud Middleware
Tencent Cloud Middleware
Sep 6, 2022 · Cloud Computing

Quickly Set Up One‑Click Data Ingestion Pipelines in Tencent Cloud Elasticsearch

This guide explains how to use Tencent Cloud Elasticsearch Service’s one‑click data‑link visual integration with CKafka to create end‑to‑end pipelines—covering source selection, component configuration, data collection, caching, processing, and destination setup—for both CVM and TKE environments, while reducing operational overhead.

CKafkaCVMElasticsearch
0 likes · 9 min read
Quickly Set Up One‑Click Data Ingestion Pipelines in Tencent Cloud Elasticsearch
HomeTech
HomeTech
Aug 30, 2022 · Big Data

Real‑time Data Collection SDK Visualization: Architecture, Implementation and Usage Guide

This article introduces a data‑collection SDK with a real‑time visualization feature, explains the shortcomings of traditional packet‑capture and log‑based methods, describes the underlying architecture—including a new SDK entry, encrypted reporting, WebSocket communication and Elasticsearch storage—and provides step‑by‑step usage instructions for developers.

ElasticsearchWebSocketdata collection
0 likes · 8 min read
Real‑time Data Collection SDK Visualization: Architecture, Implementation and Usage Guide
Programmer DD
Programmer DD
Aug 30, 2022 · Artificial Intelligence

How to Build a Custom HanLP Analyzer Plugin for Elasticsearch with Nginx

This guide walks through setting up a Java GraalVM 17 environment, installing Nginx to serve static dictionary files, configuring a HanLP‑based Elasticsearch analyzer plugin, packaging and deploying it, and testing the analyzer with JUnit5 and curl commands.

ElasticsearchHanLPNLP
0 likes · 14 min read
How to Build a Custom HanLP Analyzer Plugin for Elasticsearch with Nginx
Selected Java Interview Questions
Selected Java Interview Questions
Aug 27, 2022 · Backend Development

Deploying a Cost‑Effective ClickHouse‑Based Backend Data Platform: Comparison with Elasticsearch and Step‑by‑Step Setup Guide

This article compares Elasticsearch and ClickHouse for log analytics, presents cost analysis, and provides detailed deployment instructions for Zookeeper, Kafka, Filebeat, and ClickHouse to build a private, high‑performance backend data platform for SaaS services.

ElasticsearchFilebeatKafka
0 likes · 12 min read
Deploying a Cost‑Effective ClickHouse‑Based Backend Data Platform: Comparison with Elasticsearch and Step‑by‑Step Setup Guide
政采云技术
政采云技术
Aug 23, 2022 · Backend Development

Understanding Elasticsearch Document Scoring and Aggregation Techniques

This article explains the underlying principles of Elasticsearch scoring, covering Boolean model queries, TF/IDF, field length normalization, the vector space model, and detailed aggregation examples with code snippets to illustrate practical search and analytics usage.

ElasticsearchScoringSearch
0 likes · 19 min read
Understanding Elasticsearch Document Scoring and Aggregation Techniques
Open Source Linux
Open Source Linux
Aug 21, 2022 · Backend Development

How Elasticsearch Achieves Lightning‑Fast Search with Inverted Indexes

This article explains how Elasticsearch leverages inverted indexes, term dictionaries, and compression techniques such as Frame‑of‑Reference and Roaring Bitmaps to enable rapid full‑text search, detailing the underlying data structures, query processing, and practical indexing tips for efficient backend search implementations.

ElasticsearchPostings Listbackend-development
0 likes · 17 min read
How Elasticsearch Achieves Lightning‑Fast Search with Inverted Indexes
SQB Blog
SQB Blog
Aug 19, 2022 · Backend Development

How to Build a Zero‑Code, Scalable Enterprise Approval Center

This article outlines the design and implementation of a zero‑code, visual approval system that integrates with internal business services, detailing its architecture, core modules, rule engine, multi‑platform support, search capabilities, and security considerations for large‑scale enterprise use.

BPMBackend ArchitectureElasticsearch
0 likes · 15 min read
How to Build a Zero‑Code, Scalable Enterprise Approval Center
Efficient Ops
Efficient Ops
Aug 16, 2022 · Backend Development

How Elasticsearch Delivers Lightning‑Fast Search with Inverted Indexes and Compression

This article explains how Elasticsearch uses inverted indexes, term dictionaries, and advanced compression techniques such as Frame‑of‑Reference and Roaring Bitmaps to achieve rapid search performance while minimizing memory and disk usage, and it also covers practical indexing tips for production use.

ElasticsearchPostings Listbackend-development
0 likes · 15 min read
How Elasticsearch Delivers Lightning‑Fast Search with Inverted Indexes and Compression
IT Architects Alliance
IT Architects Alliance
Aug 13, 2022 · Operations

Why ClickHouse Beats Elasticsearch: Performance, Cost, and Deployment Guide

This article compares ClickHouse and Elasticsearch, analyzes cost savings, and provides step‑by‑step deployment instructions for Zookeeper, Kafka, Filebeat, and ClickHouse clusters, including configuration details, troubleshooting tips, and practical code snippets for building a scalable analytics pipeline.

DeploymentElasticsearchFilebeat
0 likes · 13 min read
Why ClickHouse Beats Elasticsearch: Performance, Cost, and Deployment Guide
Sohu Tech Products
Sohu Tech Products
Aug 3, 2022 · Backend Development

Design and Evolution of the Price‑Increase Coupon Service for a C2B Recycling Platform

This article details the architectural evolution of a price‑increase coupon system used in a C2B digital‑product recycling business, covering its initial experimental phase, platformization, sharding, intelligent recommendation, Elasticsearch integration, and stability enhancements through NoSQL and read‑replica strategies.

CouponElasticsearchSystem Design
0 likes · 10 min read
Design and Evolution of the Price‑Increase Coupon Service for a C2B Recycling Platform
dbaplus Community
dbaplus Community
Jul 26, 2022 · Backend Development

Unlocking Elasticsearch: Core Concepts, Architecture, and Performance Tips

This comprehensive guide explains Elasticsearch’s role in searching structured and unstructured data, covers Lucene’s inverted index, details cluster components, shard and replica mechanics, mapping types, installation steps, indexing workflow, storage strategies, and practical performance optimizations for real‑world deployments.

Distributed SystemsElasticsearchindexing
0 likes · 39 min read
Unlocking Elasticsearch: Core Concepts, Architecture, and Performance Tips
IT Architects Alliance
IT Architects Alliance
Jul 20, 2022 · Backend Development

Implementation Approach for Query Separation Using Message Queues and Elasticsearch

This article explains the design and implementation of query separation, covering trigger mechanisms, data storage, synchronization via asynchronous threads or message queues, handling of MQ failures, idempotent consumption, ordering issues, and migration of historical data to an Elasticsearch-based query store.

Elasticsearchdata synchronizationhistorical data migration
0 likes · 19 min read
Implementation Approach for Query Separation Using Message Queues and Elasticsearch
DataFunSummit
DataFunSummit
Jul 17, 2022 · Big Data

Elasticsearch and Big Data: Architecture, Use Cases, and Advantages

This article explains what Elasticsearch is, how it solves database acceleration, log observability, and data analysis problems, details its core components and underlying engine features, compares its strengths and weaknesses, and presents classic application scenarios and a real‑world case study integrating Elasticsearch with Flink for large‑scale log analytics.

Big DataElasticsearchFlink
0 likes · 13 min read
Elasticsearch and Big Data: Architecture, Use Cases, and Advantages
DataFunTalk
DataFunTalk
Jul 15, 2022 · Cloud Computing

Alibaba Cloud Elasticsearch Log Scenario Best Practices, Engine Optimizations, and Performance Evaluation

This article presents a comprehensive overview of Alibaba Cloud Elasticsearch for log analytics, detailing cluster characteristics, common pain points, five key optimization techniques—including cold/hot resource sharing and indexing service—followed by engine kernel improvements, performance benchmarks, and a step‑by‑step product demonstration.

ElasticsearchIndexing ServiceLog Analytics
0 likes · 11 min read
Alibaba Cloud Elasticsearch Log Scenario Best Practices, Engine Optimizations, and Performance Evaluation
IT Architects Alliance
IT Architects Alliance
Jul 14, 2022 · Big Data

Elasticsearch Overview: Core Concepts, Architecture, and Practical Usage

This article provides a comprehensive introduction to Elasticsearch, covering data types, Lucene fundamentals, cluster architecture, node roles, shard and replica mechanisms, mapping, installation, health monitoring, indexing principles, storage strategies, refresh and translog handling, segment merging, performance tuning, and JVM optimization for large‑scale search applications.

Big DataElasticsearchindexing
0 likes · 35 min read
Elasticsearch Overview: Core Concepts, Architecture, and Practical Usage
Top Architect
Top Architect
Jul 14, 2022 · Big Data

A Comprehensive Introduction to Elasticsearch: Architecture, Core Concepts, and Practical Usage

This article provides a detailed overview of Elasticsearch, covering its data model, Lucene foundation, cluster architecture, shard and replica mechanisms, index mapping, installation steps, health monitoring, write and storage processes, segment management, and performance tuning techniques for large‑scale search applications.

Big DataElasticsearchindexing
0 likes · 35 min read
A Comprehensive Introduction to Elasticsearch: Architecture, Core Concepts, and Practical Usage
政采云技术
政采云技术
Jul 14, 2022 · Operations

Diagnosing and Optimizing Elasticsearch IO Bottlenecks for Billion-Scale Product Catalogs

Facing severe IO-wait and read bottlenecks as product data grew from tens of millions to billions, this article analyzes root causes in Elasticsearch clusters and presents a comprehensive solution involving index parameter tuning, merge settings, translog async writes, query optimizations, and hardware upgrades to restore performance and stability.

ElasticsearchIO optimizationIndex Tuning
0 likes · 14 min read
Diagnosing and Optimizing Elasticsearch IO Bottlenecks for Billion-Scale Product Catalogs
HelloTech
HelloTech
Jul 6, 2022 · Big Data

Investigation and Resolution of Elasticsearch Write Timeout Issues in a Real-Time Flink Data Sync Pipeline

The team diagnosed intermittent Elasticsearch write‑timeout failures in their real‑time Flink‑to‑Elasticsearch pipeline as lock contention from frequent duplicate updates to the same document IDs, and eliminated the issue by aggregating binlog events in a 5‑second sliding window to deduplicate writes, adjusting refresh intervals, using async translog durability, and disabling non‑essential fields.

Big DataElasticsearchFlink
0 likes · 7 min read
Investigation and Resolution of Elasticsearch Write Timeout Issues in a Real-Time Flink Data Sync Pipeline
Selected Java Interview Questions
Selected Java Interview Questions
Jul 5, 2022 · Big Data

Understanding Elasticsearch: Core Concepts, Architecture, Indexing Mechanics and Performance Optimization

This article explains the fundamentals of structured and unstructured data, introduces Lucene's inverted index, describes Elasticsearch's distributed cluster architecture, node roles, sharding and replication mechanisms, indexing workflow with refresh and translog, storage segment model, and provides practical performance‑tuning recommendations.

ClusterElasticsearchReplication
0 likes · 36 min read
Understanding Elasticsearch: Core Concepts, Architecture, Indexing Mechanics and Performance Optimization
vivo Internet Technology
vivo Internet Technology
Jun 22, 2022 · Big Data

Evolution and Optimization of Numerical Indexing in Elasticsearch for Geo‑Location Queries

The article traces Elasticsearch’s geo‑indexing evolution from early string‑based simulations through Quadtree filters to the modern BKD‑tree implementation, showing how each optimization dramatically improves memory usage, query speed, and accuracy for large‑scale point‑of‑interest searches in location‑based services.

BKD-TreeElasticsearchGeo-Location
0 likes · 25 min read
Evolution and Optimization of Numerical Indexing in Elasticsearch for Geo‑Location Queries
HelloTech
HelloTech
Jun 21, 2022 · Backend Development

Recommendation Engine Upgrade Path, Architecture, and Performance Optimization for the "Guangguang" Content Community

The article details Guangguang’s shift from a rule‑based, Hive‑driven recommendation pipeline to an algorithmic service that leverages Elasticsearch and Redis for multi‑source recall, coarse and fine model ranking, exposure filtering, cold‑start handling, latency optimizations, reliability monitoring, and future vector‑based enhancements.

ElasticsearchReal-Timebandit algorithm
0 likes · 16 min read
Recommendation Engine Upgrade Path, Architecture, and Performance Optimization for the "Guangguang" Content Community
dbaplus Community
dbaplus Community
Jun 7, 2022 · Databases

How to Diagnose and Fix Elasticsearch Slow Queries: From PointRange to Keyword

This article examines why Elasticsearch slow queries occur in a shared cluster, analyzes a problematic query's structure and data‑type choices, and demonstrates how converting integer fields to keyword mappings and adjusting filter order can reduce latency from over 100 ms to under 10 ms while eliminating slow‑query alerts.

ElasticsearchIndex MappingPointRangeQuery
0 likes · 10 min read
How to Diagnose and Fix Elasticsearch Slow Queries: From PointRange to Keyword
Python Programming Learning Circle
Python Programming Learning Circle
Jun 7, 2022 · Big Data

Getting Started with Kibana: Installation, Configuration, and Usage Guide

This article provides a comprehensive guide to Kibana, an open‑source data analysis and visualization platform for Elasticsearch, covering its installation, configuration, index pattern setup, data discovery, query syntax, visualization creation, dashboard building, and monitoring, with step‑by‑step instructions and screenshots.

Data visualizationElasticsearchKibana
0 likes · 11 min read
Getting Started with Kibana: Installation, Configuration, and Usage Guide
Zuoyebang Tech Team
Zuoyebang Tech Team
Jun 7, 2022 · Big Data

How Doris Powered Zuoyebang’s Real‑Time Data Warehouse for Faster Insights

Zuoyebang’s data team replaced fragmented, slow query solutions with Apache Doris, building a unified real‑time data warehouse that dramatically cut query latency from hours to seconds, streamlined data modeling, and improved reliability across diverse business scenarios, while integrating with Flink, Kafka, and ES via a unified API.

Apache DorisElasticsearchFlink
0 likes · 20 min read
How Doris Powered Zuoyebang’s Real‑Time Data Warehouse for Faster Insights
Dada Group Technology
Dada Group Technology
Jun 6, 2022 · Backend Development

Evolution of JD Daojia Search System Architecture from Version 1.0 to 3.0

The article details the progressive architectural evolution of JD Daojia's search system—starting from a simple, single‑layer ES‑based 1.0 design, through the 2.0 overhaul that introduced full‑recall, independent ranking services, and index disaster‑recovery, to the 3.0 version that adds multi‑path recall, sophisticated ranking models, and automated routing for high availability.

ElasticsearchScalabilitySearch
0 likes · 20 min read
Evolution of JD Daojia Search System Architecture from Version 1.0 to 3.0
IT Architects Alliance
IT Architects Alliance
Jun 5, 2022 · Big Data

Optimizing 10K‑Record Queries from Tens of Millions: CK, ES, HBase & Redis Strategies

This article examines a real‑world requirement to extract no more than 100 000 rows from a pool of tens of millions, comparing multithreaded ClickHouse pagination, Elasticsearch scroll‑scan deep paging, an ES‑HBase hybrid query, and a RediSearch‑RedisJSON approach, and presents performance measurements and practical conclusions.

ElasticsearchHBaseLargeScaleQuery
0 likes · 12 min read
Optimizing 10K‑Record Queries from Tens of Millions: CK, ES, HBase & Redis Strategies
Java Baker
Java Baker
Jun 4, 2022 · Databases

Boost Multi-Dimensional Queries with Elasticsearch: From Basics to Advanced Integration

This article explains common multi‑dimensional query scenarios, compares basic indexing tricks and data heterogeneity approaches, then details how to integrate Elasticsearch with MySQL—including architecture, data import, Java client usage, code examples, and best‑practice recommendations—for scalable, near‑real‑time search.

ElasticsearchJava ClientMulti-dimensional query
0 likes · 15 min read
Boost Multi-Dimensional Queries with Elasticsearch: From Basics to Advanced Integration
Big Data Technology & Architecture
Big Data Technology & Architecture
May 30, 2022 · Big Data

Doris Architecture, Principles, and Key Features Overview

This article provides a comprehensive overview of Doris's architecture—including its FE and BE components, metadata management, data organization, execution planning—and details its major features such as adaptive join aggregation, vectorized execution, materialized views, and Elasticsearch integration, supplemented with example DDL and query code.

Big DataDatabase ArchitectureElasticsearch
0 likes · 7 min read
Doris Architecture, Principles, and Key Features Overview
Top Architect
Top Architect
May 29, 2022 · Backend Development

Integrating Spring Data Elasticsearch with Spring Boot: Configuration, Entity, Repository, and Query Examples

This tutorial demonstrates how to set up Elasticsearch 7.6 with the IK analyzer in a Spring Boot 2.3 project, import the appropriate Spring Data Elasticsearch dependency, configure the client, define indexed entity classes, create repository interfaces, and implement REST controllers for CRUD, pagination, and highlighted search queries, complete with code examples and test results.

ElasticsearchSearchSpring Boot
0 likes · 9 min read
Integrating Spring Data Elasticsearch with Spring Boot: Configuration, Entity, Repository, and Query Examples
Liangxu Linux
Liangxu Linux
May 23, 2022 · Big Data

Master Kibana: From Installation to Advanced Data Visualization

This guide walks you through installing Kibana, configuring its connection to Elasticsearch, exploring data with Discover, crafting queries using Lucene and DSL, creating visualizations and dashboards, and monitoring cluster health, providing step‑by‑step instructions, screenshots, and practical tips for effective data analysis.

DashboardData visualizationElasticsearch
0 likes · 13 min read
Master Kibana: From Installation to Advanced Data Visualization
Top Architect
Top Architect
May 20, 2022 · Big Data

Step-by-Step Guide to Deploying an Elasticsearch Cluster with Docker on CentOS

This article provides a comprehensive tutorial on why to use Elasticsearch, how to pull the Docker image, set up data directories, configure cluster and node settings, adjust system limits, launch three Elasticsearch containers, and verify the cluster using REST APIs and the elasticsearch‑head UI.

CentOSClusterElasticsearch
0 likes · 13 min read
Step-by-Step Guide to Deploying an Elasticsearch Cluster with Docker on CentOS
IT Architects Alliance
IT Architects Alliance
May 17, 2022 · Backend Development

Using Elasticsearch for File Upload, Indexing, and Keyword Search with Ingest Attachment Plugin

This article explains how to implement file upload, download, and precise keyword search for Word, PDF, and txt documents using Elasticsearch, covering environment setup, ingest‑attachment preprocessing, index mapping, Java code for uploading and querying, Chinese analysis with IK analyzer, and highlighting of results.

ElasticsearchIK AnalyzerIngest Attachment
0 likes · 13 min read
Using Elasticsearch for File Upload, Indexing, and Keyword Search with Ingest Attachment Plugin
Open Source Linux
Open Source Linux
May 16, 2022 · Big Data

Master ELK: Build a Scalable Log Management System with Elasticsearch, Logstash, Kibana

This guide introduces the ELK stack (Elasticsearch, Logstash, Kibana, and Filebeat), explains why centralized log management is essential, details the architecture options, and provides step‑by‑step installation and configuration instructions—including a Kafka‑backed pipeline—to help you deploy a production‑grade logging solution.

ELKElasticsearchFilebeat
0 likes · 24 min read
Master ELK: Build a Scalable Log Management System with Elasticsearch, Logstash, Kibana
Architecture Digest
Architecture Digest
May 13, 2022 · Big Data

Step-by-Step Guide to Deploy a Multi-Node Elasticsearch Cluster with Docker

This article provides a comprehensive tutorial on pulling Elasticsearch Docker images, configuring data directories, creating cluster configuration files, adjusting JVM settings, launching three Elasticsearch nodes in Docker containers, and verifying the cluster using both the REST API and the elasticsearch‑head UI.

ConfigurationDockerElasticsearch
0 likes · 12 min read
Step-by-Step Guide to Deploy a Multi-Node Elasticsearch Cluster with Docker
Top Architect
Top Architect
May 12, 2022 · Operations

ELK Stack: Common Deployment Architectures, Configuration Tips, and Troubleshooting

This article introduces the ELK stack as a popular centralized logging solution, explains three typical deployment architectures—including Logstash‑based, Filebeat‑based, and cache‑queue‑enhanced setups—details their advantages and drawbacks, and provides practical configuration examples and troubleshooting techniques for multiline logs, timestamp handling, and module filtering.

ELKElasticsearchFilebeat
0 likes · 9 min read
ELK Stack: Common Deployment Architectures, Configuration Tips, and Troubleshooting
政采云技术
政采云技术
May 12, 2022 · Fundamentals

Understanding Lucene Query Process and Core Principles

This article explains Lucene's query types, the step‑by‑step query execution flow—including entry, rewrite, weight creation, scoring, and result collection—while providing code examples and performance considerations to help developers troubleshoot and optimize search performance.

BM25ElasticsearchSearch
0 likes · 15 min read
Understanding Lucene Query Process and Core Principles
Sohu Tech Products
Sohu Tech Products
May 11, 2022 · Backend Development

Elasticsearch Pagination: From/Size, Deep Paging Issues, Scroll, Search After, PIT and Best Practices

This article explains Elasticsearch pagination mechanisms—including from/size, deep paging drawbacks, scroll, scroll‑scan, sliced scroll, search_after and point‑in‑time—detailing their inner workings, performance trade‑offs, configuration limits, and practical recommendations for handling large result sets.

BackendDeep PagingElasticsearch
0 likes · 17 min read
Elasticsearch Pagination: From/Size, Deep Paging Issues, Scroll, Search After, PIT and Best Practices
Top Architect
Top Architect
May 9, 2022 · Big Data

Using Elasticsearch for File Upload, Text Extraction, and Keyword Search with Ingest Pipelines and IK Analyzer

This tutorial explains how to leverage Elasticsearch to support file upload and download, preprocess PDF/Word/TXT files via ingest pipelines and the attachment processor, configure index mappings with Chinese IK analyzers, and perform accurate keyword searches with highlighting, all demonstrated with Java code examples.

ElasticsearchIK AnalyzerIngest Pipeline
0 likes · 13 min read
Using Elasticsearch for File Upload, Text Extraction, and Keyword Search with Ingest Pipelines and IK Analyzer
Top Architect
Top Architect
May 4, 2022 · Big Data

Elasticsearch Overview: Architecture, Core Concepts, and Performance Optimization

This article provides a comprehensive introduction to Elasticsearch, covering data types, Lucene fundamentals, cluster architecture, node roles, shard and replica mechanisms, mapping, basic usage, health monitoring, indexing workflow, storage strategies, and practical performance tuning techniques.

Elasticsearchindexingperformance optimization
0 likes · 36 min read
Elasticsearch Overview: Architecture, Core Concepts, and Performance Optimization
Architect
Architect
Apr 27, 2022 · Big Data

File Upload, Download, and Keyword Search with Elasticsearch

This article demonstrates how to use Elasticsearch, along with plugins like ingest-attachment, Kibana, and Elasticsearch-head, to build a system that supports file upload and download, preprocesses various file types (txt, pdf, word), extracts text for precise keyword search, and highlights results using the ik analyzer.

ElasticsearchIK AnalyzerIngest Attachment
0 likes · 13 min read
File Upload, Download, and Keyword Search with Elasticsearch
Programmer DD
Programmer DD
Apr 27, 2022 · Backend Development

How to Implement Full-Text Search for Word, PDF, and TXT Files with Elasticsearch

This guide explains how to upload Word, PDF, and TXT files, preprocess them with Elasticsearch ingest pipelines, index their content using appropriate analyzers, and perform accurate keyword searches with highlighting, providing complete Java code examples and configuration steps.

ElasticsearchFull‑Text SearchIK Analyzer
0 likes · 13 min read
How to Implement Full-Text Search for Word, PDF, and TXT Files with Elasticsearch
Top Architect
Top Architect
Apr 26, 2022 · Backend Development

High‑Availability Architecture for a Membership System: Elasticsearch Dual‑Center Cluster, Redis Caching, and MySQL Migration

The article describes how a large‑scale membership system achieves high performance and fault tolerance by deploying a dual‑center Elasticsearch cluster, isolating traffic with multiple ES clusters, adding a Redis cache with distributed locks, migrating the primary relational store from SQL Server to a partitioned MySQL cluster, and implementing fine‑grained flow‑control and degradation strategies.

Elasticsearchmysqlredis
0 likes · 20 min read
High‑Availability Architecture for a Membership System: Elasticsearch Dual‑Center Cluster, Redis Caching, and MySQL Migration
HelloTech
HelloTech
Apr 25, 2022 · Big Data

Analyzing and Optimizing Slow Elasticsearch Queries in a Shared Cluster

In a shared Elasticsearch cluster, the team used slow‑log analysis to pinpoint costly queries caused by unnecessary fuzzy matches and integer‑mapped low‑cardinality fields, then optimized them by converting matches to filters and remapping those fields to keyword, re‑indexing, which cut latency from over 100 ms to under 10 ms and eliminated slow‑query alerts.

BackendElasticsearchperformance optimization
0 likes · 10 min read
Analyzing and Optimizing Slow Elasticsearch Queries in a Shared Cluster
Su San Talks Tech
Su San Talks Tech
Apr 17, 2022 · Backend Development

How Elasticsearch Powers Real-Time Search: Core Concepts and Best Practices

This article provides a comprehensive overview of Elasticsearch, explaining its underlying Lucene technology, data modeling, cluster architecture, shard and replica mechanisms, indexing workflow, storage strategies, refresh and translog processes, as well as practical performance and JVM tuning tips for building scalable, near‑real‑time search solutions.

Elasticsearchlucenesearch engine
0 likes · 37 min read
How Elasticsearch Powers Real-Time Search: Core Concepts and Best Practices
System Architect Go
System Architect Go
Apr 15, 2022 · Artificial Intelligence

Elasticsearch Vector Search: script_score and _knn_search Methods

This article explains Elasticsearch's vector search capabilities, detailing two approaches—script_score using dense_vector fields for exact similarity scoring and the experimental _knn_search for approximate nearest neighbor queries—along with data modeling examples, code snippets, performance considerations, and usage guidelines.

Elasticsearch_knn_searchdense_vector
0 likes · 6 min read
Elasticsearch Vector Search: script_score and _knn_search Methods
Top Architect
Top Architect
Apr 12, 2022 · Databases

Key New Features and Changes in Elasticsearch 8.0 Release

Elasticsearch 8.0 introduces major updates including 7.x REST API compatibility headers, default-enabled security with registration tokens, system index protection, a preview KNN search API using dense_vector, storage‑saving field mappings, faster geo indexing, PyTorch model support, and numerous deprecations and configuration changes across aggregations, allocation, analysis, authentication, and core infrastructure.

ElasticsearchRelease Notessearch engine
0 likes · 9 min read
Key New Features and Changes in Elasticsearch 8.0 Release
IT Architects Alliance
IT Architects Alliance
Apr 10, 2022 · Backend Development

Understanding Elasticsearch: Core Concepts, Architecture, and Performance Tips

This article provides a comprehensive overview of Elasticsearch, covering data types, Lucene fundamentals, cluster discovery, node roles, shard and replica management, mapping, installation, health monitoring, indexing mechanics, storage strategies, refresh and translog processes, segment merging, and practical performance optimizations for production deployments.

Distributed SystemsElasticsearchindexing
0 likes · 39 min read
Understanding Elasticsearch: Core Concepts, Architecture, and Performance Tips
Top Architect
Top Architect
Apr 9, 2022 · Big Data

Elasticsearch Overview: Architecture, Core Concepts, Indexing Mechanics, and Performance Optimization

This comprehensive article explains what Elasticsearch is, how it builds on Lucene to provide distributed real‑time search and analytics, covering data types, cluster components, shard routing, indexing pipelines, storage formats, segment merging, and practical performance‑tuning tips for production deployments.

Elasticsearchindexinglucene
0 likes · 36 min read
Elasticsearch Overview: Architecture, Core Concepts, Indexing Mechanics, and Performance Optimization
MaGe Linux Operations
MaGe Linux Operations
Apr 7, 2022 · Operations

Master ELK Stack: From Basics to Full‑Scale Log Management

This article introduces the ELK stack components, explains why centralized logging is essential, outlines core log‑system features, compares three ELK architectures, provides step‑by‑step installation and configuration for Filebeat, Logstash, Elasticsearch and Kibana, and demonstrates a complete pipeline using Kafka with code examples and diagrams.

ELKElasticsearchFilebeat
0 likes · 25 min read
Master ELK Stack: From Basics to Full‑Scale Log Management
Big Data Technology & Architecture
Big Data Technology & Architecture
Apr 5, 2022 · Big Data

Using ElasticsearchSink with Apache Flink: Configuration, Retry Strategies, and Failure Handling

This article introduces the ElasticsearchSink for Apache Flink, explains how to add Maven dependencies, implement the sink with configuration and retry settings, details failure handlers, and highlights important considerations such as exception handling and checkpoint requirements for reliable streaming pipelines.

Big DataElasticsearchFailure Handling
0 likes · 9 min read
Using ElasticsearchSink with Apache Flink: Configuration, Retry Strategies, and Failure Handling
21CTO
21CTO
Apr 3, 2022 · Backend Development

How We Achieved 20k+ TPS High Availability for a Billion‑User Membership System

This article details the design and implementation of a highly available, high‑performance membership system serving over a billion users, covering Elasticsearch dual‑center clusters, traffic isolation, Redis caching, MySQL migration, and fine‑grained flow‑control and degradation strategies.

ElasticsearchScalabilitySystem Architecture
0 likes · 21 min read
How We Achieved 20k+ TPS High Availability for a Billion‑User Membership System
dbaplus Community
dbaplus Community
Apr 1, 2022 · Databases

How iQIYI Built a Scalable OLTP Data Center to Eliminate Data Silos

This article details iQIYI's design and implementation of a unified OLTP data center that consolidates data across business lines, solves data‑island issues, ensures strong consistency between MongoDB and Elasticsearch, and provides high‑availability, massive‑scale storage for billions of records.

Data ArchitectureElasticsearchMongoDB
0 likes · 12 min read
How iQIYI Built a Scalable OLTP Data Center to Eliminate Data Silos
Code Ape Tech Column
Code Ape Tech Column
Apr 1, 2022 · Operations

ELK Stack Common Deployment Architectures and Practical Solutions

This article introduces the ELK stack components, compares three typical deployment architectures—including Logstash‑based, Filebeat‑based, and Kafka‑enhanced setups—and provides detailed configuration examples and solutions for multiline log merging, timestamp handling, and module‑specific filtering.

ELKElasticsearchFilebeat
0 likes · 10 min read
ELK Stack Common Deployment Architectures and Practical Solutions
Java Interview Crash Guide
Java Interview Crash Guide
Mar 31, 2022 · Backend Development

How We Achieved 20k TPS High‑Availability for a Billion‑User Membership System

This article details the design and implementation of a highly available, high‑performance membership system serving billions of users, covering Elasticsearch dual‑center clusters, traffic‑isolated architectures, deep ES optimizations, Redis caching with distributed locks, dual‑center MySQL partitioning, migration strategies, abnormal account handling, and future fine‑grained flow‑control and degradation policies.

Distributed SystemsElasticsearchScalability
0 likes · 20 min read
How We Achieved 20k TPS High‑Availability for a Billion‑User Membership System
IT Architects Alliance
IT Architects Alliance
Mar 22, 2022 · Backend Development

High‑Availability Architecture for a Membership System: Elasticsearch Dual‑Center, Redis Caching, MySQL Migration and Fine‑Grained Flow Control

This article details the design and implementation of a high‑availability membership system, covering Elasticsearch dual‑center master‑slave clusters, traffic‑isolated three‑cluster ES architecture, Redis multi‑center caching, MySQL dual‑center partitioning, data migration strategies, and refined flow‑control and degradation mechanisms to ensure stable, low‑latency service under massive concurrent load.

Distributed SystemsElasticsearchScalability
0 likes · 20 min read
High‑Availability Architecture for a Membership System: Elasticsearch Dual‑Center, Redis Caching, MySQL Migration and Fine‑Grained Flow Control
Top Architect
Top Architect
Mar 22, 2022 · Big Data

Elasticsearch Cluster Architecture and Data Layer Design

This article explains Elasticsearch's cluster architecture, including nodes, indices, shards, and replicas, compares mixed and tiered deployment models, discusses the data storage layer and replication trade‑offs, and presents two typical distributed data system designs with their advantages and drawbacks.

Cluster ArchitectureElasticsearchdata replication
0 likes · 14 min read
Elasticsearch Cluster Architecture and Data Layer Design
Top Architect
Top Architect
Mar 20, 2022 · Backend Development

High‑Availability Architecture for a Membership System: Elasticsearch Dual‑Center Cluster, Redis Caching, MySQL Migration, and Flow‑Control Strategies

The article details a comprehensive high‑availability solution for a large‑scale membership system, covering Elasticsearch dual‑center master‑slave clusters, traffic‑isolated three‑cluster designs, deep ES optimizations, Redis caching with consistency safeguards, MySQL partitioned migration, and fine‑grained flow‑control and degradation mechanisms.

ElasticsearchFlow Controlhigh availability
0 likes · 19 min read
High‑Availability Architecture for a Membership System: Elasticsearch Dual‑Center Cluster, Redis Caching, MySQL Migration, and Flow‑Control Strategies
Architecture Digest
Architecture Digest
Mar 18, 2022 · Backend Development

High‑Availability Architecture for a Membership System: Elasticsearch Dual‑Center Cluster, Redis Caching, and MySQL Migration

This article details the design and implementation of a high‑performance, highly available membership system, covering Elasticsearch dual‑center master‑slave clusters, traffic‑isolated three‑cluster ES architecture, Redis cache strategies, MySQL dual‑center partitioning, seamless migration, abnormal member handling, and fine‑grained flow‑control and degradation policies.

ElasticsearchFlow ControlSystem Architecture
0 likes · 20 min read
High‑Availability Architecture for a Membership System: Elasticsearch Dual‑Center Cluster, Redis Caching, and MySQL Migration
Top Architect
Top Architect
Mar 17, 2022 · Big Data

Elasticsearch Overview: Architecture, Core Concepts, and Performance Optimization

This comprehensive guide explains Elasticsearch fundamentals, including data types, Lucene and inverted indexes, cluster and node roles, shard and replica mechanisms, mapping, installation steps, health monitoring, write and storage processes, segment merging, and practical performance tuning tips for large‑scale search deployments.

ClusterElasticsearchinverted index
0 likes · 35 min read
Elasticsearch Overview: Architecture, Core Concepts, and Performance Optimization
Efficient Ops
Efficient Ops
Mar 16, 2022 · Operations

How to Build a Real-Time ELK Log Analysis Platform on Ubuntu

This tutorial walks you through the complete setup of an ELK (Elasticsearch, Logstash, Kibana) real‑time log analysis platform on Ubuntu, covering component installation, configuration for Spring Boot and Nginx logs, Grok parsing, and background service management with Supervisor.

ELKElasticsearchKibana
0 likes · 21 min read
How to Build a Real-Time ELK Log Analysis Platform on Ubuntu
Baidu Geek Talk
Baidu Geek Talk
Mar 16, 2022 · Backend Development

How Baidu’s Transaction Accounting System Handles Real‑Time Reconciliation

This article explains the design of Baidu's transaction accounting platform, covering business scenarios, the flow of transaction records, system architecture, real‑time data synchronization via Canal, Elasticsearch storage strategies, consistency guarantees, and aggregation techniques for accurate merchant financial reconciliation.

Backend ArchitectureCanalElasticsearch
0 likes · 14 min read
How Baidu’s Transaction Accounting System Handles Real‑Time Reconciliation
Selected Java Interview Questions
Selected Java Interview Questions
Mar 9, 2022 · Big Data

Elasticsearch Overview: Core Concepts, Architecture, and Performance Optimization

This article provides a comprehensive overview of Elasticsearch, covering its data types, Lucene-based inverted index, cluster architecture, sharding and replication mechanisms, mapping definitions, basic usage, health monitoring, storage internals, and practical performance tuning tips for large‑scale search deployments.

ClusterElasticsearchReplication
0 likes · 36 min read
Elasticsearch Overview: Core Concepts, Architecture, and Performance Optimization
dbaplus Community
dbaplus Community
Mar 8, 2022 · Operations

How We Built a 20k TPS Highly Available Membership System with ES, Redis, and MySQL

This article explains how a large‑scale membership platform achieved over 20,000 TPS with high performance and zero downtime by deploying dual‑center Elasticsearch clusters, traffic‑isolated three‑cluster ES architecture, Redis caching with distributed locks, and a seamless MySQL migration and HA strategy.

ElasticsearchSystem Architecturemysql
0 likes · 20 min read
How We Built a 20k TPS Highly Available Membership System with ES, Redis, and MySQL
NiuNiu MaTe
NiuNiu MaTe
Mar 8, 2022 · Backend Development

How a Junior Engineer Landed a Tencent Backend Offer: Interview Experience & Tips

This article shares a junior engineer's detailed journey of securing a backend engineering offer at Tencent, covering three interview rounds, key technical topics like Spring, HashMap, Elasticsearch, Redis, and personal reflections on strengths, weaknesses, and preparation strategies.

ElasticsearchTencentbackend-development
0 likes · 5 min read
How a Junior Engineer Landed a Tencent Backend Offer: Interview Experience & Tips
Ops Development Stories
Ops Development Stories
Feb 24, 2022 · Big Data

Master Elasticsearch: Core Concepts, APIs, Mapping, and Performance Tuning

This comprehensive guide explains Elasticsearch fundamentals—including documents, indices, nodes, clusters, REST and Document APIs, query DSL, relevance scoring, distributed architecture, real‑time indexing, search execution, pagination, scroll, aggregations, data modeling, mapping options, parent/child relationships, reindexing, and practical cluster and write/read performance optimizations.

Cluster TuningElasticsearchaggregation
0 likes · 58 min read
Master Elasticsearch: Core Concepts, APIs, Mapping, and Performance Tuning
Top Architect
Top Architect
Feb 21, 2022 · Databases

Key New Features in Elasticsearch 8.0

Elasticsearch 8.0 introduces major updates including 7.x REST API compatibility headers, default-enabled security with registration tokens, known issues on ARM/macOS, a preview KNN search API using dense_vector, storage reductions for keyword and text fields, faster geo indexing, PyTorch model support, and numerous other enhancements across aggregations, allocation, analysis, authentication, and core infrastructure.

ElasticsearchPyTorchVersion 8
0 likes · 10 min read
Key New Features in Elasticsearch 8.0