Tagged articles
1178 articles
iQIYI Technical Product Team
Aug 25, 2023 · Big Data

Venus Log Platform Architecture Evolution: From ELK to Data Lake

The Venus log platform at iQIYI migrated from an Elasticsearch‑Kibana architecture to an Iceberg‑based data lake with Trino, cutting storage and compute costs by over 70% and improving stability by 85% while handling billions of daily logs in a write‑heavy, low‑query workload.

Big Data · Elasticsearch · Iceberg
0 likes · 22 min read
政采云技术
Aug 17, 2023 · Backend Development

Traffic‑Based Quality Assurance Framework for Elasticsearch Search Service

This article presents a traffic‑driven quality assurance framework for an Elasticsearch‑based search service, detailing active and inactive code protection strategies, automated scenario generation from Dubbo logs, template fingerprinting, de‑duplication, expected result pools, and validation rules to ensure comprehensive test coverage and reliable regression.

Automated Testing · Elasticsearch · Search
0 likes · 13 min read
Code Ape Tech Column
Aug 15, 2023 · Operations

High‑Availability Architecture for a Billion‑Scale Membership System: Dual‑Center ES, Redis, and MySQL Solutions

This article details the design and implementation of a highly available, high‑performance membership system serving over a billion users, covering dual‑center Elasticsearch clusters, traffic‑isolated three‑cluster ES architecture, Redis dual‑center caching, MySQL partitioned clusters, migration strategies, and refined flow‑control and degradation mechanisms.

Distributed Systems · Elasticsearch · high availability
0 likes · 20 min read
Code Ape Tech Column
Aug 11, 2023 · Big Data

Elasticsearch Pagination: From/Size, Deep Paging Issues, and Alternative Methods (Scroll, Search After, PIT)

This article explains how Elasticsearch pagination works with from/size, why deep paging can cause performance problems, and compares alternative techniques such as Scroll, Scroll‑Scan, Sliced Scroll, Search After, and point‑in‑time (PIT) searches for handling large result sets efficiently.

Deep Paging · Elasticsearch · pagination
0 likes · 17 min read
Didi Tech
Aug 10, 2023 · Information Security

Security Hardening and Architecture of Didi's Elasticsearch Deployment

Didi hardened its massive Elasticsearch deployment—spanning 66 clusters and thousands of nodes—by adding a custom security plugin that authenticates requests at the cluster level, implementing a one‑click toggle and staged rolling upgrades, ultimately enabling authentication across all clusters and dramatically reducing data‑leak risk.

Authentication · Data Protection · Didi
0 likes · 12 min read
ITPUB
Aug 9, 2023 · Operations

Why Is My Elasticsearch Cluster Using 15 GB Heap? A Deep Dive into Memory Bottlenecks

The article examines a 7‑node Elasticsearch cluster with 500 million documents, uncovering excessive heap usage, high OS memory pressure, numerous deleted documents, large translog, low query‑cache hit rate, and an over‑sharded design, then offers concrete tuning and redesign recommendations to restore performance.

Elasticsearch · Memory Optimization · cluster operations
0 likes · 16 min read
Java Interview Crash Guide
Aug 8, 2023 · Operations

How We Built 99.99% High Availability for a Billion‑User Membership System

This article details the end‑to‑end high‑availability architecture—including dual‑center Elasticsearch clusters, Redis caching with distributed locks, and a dual‑center MySQL partitioned setup—that enables a membership platform serving billions of users to sustain massive traffic while ensuring data consistency and rapid recovery.

Elasticsearch · Scalability · System Architecture
0 likes · 21 min read
Didi Tech
Aug 7, 2023 · Backend Development

How Didi Achieved Cross‑Datacenter Elasticsearch Replication for Strong Consistency

This article explains Didi's self‑developed DCDR system that replicates Elasticsearch indices across data‑center clusters, detailing its design goals, core mechanisms, chain construction, historical data recovery, real‑time sync, and data‑quality validation to ensure high availability and strong consistency.

Cross‑Datacenter Replication · DCDR · Data Consistency
0 likes · 15 min read
MaGe Linux Operations
Aug 5, 2023 · Databases

Elasticsearch vs ClickHouse: Architecture, Queries, and Performance

This article compares Elasticsearch and ClickHouse by examining their underlying architectures, node roles, query languages, and performance through a series of benchmark tests using Docker‑compose, Vector data pipelines, and Python SDKs, revealing ClickHouse’s superior speed in most query scenarios despite lacking advanced search features.

Elasticsearch · clickhouse
0 likes · 12 min read
Efficient Ops
Aug 2, 2023 · Databases

Why ClickHouse Outperforms Elasticsearch in Real‑World Queries

This article compares Elasticsearch and ClickHouse across architecture, query capabilities, and performance using Docker‑compose stacks and Python SDK tests, demonstrating that ClickHouse often delivers superior speed, especially in aggregation and regex queries, while highlighting each system’s design trade‑offs.

Docker Compose · Elasticsearch · clickhouse
0 likes · 13 min read
Architect
Jul 28, 2023 · Databases

Which MySQL‑to‑Elasticsearch Sync Method Wins? 4 Solutions & Tool Picks

The article compares four MySQL‑to‑Elasticsearch synchronization approaches—synchronous dual‑write, asynchronous dual‑write via MQ, timer‑based SQL extraction, and binlog‑based real‑time sync—evaluates their trade‑offs, and recommends practical tools such as Canal, DTS, and Databus for implementation.

Binlog · Canal · DTS
0 likes · 13 min read
Top Architect
Jul 27, 2023 · Big Data

Performance Comparison of Elasticsearch and ClickHouse for Log Search

This article compares Elasticsearch and ClickHouse as log‑search solutions, detailing their architectures, Docker‑compose deployments, data‑ingestion pipelines with Vector, query syntax differences, and benchmark results that show ClickHouse generally outperforms Elasticsearch in speed and aggregation efficiency.

Big Data · Docker · Elasticsearch
0 likes · 13 min read
SQB Blog
Jul 20, 2023 · Artificial Intelligence

How We Built and Optimized a Multi‑Pool Recommendation System for Boss Circle

This article explains the design, implementation, and iterative optimization of Boss Circle's recommendation engine, covering the initial simple ranking, the introduction of Elasticsearch‑based scoring, multi‑pool data sources, machine‑learning experiments, real‑time feature handling, and future personalization challenges.

Elasticsearch · data pipelines · personalization
0 likes · 17 min read
Top Architect
Jul 18, 2023 · Fundamentals

Comprehensive Introduction to Elasticsearch: Core Concepts, Architecture, and Practical Usage

This article provides a detailed overview of Elasticsearch, covering its underlying Lucene technology, data types, indexing mechanisms, cluster architecture, shard and replica management, mapping definitions, installation steps, health monitoring, write and storage processes, and performance optimization techniques for production deployments.

Elasticsearch · Replication · indexing
0 likes · 36 min read
Architect
Jul 17, 2023 · Databases

Performance Comparison of Elasticsearch and ClickHouse for Log Search and Analytics

This article compares Elasticsearch and ClickHouse by describing their architectures, presenting Docker‑based test stacks, showing code snippets for deployment, data ingestion, and queries, and reporting performance results that demonstrate ClickHouse generally outperforms Elasticsearch in log‑analytics scenarios.

Docker · Elasticsearch · Log Analytics
0 likes · 12 min read
Efficient Ops
Jul 16, 2023 · Operations

Mastering ELK: Deploy Architectures, Multiline Logs, and Kibana Tips

This guide explains the three main ELK deployment architectures, compares Logstash and Filebeat collectors, introduces a cache‑queue option for high‑volume logs, and provides practical solutions for multiline log merging, timestamp correction, and module‑level filtering in Kibana, helping operations teams build efficient log pipelines.

ELK · Elasticsearch · Filebeat
0 likes · 10 min read
Programmer DD
Jul 15, 2023 · Operations

Why Is My Elasticsearch Cluster Using 99% Memory? Sharding, Translog & JVM Insights

This article analyzes a 7‑node Elasticsearch cluster with 500 million documents, revealing excessive shard count, high heap and OS memory usage, large translog, low query‑cache hit rate, and heavy I/O, and offers concrete recommendations on mapping, sharding, JVM tuning, and resource management to restore performance.

Cluster Monitoring · Elasticsearch · JVM
0 likes · 17 min read
Selected Java Interview Questions
Jul 15, 2023 · Operations

High‑Availability Architecture for a Large‑Scale Membership System

The article describes how a membership system serving billions of users across multiple platforms achieves high performance and high availability through dual‑center Elasticsearch clusters, traffic‑isolated three‑cluster ES architecture, Redis caching with distributed locks, dual‑center MySQL partitioning, and fine‑grained flow‑control and degradation strategies.

Backend Architecture · Distributed Systems · Elasticsearch
0 likes · 25 min read
ITPUB
Jul 12, 2023 · Databases

How We Migrated a Multi‑Petabyte Elasticsearch Cluster Across Data Centers Without Downtime

This article details the end‑to‑end process of moving Qunar's massive Elasticsearch logging cluster from a saturated data‑center to a new facility, covering background constraints, migration planning, manual and automated steps, performance‑tuning parameters, shard‑balancing techniques, and the final outcomes achieved.

Cluster Migration · Elasticsearch · Shard Allocation
0 likes · 21 min read
Architects Research Society
Jul 8, 2023 · Backend Development

System Design of Hotel Booking Applications (Airbnb, Booking.com, OYO)

This article explains how large hotel‑booking platforms such as Airbnb, Booking.com and OYO use a micro‑service architecture—including hotel management, customer search/booking, and view‑booking services—combined with load balancers, Kafka, Elasticsearch, Redis, Cassandra and Hadoop to achieve a seamless, high‑throughput booking flow.

Backend Architecture · Elasticsearch · Kafka
0 likes · 7 min read
Spring Full-Stack Practical Cases
Jul 6, 2023 · Databases

Master Spring Boot 2.3 with Elasticsearch 7.8: Full CRUD and Advanced Queries

This guide demonstrates how to integrate Spring Boot 2.3.10 with Elasticsearch 7.8 using the high‑level REST client, covering required Maven dependencies, index creation, retrieval, deletion, document CRUD operations, bulk processing, and a variety of advanced search techniques such as paging, sorting, filtering, range, highlighting, aggregations, and grouping.

CRUD · Elasticsearch · Search
0 likes · 12 min read
Spring Full-Stack Practical Cases
Jul 3, 2023 · Backend Development

Master Spring Boot 2.3 with Elasticsearch 7: Full Integration Guide

This guide demonstrates how to integrate Spring Boot 2.3.10 with Elasticsearch 7.8, covering Maven dependencies, application configuration, data model annotations, repository interfaces, custom query methods, and comprehensive JUnit tests using both repository and ElasticsearchRestTemplate approaches.

Elasticsearch · Spring Boot · backend-development
0 likes · 6 min read
Architects Research Society
Jul 2, 2023 · Databases

PostgreSQL Full-Text Search vs Elasticsearch: Performance Comparison and Findings

This article compares PostgreSQL and Elasticsearch for full‑text search in Django applications, presenting performance benchmarks on up to 1.5 million records, discussing setup complexity, features like stemming and fuzziness, and concluding that PostgreSQL is preferable for most projects without massive data scales.

Django · Elasticsearch · Full‑Text Search
0 likes · 7 min read
Programmer DD
Jun 26, 2023 · Databases

Unlock Lightning-Fast Search: Proven Elasticsearch Performance Tuning Tips

This article presents comprehensive best‑practice recommendations for optimizing Elasticsearch deployments, covering hardware selection, RAID choices, index and shard planning, query and caching strategies, bulk indexing, refresh intervals, monitoring tools, version upgrades, and lifecycle management to achieve high performance, reliability, and scalability.

Elasticsearch · indexing · performance tuning
0 likes · 18 min read
JD Cloud Developers
Jun 15, 2023 · Databases

Mastering Distributed Join Queries: MySQL Sharding-JDBC and Elasticsearch Strategies

This article explores the challenges of distributed join queries, detailing MySQL sharding‑jdbc join implementation, routing strategies, and code examples, then examines Elasticsearch‑SQL join capabilities, various join algorithms, and practical considerations for using nested types, offering insights for optimizing performance in distributed data environments.

Distributed · Elasticsearch · mysql
0 likes · 11 min read
Open Source Linux
Jun 9, 2023 · Backend Development

How We Built a High‑Availability Membership System for Billions of Users

This article details the design and implementation of a highly available, high‑performance membership platform serving over a billion users, covering Elasticsearch dual‑center clusters, traffic‑isolated clusters, deep ES optimizations, Redis caching strategies, MySQL dual‑center partitioning, seamless migration, and fine‑grained flow‑control and degradation mechanisms.

Elasticsearch · caching · high availability
0 likes · 21 min read
Tongcheng Travel Technology Center
Jun 6, 2023 · Operations

Root Cause Analysis and GC Parameter Optimization for Elasticsearch OOM Issues in the Membership Service

This article details a comprehensive investigation of an out‑of‑memory crash in a critical Elasticsearch cluster, explains how GC logs and heap dumps revealed a to‑space‑exhausted condition, and describes the G1GC tuning parameters that eliminated the nightly spikes and stabilized performance.

Backend · Elasticsearch · OOM
0 likes · 9 min read
MaGe Linux Operations
May 31, 2023 · Operations

How We Achieved 20k TPS High‑Availability for a Billion‑User Membership System

This article details the design and implementation of a highly available, high‑performance membership system that serves over a billion users, covering Elasticsearch dual‑center HA, traffic‑isolated clusters, Redis caching, MySQL dual‑center partitioning, seamless migration, and refined flow‑control and degradation strategies.

Elasticsearch · System Architecture · high availability
0 likes · 19 min read
Laravel Tech Community
May 28, 2023 · Big Data

Elasticsearch 8.8.0 Release Notes: Bug Fixes, Deprecations, and New Features

Elasticsearch 8.8.0, the latest release of the Lucene‑based distributed search engine, introduces numerous bug fixes across aggregations, allocation, application and authorization, deprecates certain allocation settings, and adds new capabilities such as templated search APIs, JWT authentication, DLM enhancements, health metrics, ingest node licensing checks, machine‑learning query extensions, ranking improvements, search enhancements, and TSDB support.

Elasticsearch · big-data · bug-fix
0 likes · 5 min read
Zhuanzhuan Tech
May 26, 2023 · Backend Development

ECP (Elasticsearch Chain Planning) System: Design, Features, and Implementation for Efficient Index Management

The article introduces the ECP system, a backend platform built on Elasticsearch that standardizes, automates, and visualizes index refresh workflows, addressing manual bottlenecks, data cleaning challenges, and coupling issues while providing task management, permission control, and environment isolation for high‑efficiency index operations.

Elasticsearch · Index Management · automation
0 likes · 12 min read
Architecture Digest
May 16, 2023 · Backend Development

High‑Availability Architecture for a Membership System: Dual‑Center ES Cluster, Redis Caching, MySQL Migration, and Fine‑Grained Flow Control

This article presents a comprehensive engineering case study of a high‑traffic membership system, detailing the dual‑center Elasticsearch high‑availability design, traffic‑isolated three‑cluster ES architecture, Redis caching strategy, dual‑center MySQL partitioning and migration plan, abnormal member relationship governance, and future fine‑grained flow‑control and downgrade policies.

Backend Architecture · Data Migration · Elasticsearch
0 likes · 19 min read
ITPUB
May 15, 2023 · Big Data

Why ClickHouse Outperforms Elasticsearch in Log Analytics: A Practical Comparison

This article compares Elasticsearch and ClickHouse for log analytics by detailing their architectures, setting up Docker‑Compose stacks, ingesting synthetic syslog data with Vector, running equivalent queries, and measuring performance, revealing ClickHouse’s superior speed in most scenarios.

Docker · Elasticsearch · Log Analytics
0 likes · 13 min read
DeWu Technology
May 8, 2023 · Databases

Optimizing Elasticsearch Search Performance with Index Sorting

By defining index sorting on the publish_time field when creating the Elasticsearch index, the team transformed a multi‑second full‑scan query into a sub‑50 ms operation, demonstrating that pre‑ordered storage dramatically speeds up large‑result‑set sorts while modestly affecting write throughput.

DocValues · Elasticsearch · Index Sorting
0 likes · 12 min read
Efficient Ops
May 7, 2023 · Databases

Elasticsearch vs ClickHouse: Performance Comparison for Log Analytics

This article compares Elasticsearch and ClickHouse as log‑analytics solutions, detailing their architectures, node roles, data ingestion pipelines, query capabilities, and benchmark results, ultimately showing ClickHouse’s superior performance in most tested scenarios.

Docker · Elasticsearch · Log Analytics
0 likes · 13 min read
Java High-Performance Architecture
May 7, 2023 · Backend Development

How We Built a Billion‑User High‑Availability Membership System with Dual‑Center ES, Redis, and MySQL

This article details the design and implementation of a high‑performance, highly available membership platform serving billions of users, covering dual‑center Elasticsearch clusters, traffic‑isolated three‑cluster ES architecture, Redis caching strategies, MySQL dual‑center partitioning, seamless migration, and fine‑grained flow‑control and degradation mechanisms.

Backend Engineering · Elasticsearch · Scalable Design
0 likes · 21 min read
ITPUB
May 2, 2023 · Backend Development

Four Ways to Sync MySQL Data to Elasticsearch – Pros, Cons, and Tools

This article compares four common approaches for synchronizing MySQL data to Elasticsearch—synchronous dual write, asynchronous dual write via message queues, scheduled tasks, and binlog‑based data subscription—detailing their advantages, drawbacks, implementation steps, and tool choices such as Canal, Maxwell, and Python‑MySQL‑Replication.

Canal · Elasticsearch · Message Queue
0 likes · 6 min read
Top Architect
Apr 26, 2023 · Databases

Comparative Performance and Feature Analysis of Elasticsearch vs ClickHouse

This article presents a practical comparison between Elasticsearch and ClickHouse, detailing their architectures, Docker‑Compose deployment, data ingestion pipelines, a series of representative queries, and benchmark results that show ClickHouse generally outperforms Elasticsearch in basic search and aggregation scenarios.

Docker Compose · Elasticsearch · Vector
0 likes · 14 min read
Architect
Apr 23, 2023 · Big Data

Performance Comparison of Elasticsearch and ClickHouse for Log Analytics

This article compares Elasticsearch and ClickHouse by describing their architectures, demonstrating a Docker‑compose test environment, executing equivalent queries via both systems, and presenting performance results that show ClickHouse generally outperforms Elasticsearch in basic search and aggregation scenarios for log data.

Elasticsearch · Search · clickhouse
0 likes · 12 min read
Code Ape Tech Column
Apr 19, 2023 · Databases

Comparative Analysis of Elasticsearch and ClickHouse: Architecture, Query Performance, and Practical Benchmarks

This article compares Elasticsearch and ClickHouse by outlining their architectures, detailing deployment configurations, presenting benchmark queries and performance results, and concluding that ClickHouse generally outperforms Elasticsearch in many basic search and aggregation scenarios, while also noting each system's strengths and limitations.

Big Data · Elasticsearch · clickhouse
0 likes · 13 min read
Alibaba Cloud Big Data AI Platform
Apr 13, 2023 · Cloud Native

How Alibaba Cloud Made Elasticsearch Fully Serverless: Architecture & Performance Gains

Alibaba Cloud unveiled the fully serverless Elasticsearch architecture at the 2023 Elastic China Developer Conference, highlighting cloud‑native upgrades, the OpenStore storage engine, and TimeStream time‑series enhancements that dramatically improve cost efficiency, scalability, and query performance.

Alibaba Cloud · Cloud Native · Elasticsearch
0 likes · 5 min read
Java Architect Essentials
Apr 12, 2023 · Operations

High‑Availability Architecture for a Billion‑Scale Membership System

This article details the design and implementation of a high‑availability, billion‑scale membership system, covering Elasticsearch dual‑center clusters, traffic‑isolated architectures, deep ES optimizations, Redis caching strategies, MySQL migration with dual‑center partitioning, abnormal member relationship handling, and future fine‑grained flow‑control and degradation plans.

Distributed Systems · Elasticsearch · Flow Control
0 likes · 20 min read
Software Development Quality
Apr 4, 2023 · Operations

How Traffic Recording & Replay Boosts Test Efficiency at China Agricultural Bank

This article explains the challenges of testing in a rapidly evolving banking system and introduces a traffic recording and replay platform that automates test case generation, improves coverage, and reduces manual effort through web‑server, network‑stack, and application‑level recording techniques.

Elasticsearch · Software Testing · replay testing
0 likes · 10 min read
HelloTech
Apr 3, 2023 · Artificial Intelligence

Integrating Machine Learning with Elasticsearch for Enhanced Ranking Capabilities

At the 2023 Elastic China Developer Conference in Shenzhen, Peng Cheng of Hello Technology will demonstrate how migrating online machine‑learning predictions into Elasticsearch can exploit its distributed architecture to rank thousands of models, expand model types and computational depth, and unlock new growth opportunities for business applications. The event is billed as China’s premier Elasticsearch open‑source technology forum.

Elasticsearch · Technical Conference · machine learning
0 likes · 2 min read
Java High-Performance Architecture
Mar 24, 2023 · Backend Development

Explore Echo: Open-Source Java Community Platform & Deployment Guide

Echo is a full‑stack open‑source Java community system built with Spring Boot, MyBatis, MySQL, Redis, Kafka and Elasticsearch, offering modules like posts, comments and notifications, and the article provides its core tech stack, development environment, local setup steps, deployment architecture, demo screenshots and source code access.

Elasticsearch · Kafka · Spring Boot
0 likes · 5 min read
Su San Talks Tech
Mar 14, 2023 · Databases

How to Solve Read‑Amplification in MySQL Sharding: Strategies & Tips

This article explains the concepts of database sharding, compares vertical and horizontal partitioning, discusses ID‑range and modulo sharding methods, analyzes the read‑amplification problem caused by non‑sharding keys, and presents practical solutions such as auxiliary index tables, Elasticsearch integration, and TiDB adoption.

Elasticsearch · TiDB · database scaling
0 likes · 14 min read
Zhuanzhuan Tech
Mar 2, 2023 · Backend Development

Location-Based Services (LBS) in On‑Site Fulfillment: Geofencing, Order Allocation, Routing and Autonomous Scheduling

This article explains how a location‑based service platform uses geofencing, R‑tree indexing, ray‑casting algorithms, and Elasticsearch GEO queries to determine service eligibility, allocate on‑site engineers, plan routes, and enable autonomous order scheduling for a C2B recycling business.

Elasticsearch · Geofencing · LBS
0 likes · 12 min read
Su San Talks Tech
Feb 27, 2023 · Big Data

How to Build Near Real-Time Elasticsearch Indexes for PB-Scale Data

This article explains how to construct near real-time Elasticsearch indexes for petabyte‑scale datasets by comparing MySQL limitations, introducing ES fundamentals, leveraging Hive and wide tables, and employing binlog‑based tools like Canal and Otter for low‑latency data synchronization.

Canal · Elasticsearch · Otter
0 likes · 22 min read
Architect's Guide
Feb 25, 2023 · Big Data

Elasticsearch Optimization and Performance Tuning for Billion‑Scale Data

This article documents the evolution of a data platform, explains Elasticsearch and Lucene fundamentals, and presents practical index and search performance optimizations—including bulk writes, refresh control, memory allocation, doc‑values tuning, and pagination strategies—that enable cross‑month queries and sub‑second responses on billions of records.

Elasticsearch · lucene · performance tuning
0 likes · 11 min read
Top Architect
Feb 22, 2023 · Backend Development

High‑Availability Architecture for a Billion‑User Membership System: ES Dual‑Center Clusters, Traffic Isolation, Redis Caching, and MySQL Migration

The article describes how a large‑scale membership system serving over a billion users achieves high performance and availability through dual‑center Elasticsearch clusters, traffic‑isolated three‑cluster designs, Redis caching with distributed locks, and a seamless migration from SQL Server to sharded MySQL, while also detailing operational safeguards and fine‑grained flow‑control strategies.

Elasticsearch · System Architecture · mysql
0 likes · 20 min read
TAL Education Technology
Feb 16, 2023 · Big Data

Step‑by‑Step Guide to Syncing Canal Data to Elasticsearch

This article provides a comprehensive, hands‑on tutorial for configuring Alibaba Canal and its client‑adapter to capture MySQL binlog changes and synchronize them into Elasticsearch, covering environment setup, Docker commands, YAML configuration files, index mapping, adapter startup, and common troubleshooting scenarios.

CDC · Canal · Configuration
0 likes · 26 min read
DeWu Technology
Feb 15, 2023 · Backend Development

E-commerce Product Ranking System Migration: Technical Implementation and Storage Optimization

The article describes how an e‑commerce product ranking system was migrated to the new “Liao Yue” platform and decoupled from the search module, introducing fresh metrics and Elasticsearch‑based sorting. Storage was then optimized by separating B‑end and C‑end data, cutting costs by 60%. A gray‑scale rollout with dual‑read validation and rollback safeguards completed the migration in two weeks with zero failures, delivering a closed‑loop system that iterates faster.

Elasticsearch · Storage Optimization · backend-development
0 likes · 15 min read
Code Ape Tech Column
Feb 14, 2023 · Backend Development

High‑Availability Architecture for a Billion‑Scale Membership System: Elasticsearch Dual‑Center Cluster, Redis Caching, and MySQL Migration

This article describes how a membership platform serving over a billion users achieves high performance and fault tolerance through a dual‑center Elasticsearch cluster, traffic‑isolated three‑cluster ES design, Redis multi‑center caching, and a seamless migration from SQL Server to a partitioned MySQL architecture, while detailing operational safeguards and fine‑grained flow‑control strategies.

Elasticsearch · Operations · Scalability
0 likes · 23 min read
21CTO
Feb 11, 2023 · Big Data

Mastering Elasticsearch: Core Concepts, Architecture, and Performance Tips

This comprehensive guide explains what Elasticsearch does and how it builds on Lucene, covers core concepts such as clusters, shards, replicas, mappings, and indexing and storage mechanisms, and offers practical performance‑tuning advice for building and operating a robust distributed search engine.

Cluster · Elasticsearch · Mapping
0 likes · 35 min read
Su San Talks Tech
Feb 1, 2023 · Backend Development

How to Sync MySQL Data to Elasticsearch: 4 Practical Strategies

This article explores four common approaches for synchronizing product data from MySQL to Elasticsearch in e‑commerce systems—synchronous dual write, asynchronous dual write with message queues, scheduled tasks, and binlog‑based data subscription—detailing their advantages, drawbacks, and implementation considerations.

Canal · Elasticsearch · Message Queue
0 likes · 7 min read
Java Architect Essentials
Jan 31, 2023 · Big Data

Optimizing Large-Scale Data Retrieval: ClickHouse Pagination, Elasticsearch Scroll Scan, ES+HBase, and RediSearch + RedisJSON Solutions

This article examines a business requirement to filter and rank up to 100,000 records from a pool of tens of millions, presenting and evaluating four technical solutions—multithreaded ClickHouse pagination, Elasticsearch scroll‑scan deep paging, an ES‑HBase combined query, and a RediSearch + RedisJSON approach—along with performance data and code examples.

Elasticsearch · HBase · PerformanceOptimization
0 likes · 12 min read
macrozheng
Jan 31, 2023 · Backend Development

How to Sync MySQL Data to Elasticsearch: 4 Practical Strategies

This article explores four common approaches for synchronizing product data from MySQL to Elasticsearch—including synchronous dual writes, asynchronous messaging, scheduled jobs, and binlog‑based data subscription—detailing their advantages, drawbacks, and implementation considerations for e‑commerce search systems.

Elasticsearch · Message Queue · data synchronization
0 likes · 6 min read
ITPUB
Jan 30, 2023 · Backend Development

How to Keep MySQL and Elasticsearch in Sync: 4 Practical Strategies

This article examines four common approaches for synchronizing product data from MySQL to Elasticsearch—synchronous dual write, asynchronous dual write with message queues, scheduled batch jobs, and binlog‑based data subscription—detailing their advantages, drawbacks, and implementation considerations.

Canal · Elasticsearch · Message Queue
0 likes · 6 min read
Architect
Jan 26, 2023 · Backend Development

Optimizing Elasticsearch for High‑Concurrency LBS Search with an RLE‑Based Inverted Index

This article details Meituan's search‑engine optimization for its food‑delivery platform, describing the performance bottlenecks of Elasticsearch's inverted‑list query and merge phases, the design of a run‑length‑encoding (RLE) index, custom hash‑map term look‑ups, sparse RoaringBitmap structures, integration steps, and the resulting 84% latency reduction.

Elasticsearch · RLE · inverted index
0 likes · 27 min read
MaGe Linux Operations
Jan 19, 2023 · Operations

Mastering ELK: Compare Three Log Collection Architectures and Solve Common Issues

This article introduces the ELK stack’s core components, compares three typical deployment architectures—including Logstash‑only, Filebeat‑assisted, and Kafka‑backed designs—highlights their trade‑offs, and provides practical solutions for multiline log merging, timestamp correction, and module‑specific filtering using Logstash and Filebeat configurations.

ELK · Elasticsearch · Filebeat
0 likes · 11 min read
Ctrip Technology
Jan 12, 2023 · Big Data

Evolution of Ctrip's Log System: From Elasticsearch to ClickHouse and Log 3.0

This article details the evolution of Ctrip's log infrastructure, describing the shift from fragmented departmental logging to a unified Elasticsearch-based platform, the migration to ClickHouse for cost‑effective, high‑performance storage, and the subsequent Log 3.0 redesign that leverages Kubernetes, sharding, and a unified query governance layer to handle petabyte‑scale data.

Big Data · Cloud Native · ETL
0 likes · 16 min read
Top Architect
Jan 11, 2023 · Big Data

Comprehensive Elasticsearch Query Guide with Java High Level REST Client Examples

This tutorial provides a thorough overview of Elasticsearch 7.13.2 query techniques—including term, terms, range, prefix, wildcard, boolean, filter, and aggregation queries—showing both raw DSL syntax and equivalent Java High Level REST Client code, enabling developers to quickly master search operations on large datasets.

Elasticsearch · Search · aggregation
0 likes · 16 min read
Java High-Performance Architecture
Jan 5, 2023 · Databases

Scaling Billions of Orders: MySQL Sharding, ES & Hive Strategies

This article explains how to handle massive order volumes by classifying data into hot and cold tiers, storing them in MySQL, Elasticsearch, and Hive, and implementing sharding and partitioning strategies—including shard keys, modulo routing, and combined database‑table distribution—to achieve high throughput and low cost.

Elasticsearch · database scaling · hive
0 likes · 8 min read
FunTester
Jan 4, 2023 · Backend Development

Mastering Elasticsearch Java REST Client: From Raw HTTP to a Ready‑to‑Use Wrapper

This tutorial walks through setting up an Elasticsearch server, creating a Java REST client, implementing a custom GET‑with‑body request, building a reusable ESClient wrapper with common CRUD and search methods, and providing Groovy performance test cases for indexing and querying.

Elasticsearch · Groovy · Performance Testing
0 likes · 12 min read
Code Ape Tech Column
Jan 3, 2023 · Big Data

Elasticsearch vs ClickHouse: Performance, Cost, and Deployment Guide

This article compares Elasticsearch and ClickHouse in terms of write throughput, query speed, and server cost, then provides a step‑by‑step deployment guide for a private data pipeline using Zookeeper, Kafka, FileBeat, and ClickHouse, along with common issues and their solutions.

Big Data · Deployment · Elasticsearch
0 likes · 15 min read
Architect
Dec 30, 2022 · Databases

Database Sharding and Partitioning Strategy for High‑Volume Order Systems

The article explains how to handle billions of daily orders by classifying data into hot and cold segments, storing them in MySQL, Elasticsearch, and Hive, and applying sharding and partitioning techniques at both table and database levels to achieve scalable performance.

Data Partitioning · Elasticsearch · database sharding
0 likes · 9 min read
MaGe Linux Operations
Dec 21, 2022 · Operations

Mastering Elasticsearch Nodes: Types, Roles, and Scaling Strategies

This guide explains the different Elasticsearch node types, their default roles, how to configure master‑eligible, data, ingest, and coordinating‑only nodes, and provides best‑practice recommendations for planning and scaling large clusters to ensure stability and performance.

Cluster Configuration · Coordinating Node · Data Node
0 likes · 12 min read
Architect
Dec 19, 2022 · Databases

Understanding Elasticsearch DSL Query Syntax (7.x)

This article provides a comprehensive guide to Elasticsearch 7.x DSL query syntax, explaining core keywords, field mappings, various query types such as match, term, range, fuzzy, and bool, and includes practical code examples for building effective search queries.

DSL · Elasticsearch · Elasticsearch7
0 likes · 8 min read
php Courses
Dec 9, 2022 · Databases

Elasticsearch Index and Document Operations Tutorial

This tutorial explains how to create, query, update, and delete Elasticsearch indices and documents using RESTful HTTP requests, covering basic CRUD operations, various query types, pagination, sorting, aggregations, highlighting, and mapping definitions with practical JSON examples.

Big Data · Elasticsearch · JSON
0 likes · 8 min read
dbaplus Community
Dec 6, 2022 · Backend Development

How Meituan Cut Elasticsearch Search Latency by 84% with an RLE‑Based Inverted Index

This article details how Meituan's search‑engine team optimized Elasticsearch for a high‑traffic LBS scenario, describing the performance bottlenecks in term‑posting retrieval and merging, the design of a run‑length‑encoding (RLE) inverted index, its integration as a plugin, extensive benchmarking, and the resulting 84% reduction in TP99 query latency.

Backend Search · Elasticsearch · RoaringBitmap
0 likes · 25 min read
Java High-Performance Architecture
Dec 5, 2022 · Backend Development

How to Build a High‑Performance, Highly Available Membership System with ES, Redis, and MySQL

This article explains how a large‑scale membership system achieves high performance and high availability by using a dual‑center Elasticsearch cluster, traffic‑isolated three‑cluster architecture, Redis caching with dual‑center clusters, and a MySQL partitioned dual‑center setup, while also detailing optimization, migration, and fine‑grained flow‑control strategies.

Backend Architecture · Elasticsearch · Scalability
0 likes · 21 min read
Architects Research Society
Dec 3, 2022 · Databases

Solr vs Elasticsearch: Choosing the Right Search Engine for Your Organization

This article compares Solr and Elasticsearch, examining their cloud, analytics, and cognitive search capabilities, and provides guidance on selecting the most suitable engine based on factors such as deployment complexity, resource requirements, scalability, integration with Hadoop ecosystems, and specific organizational use cases.

Big Data · Comparison · Elasticsearch
0 likes · 9 min read
Tencent Cloud Developer
Dec 2, 2022 · Big Data

Design and Implementation of a Hundred‑Billion‑Scale Real‑Time Monitoring System

The paper presents the design and deployment of a hundred‑billion‑scale real‑time monitoring platform that meets stringent data‑collection, analysis, storage, alerting and visualization requirements, compares Oceanus + Elastic Stack against a Zabbix‑Prometheus‑Grafana stack, selects the former, and details performance and cost optimizations that enable massive, low‑latency monitoring while maintaining high availability.

Elasticsearch · Flink · Oceanus
0 likes · 20 min read
DataFunTalk
Nov 27, 2022 · Operations

Best Practices for Full‑Stack Operations Monitoring and Cost Reduction Using Alibaba Cloud Elasticsearch

This article presents a comprehensive, three‑part guide on the current state of full‑stack operations monitoring, common challenges and solutions, and a real‑world use case, illustrating how Alibaba Cloud Elasticsearch can improve observability, boost performance, and cut costs for complex distributed systems.

Cost Optimization · Elasticsearch · Operations
0 likes · 13 min read
21CTO
Nov 20, 2022 · Big Data

How Meituan’s Logan Real‑Time Log System Boosts Debugging Across Mobile, Web, and IoT

This article details the design, architecture, and implementation of Meituan's Logan real‑time logging platform, covering its workflow, multi‑terminal collection SDK, ingestion, Flink‑based processing, consumption layers, stability measures, and future roadmap, illustrating how it improves fault diagnosis and system reliability.

Elasticsearch · Flink · Kafka
0 likes · 18 min read
Meituan Technology Team
Nov 17, 2022 · Backend Development

Elasticsearch Query and Merge Optimization Using Run-Length Encoding for Meituan Takeaway Search

Meituan's food‑delivery search team identified heavy CPU and latency hotspots in Elasticsearch's posting‑list query and merge phases, then redesigned the inverted index using Run‑Length Encoding, hash‑based term lookup, index sorting and a custom SparseRoaringDocIdSet, ultimately reducing TP99 search latency by 84% and cutting CPU usage dramatically.

Elasticsearch · Index Sorting · RoaringBitmap
0 likes · 26 min read
Liulishuo Tech Team
Nov 17, 2022 · Big Data

Real‑time Data Warehouse Architecture and Technical Solution at Liulishuo

This article describes Liulishuo's migration to a Flink‑based real‑time data warehouse, covering background, benefits, technology selection (storage, Flink platform, dimension table connectors), overall architecture, concrete Hudi and Elasticsearch ingestion examples, processing SQL, and future outlook for unified batch‑streaming storage.

Elasticsearch · Flink · Hudi
0 likes · 15 min read
HelloTech
Nov 15, 2022 · Backend Development

Investigation of Elasticsearch RestClient Load‑Balancing and Traffic Skew Issues

The investigation revealed that Elasticsearch RestClient’s built‑in round‑robin and dead‑node blacklisting redirected traffic from failed data‑node addresses—mistakenly included in the static IP list—onto a single client node, causing severe load imbalance and timeouts, which were eliminated after correcting the IP list.

Elasticsearch · Java Client · Round Robin
0 likes · 17 min read
Selected Java Interview Questions
Nov 13, 2022 · Backend Development

Integrating Elasticsearch with Spring Boot for Book Search: A Step‑by‑Step Guide

This article explains why MySQL LIKE queries are inefficient for search, introduces Elasticsearch as a high‑performance alternative, and provides a complete Spring Boot integration tutorial—including version compatibility, Maven dependencies, Java configuration, entity mapping, repository interfaces, service and controller code, and testing steps—to implement CRUD‑style full‑text search for a book catalog.

CRUD · Elasticsearch · Search
0 likes · 11 min read