Tagged articles
8 articles
Page 1 of 1
macrozheng
macrozheng
Apr 18, 2025 · Big Data

How to Build Near Real-Time Elasticsearch Indexes for PB-Scale Data

This article explains why traditional databases like MySQL struggle with massive data, introduces Elasticsearch’s advantages, and details a practical architecture using Hive, Canal, and Otter to achieve near real‑time indexing of petabyte‑scale datasets with minimal latency.

Big DataCanalData Transfer Service
0 likes · 20 min read
How to Build Near Real-Time Elasticsearch Indexes for PB-Scale Data
Su San Talks Tech
Su San Talks Tech
Dec 8, 2024 · Big Data

How to Build Near Real-Time ElasticSearch Indexes for PB-Scale Data

This article explains why traditional databases like MySQL struggle with massive datasets, introduces ElasticSearch’s inverted‑index architecture, and details a practical pipeline using Hive, wide tables, binlog, Canal, and Otter to achieve near real‑time indexing for petabyte‑level data.

CanalOtterdata pipeline
0 likes · 19 min read
How to Build Near Real-Time ElasticSearch Indexes for PB-Scale Data
Su San Talks Tech
Su San Talks Tech
Feb 27, 2023 · Big Data

How to Build Near Real-Time Elasticsearch Indexes for PB-Scale Data

This article explains how to construct near real-time Elasticsearch indexes for petabyte‑scale datasets by comparing MySQL limitations, introducing ES fundamentals, leveraging Hive and wide tables, and employing binlog‑based tools like Canal and Otter for low‑latency data synchronization.

CanalElasticsearchOtter
0 likes · 22 min read
How to Build Near Real-Time Elasticsearch Indexes for PB-Scale Data
dbaplus Community
dbaplus Community
Jun 6, 2021 · Big Data

How to Build Near‑Real‑Time Elasticsearch Indexes for PB‑Scale Data

This article explains why traditional databases like MySQL struggle with petabyte‑scale queries, introduces Elasticsearch’s architecture and inverted‑index advantages, and details a practical pipeline using Hive, Canal, Otter, and a custom DTS service to achieve near‑real‑time indexing for massive data volumes.

CanalElasticsearchOtter
0 likes · 19 min read
How to Build Near‑Real‑Time Elasticsearch Indexes for PB‑Scale Data
dbaplus Community
dbaplus Community
Oct 27, 2019 · Databases

How to Build a Real‑Time MySQL Data Sync Platform with Alibaba Otter

This guide explains how to create a production‑grade, near‑real‑time MySQL data‑synchronisation and query platform using Alibaba's open‑source Otter middleware, covering architecture, key features, installation steps, configuration of manager and node components, and best practices for safe query operations.

Database AdministrationOtterdata synchronization
0 likes · 13 min read
How to Build a Real‑Time MySQL Data Sync Platform with Alibaba Otter