Tag

Indexing

Alimama Tech
Feb 15, 2023 · Big Data

Dolphin: Alibaba's Hyper‑Converged Multi‑Modal Big Data Engine Overview

Dolphin, Alibaba’s hyper‑converged multi‑modal big‑data engine, unifies OLAP, AI, streaming, and batch workloads on an MPP foundation with decoupled compute and storage. It offers a Dolphin SQL layer, advanced bitmap, GroupTable, and AFile indexes, intelligent materialization, and one‑write‑multiple‑read storage that cuts costs by more than 70% while delivering sub‑millisecond queries on trillion‑row datasets.

AI · Big Data · Indexing
14 min read
HelloTech
Jul 6, 2022 · Big Data

Investigation and Resolution of Elasticsearch Write Timeout Issues in a Real-Time Flink Data Sync Pipeline

The team traced intermittent Elasticsearch write timeouts in their real‑time Flink‑to‑Elasticsearch pipeline to lock contention caused by frequent duplicate updates to the same document IDs. They eliminated the issue by aggregating binlog events in a 5‑second sliding window to deduplicate writes, adjusting refresh intervals, using async translog durability, and disabling non‑essential fields.

Elasticsearch · Flink · Indexing
7 min read
vivo Internet Technology
Nov 13, 2017 · Databases

Authoritative Guide to Elasticsearch Search Performance Tuning (Part 1)

This guide explains how to tune Elasticsearch 5.x search performance: modeling documents with flattened, nested, or parent‑child structures, managing global ordinals, keeping the JVM heap below 50% of RAM, monitoring garbage collection, and choosing SSD storage with ample file‑system cache.

Elasticsearch · Indexing · JVM
11 min read
vivo Internet Technology
Oct 14, 2017 · Databases

Elasticsearch Index Performance Optimization (Part 2)

To maximize Elasticsearch bulk-indexing speed, temporarily disable refreshes and replicas, tune merge throttling and scheduler threads, enlarge translog and index buffer thresholds, and adjust indexing and bulk thread-pool sizes, then restore defaults after the load completes.

Elasticsearch · Indexing · Performance Tuning
13 min read