Tagged articles

22 articles

Page 1 of 1

Apr 17, 2026 · Backend Development

Resolving ID Conflicts After Sharding: 5 Practical Solutions

When sharding databases, independent auto‑increment IDs can collide; this article explains why the conflict occurs and evaluates five concrete solutions—UUID, MySQL auto‑increment step, segment allocation, Snowflake algorithm, and Meituan Leaf—detailing their mechanisms, trade‑offs, performance, and suitability.

ID generationMeituan LeafSegment

0 likes · 14 min read

Resolving ID Conflicts After Sharding: 5 Practical Solutions

Sanyou's Java Diary

Jul 14, 2025 · Big Data

How Kafka Handles Millions of Messages per Second: Inside Its High‑Performance Architecture

This article explains how Kafka achieves extremely high throughput by using a Reactor‑based non‑blocking I/O model, zero‑copy data transfer, sequential disk writes, memory‑mapped files, sparse indexing, partition load‑balancing, compression, batch processing, and a lock‑free offset design.

KafkaPageCachePartition

0 likes · 27 min read

How Kafka Handles Millions of Messages per Second: Inside Its High‑Performance Architecture

JD Cloud Developers

Jun 30, 2025 · Backend Development

Unveiling Guava Cache Internals: Why It Lags Behind Caffeine

This article dissects Guava Cache's source code, explaining its segment‑based locking, data structures, put/get implementations, cleanup and eviction mechanisms, and then contrasts its performance and design choices with the more modern Caffeine cache, highlighting why Guava falls short.

CacheCaffeineGuava

0 likes · 39 min read

Unveiling Guava Cache Internals: Why It Lags Behind Caffeine

Sanyou's Java Diary

Oct 10, 2024 · Big Data

Why Kafka Handles Millions of Messages Per Second: Inside Its High‑Performance Architecture

This article explains how Kafka achieves ultra‑high throughput and low latency despite being disk‑based, covering its Reactor I/O network model, zero‑copy techniques, partitioning strategies, segment logs with sparse indexes, sequential disk writes, page cache usage, compression, batch processing, and lock‑free offset management.

KafkaOffset ManagementReactor I/O

0 likes · 27 min read

Why Kafka Handles Millions of Messages Per Second: Inside Its High‑Performance Architecture

IT Services Circle

Jun 24, 2024 · Databases

Understanding Elasticsearch Architecture: Inverted Index, Term Dictionary, Segments, and Distributed Search

This article explains how Elasticsearch transforms simple keyword matching into a high‑performance, scalable search engine by using inverted indexes, term dictionaries, posting lists, term indexes, stored fields, doc values, segments, and distributed node architectures to achieve fast, reliable full‑text search on massive data sets.

ElasticsearchSegmentTerm Dictionary

0 likes · 16 min read

Understanding Elasticsearch Architecture: Inverted Index, Term Dictionary, Segments, and Distributed Search

Youzan Coder

Aug 8, 2023 · Big Data

Kylin4 Deployment and Performance Optimizations at Youzan

Since 2018 Youzan has migrated all online services to Kylin4, addressing long cube rebuilds, single‑point cache, CPU spikes, and throttling gaps by adding batch segment builds, low‑priority concurrency controls, Redis‑based query caching, parquet skew mitigation, range‑query acceleration, and class‑loader optimizations, which together doubled query‑per‑second capacity to 150, cut latency by up to 50 % and reduced CPU usage.

Big DataCubeKylin

0 likes · 17 min read

Kylin4 Deployment and Performance Optimizations at Youzan

Programmer DD

May 15, 2023 · Backend Development

Choosing the Right Distributed ID Strategy: UUID, Snowflake, Segment, Redis, and More

This article examines various distributed ID generation techniques—including UUID, database auto‑increment, segment mode, Redis INCR, Snowflake, Meituan Leaf, Baidu UidGenerator, and Didi TinyID—detailing their principles, advantages, drawbacks, and code examples to help developers select the most suitable solution for their systems.

Segmentdistributed-idredis

0 likes · 16 min read

Choosing the Right Distributed ID Strategy: UUID, Snowflake, Segment, Redis, and More

Liangxu Linux

Dec 5, 2022 · Databases

How Does MySQL Store a Row? Inside InnoDB Files, Pages, and Row Formats

This article explains where MySQL keeps its data files, how InnoDB organizes tablespaces into segments, extents, pages and rows, details the Compact row format—including variable‑length field length lists, NULL‑value lists, and hidden fields—and clarifies the limits of VARCHAR and row‑overflow handling.

Database StorageExtentInnoDB

0 likes · 22 min read

How Does MySQL Store a Row? Inside InnoDB Files, Pages, and Row Formats

Aikesheng Open Source Community

Dec 5, 2022 · Databases

Greenplum Standby Failure Diagnosis and Recovery Procedure

This article describes how to identify and fix Greenplum standby segment failures by inspecting segment status, comparing missing files, generating a recovery configuration, executing gprecoverseg commands, and verifying that all segments return to normal operation without service interruption.

GreenplumRecoverySQL

0 likes · 17 min read

Greenplum Standby Failure Diagnosis and Recovery Procedure

Top Architect

Nov 29, 2022 · Backend Development

Distributed ID Generation Strategies: UUID, Auto‑Increment, Segment Mode, Redis, Snowflake, Baidu UidGenerator, Meituan Leaf, Didi TinyID

This article explains why distributed ID generation is needed, outlines business requirements such as global uniqueness and monotonicity, and compares eight practical schemes—including UUID, database auto‑increment, segment mode, Redis, Snowflake, Baidu UidGenerator, Meituan Leaf, and Didi TinyID—detailing their advantages, disadvantages, and implementation examples.

BackendSegmentdistributed-id

0 likes · 16 min read

Distributed ID Generation Strategies: UUID, Auto‑Increment, Segment Mode, Redis, Snowflake, Baidu UidGenerator, Meituan Leaf, Didi TinyID

Tencent Cloud Developer

Aug 24, 2022 · Backend Development

Go Distributed ID Generation: UUID, ShortUUID, XID, KSUID, ULID, Snowflake, and Segment Modes

The article surveys Go‑based distributed ID generation techniques—from UUID, short‑UUID, XID, KSUID, ULID, and Snowflake to database auto‑increment, Redis, ZooKeeper, and segment/leaf approaches—detailing their characteristics, trade‑offs, implementation snippets, and strategies for ordering, uniqueness, and clock‑rollback handling.

GoSegmentdatabase

0 likes · 12 min read

Go Distributed ID Generation: UUID, ShortUUID, XID, KSUID, ULID, Snowflake, and Segment Modes

Top Architect

Nov 7, 2021 · Backend Development

Understanding Near Real‑Time Search and Core Architecture of ElasticSearch

This article explains how ElasticSearch achieves near real‑time search by using immutable inverted indexes, segment merging, shard distribution, and a translog for durability, while also offering practical guidance on how to study the system effectively.

Distributed SearchSegmentinverted index

0 likes · 8 min read

Understanding Near Real‑Time Search and Core Architecture of ElasticSearch

IT Architects Alliance

Aug 29, 2021 · Big Data

Kafka Storage Mechanism and Reliability Guarantees

This article explains Kafka's storage architecture—including topics, partitions, segments, and their naming rules—along with how data is read, and details the system's reliability features such as ISR/OSR replication, leader election, producer acknowledgment levels, and delivery guarantees.

At-Least-OnceDistributedSegment

0 likes · 10 min read

Kafka Storage Mechanism and Reliability Guarantees

Java High-Performance Architecture

May 18, 2021 · Databases

How Elasticsearch Writes Data: Buffers, Translog, Refresh, and Merge Explained

Elasticsearch routes client write requests to the primary shard, buffers documents in memory, logs operations to a translog, periodically refreshes to create segment files, and uses flush and merge processes to ensure data durability, optimize I/O, and consolidate segments for efficient search.

ElasticsearchSegmentWrite Process

0 likes · 7 min read

How Elasticsearch Writes Data: Buffers, Translog, Refresh, and Merge Explained

Programmer DD

Jan 28, 2021 · Databases

How Elasticsearch Writes, Reads, and Searches Data: Inside the Engine

This article explains Elasticsearch's internal mechanisms for writing, reading, and searching data, covering the roles of coordinating nodes, primary and replica shards, buffers, translog, segment files, refresh cycles, commit and flush operations, as well as Lucene's inverted index and how deletions and updates are handled.

ElasticsearchSegmentinverted index

0 likes · 10 min read

How Elasticsearch Writes, Reads, and Searches Data: Inside the Engine

Big Data Technology & Architecture

Aug 25, 2020 · Big Data

Understanding Kafka's Segment Storage and Index Design

This article explains how Kafka partitions data into segments, stores each segment as paired index and log files, and uses sparse indexing to enable efficient queries, illustrating the process with examples and diagrams of segment layout and offset lookup.

Big DataKafkaSegment

0 likes · 4 min read

Understanding Kafka's Segment Storage and Index Design

Big Data Technology & Architecture

Jul 20, 2020 · Big Data

Kafka Workflow and File Storage Mechanism: Topics, Partitions, Segments, Index and Log Files

This article explains Kafka’s workflow, detailing how topics, partitions, and segments are organized, the structure of index and log files, message composition, offset-based retrieval, and the overall data directory layout, providing a comprehensive overview of Kafka’s storage architecture.

Big DataKafkaOFFSET

0 likes · 8 min read

Kafka Workflow and File Storage Mechanism: Topics, Partitions, Segments, Index and Log Files

Big Data Technology Architecture

Jun 29, 2020 · Fundamentals

Kafka Storage Mechanism and Reliability Guarantees

This article explains Kafka's internal storage architecture—including topics, partitions, segments, .log and .index files—how data is read, and the various reliability mechanisms such as ISR/OSR, LEO/HW, producer acknowledgment levels, leader election strategies, and delivery semantics.

KafkaProducer AcksReliability

0 likes · 9 min read

DevOps Coach

Nov 26, 2019 · Backend Development

Why Elasticsearch Creates Too Many Segments and How Lucene Flush Works

The article explains how Elasticsearch’s use of Lucene’s flush mechanism, concurrent shard writes, and IndexWriter buffering lead to an excess of small segments, outlines the flush conditions, and offers guidance on managing write concurrency for better performance.

ElasticsearchFlushIndexWriter

0 likes · 10 min read

Why Elasticsearch Creates Too Many Segments and How Lucene Flush Works

Architecture Digest

Sep 11, 2019 · Databases

Distributed ID Generation Mechanisms: Auto‑Increment, Multi‑Master, Segment Mode, Snowflake and Redis

This article reviews various distributed ID generation strategies—including database auto‑increment tables, dual‑master MySQL setups, segment allocation, the Snowflake algorithm, and Redis INCR—detailing their designs, SQL examples, scalability trade‑offs, and open‑source implementations.

Segmentdatabasedistributed-id

0 likes · 13 min read

Distributed ID Generation Mechanisms: Auto‑Increment, Multi‑Master, Segment Mode, Snowflake and Redis

dbaplus Community

Oct 23, 2017 · Databases

Unlock Oracle Performance: How Logical Structures Impact SQL Optimization

This article explains how Oracle's logical structures—blocks, segments, tablespaces, and rowids—affect SQL performance, presents detailed experiments on block capacity, row migration, segment handling, tablespace behavior, and provides practical optimization guidelines based on observed logical read reductions.

Block SizeLogical StructureOracle

0 likes · 11 min read

Unlock Oracle Performance: How Logical Structures Impact SQL Optimization

Meituan Technology Team

Jan 14, 2015 · Big Data

Kafka File Storage Mechanism and Architecture

Kafka stores each topic as partitions that are divided into sequential segment files containing paired .log data and .index files, using global offsets and sparse memory‑mapped indexes to enable fast offset‑based lookups, efficient deletions, and minimal disk I/O in real‑world deployments.

KafkaMessage QueuePartition

0 likes · 9 min read

Kafka File Storage Mechanism and Architecture