Tagged articles

1273 articles

Page 2 of 13

Sep 3, 2025 · Backend Development

How We Boosted Kafka Throughput by 35% with Filebeat Tuning and Compression Tricks

This case study details how a high‑traffic Kafka logging cluster was optimized by analyzing low compression ratios, tuning Filebeat parameters, adjusting memory queues and round‑robin settings, and validating the changes through gray‑scale tests, resulting in up to 35% higher throughput and significant resource savings.

FilebeatKafkacompression

0 likes · 10 min read

How We Boosted Kafka Throughput by 35% with Filebeat Tuning and Compression Tricks

Mike Chen's Internet Architecture

Aug 31, 2025 · Big Data

Understanding Kafka Cluster Architecture: Brokers, Topics, and Partitions Explained

This article provides a concise overview of Kafka cluster fundamentals, describing how multiple brokers form a distributed system, the role of topics and partitions for scalability, and illustrating data distribution and replication with a three‑broker example.

Kafkamessage queues

0 likes · 4 min read

Understanding Kafka Cluster Architecture: Brokers, Topics, and Partitions Explained

php Courses

Aug 29, 2025 · Operations

How to Build a Real‑Time PHP Log Event Pipeline for Instant Insights

Learn how to transform PHP logs into real‑time, structured events by implementing a log event pipeline that includes JSON logging, lightweight collectors like Filebeat, streaming platforms such as Kafka or Flink, enrichment, and visualization with Grafana, enabling instant monitoring, alerting, and data‑driven decisions.

FlinkGrafanaKafka

0 likes · 7 min read

How to Build a Real‑Time PHP Log Event Pipeline for Instant Insights

Mike Chen's Internet Architecture

Aug 28, 2025 · Big Data

Understanding Kafka Cluster Architecture: Core Components and How They Work

This article explains the essential components of a Kafka cluster—including producers, consumers, brokers, Zookeeper, and the KRaft controller—detailing how topics, partitions, and replication enable high‑throughput, scalable, and fault‑tolerant distributed messaging.

KafkaStreamingarchitecture

0 likes · 5 min read

Understanding Kafka Cluster Architecture: Core Components and How They Work

Mike Chen's Internet Architecture

Aug 27, 2025 · Big Data

Mastering Kafka: How Producer, Broker, and Consumer Work Together for High‑Throughput Streaming

This article explains Kafka’s core architecture—producer, broker storage, and consumer groups—detailing how messages are partitioned, buffered, replicated, and consumed to achieve high throughput, low latency, and scalable real‑time data processing.

Distributed MessagingKafkaStreaming

0 likes · 4 min read

Mastering Kafka: How Producer, Broker, and Consumer Work Together for High‑Throughput Streaming

Su San Talks Tech

Aug 23, 2025 · Backend Development

How to Build a Robust Asynchronous Processing SDK with Spring and Kafka

This article explains how to design and implement a generic asynchronous processing SDK for Java backend services, covering its purpose, advantages, underlying principles, component choices, database schema, configuration, usage patterns, and best‑practice notes, with code examples and diagrams.

ConfigurationJava backendKafka

0 likes · 9 min read

How to Build a Robust Asynchronous Processing SDK with Spring and Kafka

Alibaba Cloud Big Data AI Platform

Aug 21, 2025 · Big Data

How Hypergryph Built a High‑Performance Real‑Time Analytics Platform with StarRocks

This case study details how Hypergryph leveraged Alibaba Cloud EMR Serverless StarRocks, Flink, and Kafka to replace a ClickHouse data warehouse with a high‑performance, elastic, and easy‑to‑operate real‑time analytics platform that dramatically improved query speed, stability, operational efficiency, and cost for their gaming business.

FlinkKafkaStarRocks

0 likes · 8 min read

How Hypergryph Built a High‑Performance Real‑Time Analytics Platform with StarRocks

Mike Chen's Internet Architecture

Aug 20, 2025 · Backend Development

How Kafka Achieves High‑Performance Storage: Log Segments, Indexes, and Retention

This article explains Kafka's storage architecture, including its partition‑based log files, sequential append writes, log segment management, index files, and configurable time‑ and size‑based retention policies that together enable ultra‑high write throughput while controlling disk usage.

KafkaRetention PolicySequential Write

0 likes · 4 min read

How Kafka Achieves High‑Performance Storage: Log Segments, Indexes, and Retention

MaGe Linux Operations

Aug 19, 2025 · Big Data

Master Kafka High Availability: Replica Sync & Disaster Recovery Strategies

This article provides a comprehensive guide to building enterprise‑grade, highly available Kafka clusters, covering architecture design, hardware planning, production‑level broker configurations, ISR management, monitoring, fault‑tolerance procedures, rolling upgrades, capacity planning, and automation scripts for seamless operations.

KafkaOperationsdisaster-recovery

0 likes · 16 min read

Master Kafka High Availability: Replica Sync & Disaster Recovery Strategies

StarRocks

Aug 19, 2025 · Big Data

How Joydata Scaled to 150 Billion Daily Events with StarRocks: A Data Architecture Journey

Facing daily data growth from millions to 150 billion records, Joydata‑U transformed its analytics platform through three architectural stages—Hadoop, Hadoop + Trino, and finally StarRocks—introducing resource isolation, Flat JSON acceleration, and Bitmap indexing to cut query latency by up to seven times and achieve sub‑2‑minute data freshness across BI, ad‑tech, game analytics, and CRM workloads.

Bitmap IndexData ArchitectureFlat JSON

0 likes · 12 min read

How Joydata Scaled to 150 Billion Daily Events with StarRocks: A Data Architecture Journey

360 Zhihui Cloud Developer

Aug 19, 2025 · Big Data

How to Accurately Size Kafka Clusters: Real‑World Disk I/O Tests and Capacity Planning

This article shares 360 Group's systematic Kafka capacity‑planning methodology, covering hardware performance analysis, disk I/O benchmarking, cluster configuration, load‑testing procedures, observed write‑read dynamics, and practical recommendations for reliable Kafka deployments.

Kafkabig-datacapacity-planning

0 likes · 11 min read

How to Accurately Size Kafka Clusters: Real‑World Disk I/O Tests and Capacity Planning

Mike Chen's Internet Architecture

Aug 17, 2025 · Big Data

Master Kafka: Essential Commands for Starting, Managing Topics, and Messaging

This guide walks you through the core Kafka commands for starting and stopping the service, creating, listing, describing, and deleting topics, as well as producing and consuming messages, while explaining key parameters such as Zookeeper, partitions, and replication factors.

Big DataDistributed SystemsKafka

0 likes · 4 min read

Master Kafka: Essential Commands for Starting, Managing Topics, and Messaging

Practical DevOps Architecture

Aug 4, 2025 · Operations

How to Unlock Elasticsearch Write Block and Fix Logstash 403 Errors

When Kafka backs up and Logstash fails to write to Elasticsearch with a 403 cluster_block_exception, the index is locked by a write block, which can be resolved by clearing the index.blocks.write setting via the Elasticsearch API or Kibana dev tools.

KafkaLogstashtroubleshooting

0 likes · 2 min read

How to Unlock Elasticsearch Write Block and Fix Logstash 403 Errors

Tencent Cloud Middleware

Aug 1, 2025 · Backend Development

Mastering TDMQ CKafka Production: Configurations, Partitioning, and Reliable Messaging

This guide explains how to efficiently produce messages with TDMQ CKafka, covering topic creation, partition sizing, retry policies, asynchronous sending, key/value handling, batch tuning, sticky partitioning, idempotence, ACK settings, code examples, and strategies to avoid data skew and ensure high‑throughput, low‑latency messaging.

BatchingCKafkaIdempotence

0 likes · 18 min read

Mastering TDMQ CKafka Production: Configurations, Partitioning, and Reliable Messaging

Mike Chen's Internet Architecture

Jul 24, 2025 · Big Data

Boost Kafka Consumer Throughput: Multi‑Threading, Consumer Groups & Config Tuning

This guide explains why Kafka consumer throughput matters, then details practical techniques—including multi‑threaded consumption, scaling with consumer groups, client parameter tuning, and batch processing—to dramatically increase throughput while maintaining reliability in high‑concurrency, large‑scale data pipelines.

ConsumerHigh ThroughputKafka

0 likes · 5 min read

Boost Kafka Consumer Throughput: Multi‑Threading, Consumer Groups & Config Tuning

Mike Chen's Internet Architecture

Jul 17, 2025 · Big Data

How Kafka Achieves Million-Message Writes: Inside Its Sequential Append & Page Cache Magic

This article explains how Apache Kafka attains million‑level write throughput by sequentially appending messages to log files, leveraging operating‑system page cache and asynchronous disk flushing, and details the step‑by‑step data flow from producer to broker to storage.

High ThroughputKafkaMessage Queue

0 likes · 4 min read

How Kafka Achieves Million-Message Writes: Inside Its Sequential Append & Page Cache Magic

macrozheng

Jul 17, 2025 · Backend Development

How We Resolved Repeated Kafka Message Backlogs in a High‑Traffic Restaurant System

This article recounts a series of Kafka message backlog incidents in a restaurant ordering system and explains how targeted optimizations—batch database queries, index tuning, data archiving, and thread‑pool scaling—eliminated the delays and restored reliable kitchen display performance.

Backend DevelopmentKafkaMessage Queue

0 likes · 12 min read

How We Resolved Repeated Kafka Message Backlogs in a High‑Traffic Restaurant System

Architect

Jul 16, 2025 · Backend Development

Mastering Spring Transaction Hooks: Async Kafka Logging After Commit

This article explains how to use Spring's TransactionSynchronizationManager to detect active transactions and reliably send Kafka messages either immediately or after transaction commit, illustrated with a payment‑system use case and complete code examples.

AsyncJavaKafka

0 likes · 10 min read

Mastering Spring Transaction Hooks: Async Kafka Logging After Commit

Sanyou's Java Diary

Jul 14, 2025 · Big Data

How Kafka Handles Millions of Messages per Second: Inside Its High‑Performance Architecture

This article explains how Kafka achieves extremely high throughput by using a Reactor‑based non‑blocking I/O model, zero‑copy data transfer, sequential disk writes, memory‑mapped files, sparse indexing, partition load‑balancing, compression, batch processing, and a lock‑free offset design.

KafkaPageCachePartition

0 likes · 27 min read

How Kafka Handles Millions of Messages per Second: Inside Its High‑Performance Architecture

Mike Chen's Internet Architecture

Jul 14, 2025 · Backend Development

Kafka Producer Idempotency: PID, Sequence Numbers, and Broker Deduplication

Kafka ensures that a producer’s repeated message sends, caused by network glitches or broker failures, result in only one persisted record per partition by using a unique Producer ID, monotonically increasing sequence numbers, and broker-side tracking of the latest committed sequence for each PID‑partition pair.

BackendKafkaMessage Deduplication

0 likes · 5 min read

Kafka Producer Idempotency: PID, Sequence Numbers, and Broker Deduplication

Mike Chen's Internet Architecture

Jul 9, 2025 · Backend Development

How to Prevent Duplicate Consumption in Kafka: Practical Strategies

This article explains why Kafka’s at‑least‑once delivery can cause duplicate message processing, outlines the business risks of such duplicates, and presents four practical solutions—including idempotent design, manual offset commits, exactly‑once semantics, and dead‑letter queues—to ensure reliable consumption.

Dead Letter QueueDuplicate ConsumptionExactly-Once

0 likes · 6 min read

How to Prevent Duplicate Consumption in Kafka: Practical Strategies

Mike Chen's Internet Architecture

Jul 7, 2025 · Big Data

How Kafka Achieves Billion-Message Throughput: Sequential Disk Writes, Page Cache, and Zero‑Copy

This article explains how Kafka sustains massive traffic by writing logs sequentially to disk, leveraging the operating system’s page cache for fast in‑memory writes, employing zero‑copy techniques like sendfile to avoid user‑space copying, and batching messages to reduce network overhead, thereby delivering high‑throughput, low‑latency streaming.

BatchingHigh ThroughputKafka

0 likes · 4 min read

How Kafka Achieves Billion-Message Throughput: Sequential Disk Writes, Page Cache, and Zero‑Copy

FunTester

Jul 5, 2025 · Big Data

Master Kafka: Core Concepts and Performance Testing Strategies

This article explains Kafka’s high‑performance distributed streaming architecture, key components such as topics, partitions, producers, consumers, brokers, offsets, and ZooKeeper, and provides step‑by‑step workflows for producers and consumers along with performance‑testing tips and Maven setup.

Big DataJavaKafka

0 likes · 9 min read

Master Kafka: Core Concepts and Performance Testing Strategies

Practical DevOps Architecture

Jul 4, 2025 · Operations

How to Configure Logstash to Ship Logs to Alibaba Cloud OSS

This guide walks you through installing Logstash and the logstash-output-oss plugin, then shows a complete logstash.conf that reads from Kafka, filters and enriches events, and outputs them to Alibaba Cloud OSS with compression, rotation, and recovery settings.

ConfigurationKafkaLog Shipping

0 likes · 3 min read

How to Configure Logstash to Ship Logs to Alibaba Cloud OSS

Code Wrench

Jul 4, 2025 · Frontend Development

Boost Web Performance: Leveraging navigator.sendBeacon and Kafka for Efficient Data Transmission

This article explains how to improve front‑end data transfer using the browser's navigator.sendBeacon API and how to streamline back‑end processing with Go, Kafka, and ClickHouse to achieve higher performance and stability in feature‑heavy web applications.

BackendClickHouseKafka

0 likes · 8 min read

Boost Web Performance: Leveraging navigator.sendBeacon and Kafka for Efficient Data Transmission

Big Data Tech Team

Jul 3, 2025 · Big Data

Master Kafka: A Complete Learning Roadmap from Basics to Advanced Projects

This guide presents a step‑by‑step Kafka learning roadmap covering core concepts, architecture, configuration, monitoring tools, practical project ideas, advanced components like Streams and KSQL, plus code samples and resource recommendations to help beginners become proficient in real‑time data streaming.

Code ExamplesKafkaStreaming

0 likes · 14 min read

Master Kafka: A Complete Learning Roadmap from Basics to Advanced Projects

Top Architect

Jul 3, 2025 · Backend Development

Building a Robust Asynchronous Processing SDK with Spring, Kafka and MySQL

This article introduces a non‑intrusive asynchronous processing SDK that leverages Spring transaction events, Kafka queues, and MySQL tables to ensure reliable execution, eventual consistency, and automatic retry without impacting the main business flow.

AsynchronousDesign PatternsKafka

0 likes · 13 min read

Building a Robust Asynchronous Processing SDK with Spring, Kafka and MySQL

Mike Chen's Internet Architecture

Jul 2, 2025 · Backend Development

Master Kafka: Core Principles, Architecture, and Workflow Explained

This article provides a comprehensive overview of Kafka, covering its high‑throughput distributed messaging model, key components such as producers, consumers, brokers, topics, partitions, and metadata management, as well as the end‑to‑end workflow from production to consumption.

Distributed MessagingKafkaStreaming

0 likes · 5 min read

Master Kafka: Core Principles, Architecture, and Workflow Explained

Mike Chen's Internet Architecture

Jun 30, 2025 · Backend Development

Why Slow Kafka Consumers Can Cripple Your System—and How to Fix It

When Kafka consumers can't keep up with producers, messages pile up, storage pressure rises, latency spikes, and downstream services suffer, but increasing consumer threads, batching, and asynchronous processing can alleviate these issues and restore system stability.

Consumer LagDistributed StreamingKafka

0 likes · 4 min read

Why Slow Kafka Consumers Can Cripple Your System—and How to Fix It

dbaplus Community

Jun 25, 2025 · Operations

How We Boosted Kafka Production Capacity by 35% with Simple Compression Tweaks

Facing petabyte‑scale log traffic, the Qunar team identified low compression rates in their Kafka‑Filebeat pipeline as the main bottleneck and, through systematic tuning of batch size, memory queues, and round‑robin settings, achieved a 35% reduction in traffic and a 30‑42% drop in request volume while raising per‑minute throughput by 35%.

BackendFilebeatKafka

0 likes · 10 min read

How We Boosted Kafka Production Capacity by 35% with Simple Compression Tweaks

Code Ape Tech Column

Jun 20, 2025 · Backend Development

Mastering Asynchronous Processing in Java: Build a Robust SDK with Spring

This article introduces a generic asynchronous processing SDK for Java, explaining its purpose, advantages, underlying principles, component choices such as Kafka, XXL‑Job, MySQL, and Spring AOP, and provides detailed configuration, database schema, usage examples, and best‑practice guidelines to achieve reliable, non‑blocking execution and eventual consistency in backend systems.

AsynchronousBackendKafka

0 likes · 9 min read

Mastering Asynchronous Processing in Java: Build a Robust SDK with Spring

Su San Talks Tech

Jun 20, 2025 · Backend Development

Why Environment Setup Is the Real Pain in Spring Boot Projects (And How to Fix It)

This article walks through the common frustrations of IDE and environment setup for Spring Boot projects, explains version compatibility between Spring Cloud, Spring Boot, and Kafka, demonstrates how Maven and auto‑configuration simplify dependencies, and provides ready‑to‑use code snippets and useful tooling recommendations for backend developers.

IDE setupKafkaSpring Boot

0 likes · 11 min read

Why Environment Setup Is the Real Pain in Spring Boot Projects (And How to Fix It)

Senior Tony

Jun 18, 2025 · Backend Development

Mastering High‑Concurrency Account Balance Deductions with Kafka, Sharding, and Parallel Processing

This article examines the challenges of high‑concurrency write scenarios in ad‑billing systems, explains why row‑level locking becomes a bottleneck, and presents a step‑by‑step solution using Kafka for asynchronous peak shaving, parallel consumer processing, database sharding, hotspot dispersion, and batch deduction to achieve reliable real‑time balance updates.

Kafkaaccount balance deductionhigh concurrency

0 likes · 8 min read

Mastering High‑Concurrency Account Balance Deductions with Kafka, Sharding, and Parallel Processing

Architect

Jun 6, 2025 · Backend Development

How to Prevent Duplicate Messages in Kafka and Pulsar: A Practical Guide

This article explains the three message delivery semantics, the common causes of duplicate messages in queue systems, and presents concrete producer‑side, broker‑side, and consumer‑side deduplication techniques for Kafka and Pulsar, including code samples and best‑practice recommendations.

BackendIdempotenceKafka

0 likes · 9 min read

How to Prevent Duplicate Messages in Kafka and Pulsar: A Practical Guide

Instant Consumer Technology Team

Jun 5, 2025 · Big Data

Mastering Kafka in Production: Boost Throughput, Ensure Reliability, and Avoid Data Loss

This article shares practical Kafka production insights, covering architecture overview, producer throughput tuning, message loss prevention, broker and consumer configurations, duplicate consumption avoidance, backlog mitigation, ordering guarantees, and the mechanics of consumer group rebalancing, helping engineers build stable, high‑performance streaming pipelines.

Big DataKafkaMessage Queue

0 likes · 15 min read

Mastering Kafka in Production: Boost Throughput, Ensure Reliability, and Avoid Data Loss

Mike Chen's Internet Architecture

Jun 3, 2025 · Big Data

Kafka High-Concurrency Core Design Explained

This article explains how Kafka achieves high concurrency through its distributed broker cluster, partitioned topics, sequential log writes, message compression, asynchronous producer mechanisms, and OS page‑cache techniques, illustrating the combined architectural and performance optimizations that enable massive throughput.

Kafkaasynchronous producerdistributed architecture

0 likes · 4 min read

Kafka High-Concurrency Core Design Explained

Full-Stack Internet Architecture

May 27, 2025 · Big Data

Understanding Event Streaming in Kafka: Core Concepts, Architecture, and Use Cases

This article explains Kafka's event streaming concept, detailing events and streams, core components such as producers, topics, partitions, consumers, persistence, and typical real‑time data pipeline, event‑driven architecture, stream processing, and log aggregation use cases, highlighting its role as a foundational big‑data infrastructure.

Event StreamingKafkaReal-time Processing

0 likes · 7 min read

Understanding Event Streaming in Kafka: Core Concepts, Architecture, and Use Cases

Full-Stack Internet Architecture

May 23, 2025 · Big Data

Step-by-Step Guide to Installing and Using Apache Kafka 3.8.1 on Linux

This tutorial walks through downloading, extracting, configuring, starting, creating topics, producing and consuming messages, and finally stopping Apache Kafka 3.8.1 on a Linux system, including all necessary command‑line instructions.

KafkaLinuxMessage Queue

0 likes · 4 min read

Step-by-Step Guide to Installing and Using Apache Kafka 3.8.1 on Linux

Java Captain

May 23, 2025 · Backend Development

Common Causes of Kafka Message Loss and Mitigation Strategies

This article examines the typical reasons Kafka messages are lost across producers, brokers, and consumers, and provides detailed configuration recommendations and best‑practice solutions to significantly reduce the risk of data loss in distributed streaming systems.

BrokerConfigurationConsumer

0 likes · 15 min read

Common Causes of Kafka Message Loss and Mitigation Strategies

Su San Talks Tech

May 23, 2025 · Backend Development

Kafka vs RabbitMQ vs RocketMQ vs ActiveMQ: Key Differences and How to Choose

This article explains the fundamentals, architectures, key concepts, advantages, and disadvantages of Kafka, RabbitMQ, RocketMQ, and ActiveMQ, and provides practical guidance on selecting the most suitable message queue for various application scenarios such as high‑throughput logging, financial transactions, or small‑scale services.

ComparisonKafkaRabbitMQ

0 likes · 19 min read

Kafka vs RabbitMQ vs RocketMQ vs ActiveMQ: Key Differences and How to Choose

Mike Chen's Internet Architecture

May 22, 2025 · Big Data

Optimizing Kafka Producer for High Throughput: Batching, Asynchronous Sending, Compression, and Concurrency

This article details practical techniques for boosting Kafka producer throughput, covering batch sending optimization, linger.ms tuning, asynchronous send handling, compression options, and concurrent sending strategies, while providing code examples and performance considerations for high‑throughput streaming applications.

AsynchronousBatchingHigh Throughput

0 likes · 4 min read

Optimizing Kafka Producer for High Throughput: Batching, Asynchronous Sending, Compression, and Concurrency

Full-Stack Internet Architecture

May 20, 2025 · Big Data

Why Learn Kafka? Core Benefits, Use Cases, and a Summary

This article explains why Kafka is widely adopted by top companies, outlines its high throughput, scalability, and durability, and describes key real‑time data pipeline, stream processing, and big‑data integration scenarios, concluding that mastering Kafka is essential for modern backend and data engineering roles.

KafkaReal-time Processingdata engineering

0 likes · 4 min read

Why Learn Kafka? Core Benefits, Use Cases, and a Summary

Code Ape Tech Column

May 19, 2025 · Backend Development

Mastering CQRS in Spring Microservices: From Basics to Event Sourcing

This article explains the CQRS architectural pattern, its benefits and pitfalls, and provides a step‑by‑step guide to implementing CQRS with Spring Boot, Axon, and Kafka, including command and query handling, event sourcing, and practical considerations for microservice architectures.

AxonBackend ArchitectureCQRS

0 likes · 13 min read

Mastering CQRS in Spring Microservices: From Basics to Event Sourcing

MaGe Linux Operations

May 15, 2025 · Big Data

Unlocking Kafka: Deep Dive into Architecture, Reliability, and Deployment

This article explains Kafka's core concepts—including topics, partitions, log segmentation, offset management, and acknowledgment levels—while also providing a step‑by‑step guide to deploying Zookeeper, Kafka, Filebeat, and ELK, troubleshooting common issues, and visualizing logs in Kibana.

DeploymentELKKafka

0 likes · 8 min read

Unlocking Kafka: Deep Dive into Architecture, Reliability, and Deployment

Selected Java Interview Questions

May 15, 2025 · Backend Development

Six Common Approaches to Synchronize MySQL Data to Elasticsearch

This article reviews six mainstream solutions for keeping MySQL and Elasticsearch in sync—including synchronous double‑write, asynchronous MQ‑based double‑write, Logstash polling, Canal binlog listening, DataX batch migration, and Flink stream processing—detailing their scenarios, advantages, drawbacks, and practical code examples to guide optimal technical selection.

CanalElasticsearchFlink

0 likes · 8 min read

Six Common Approaches to Synchronize MySQL Data to Elasticsearch

Airbnb Technology Team

May 13, 2025 · Backend Development

How Airbnb Scaled Its Postcard Generation System for Group Travel

Airbnb's media team engineered a scalable postcard generation pipeline that matches destinations, handles localized text layout, and delivers high‑performance images across iOS, Android, Web, and external messaging platforms using flexible templates, async processing, and pre‑generation strategies.

AirbnbBackend ArchitectureKafka

0 likes · 11 min read

How Airbnb Scaled Its Postcard Generation System for Group Travel

Java Architecture Diary

May 12, 2025 · Big Data

Why Kafka 4.0’s KRaft Mode Is a Game‑Changer for Distributed Messaging

Kafka 4.0 replaces ZooKeeper with the native KRaft (Kafka Raft) consensus layer, simplifying deployment, boosting scalability, cutting latency, and introducing a new incremental consumer rebalance protocol and share‑group model that together reshape how distributed streaming applications are built and operated.

Consumer RebalanceKRaftKafka

0 likes · 9 min read

Why Kafka 4.0’s KRaft Mode Is a Game‑Changer for Distributed Messaging

Qunar Tech Salon

May 9, 2025 · Operations

Kafka Production Optimization: Reducing Load and Improving Compression via Filebeat Tuning

This technical case study details how a high‑traffic Kafka logging cluster was optimized by adjusting Filebeat and Kafka parameters, increasing compression batch size, and tuning Kubernetes settings, resulting in significant reductions in request volume, network traffic, CPU usage, and overall resource consumption.

FilebeatKafkaOperations

0 likes · 10 min read

Kafka Production Optimization: Reducing Load and Improving Compression via Filebeat Tuning

Code Ape Tech Column

May 9, 2025 · Databases

Efficient Strategies for Importing One Billion Records into MySQL

This article explains how to import 1 billion 1 KB log records stored in HDFS or S3 into MySQL by analyzing single‑table limits, using batch inserts, choosing storage engines, sharding, optimizing file‑reading methods, and coordinating distributed tasks with Redis, Redisson, and Zookeeper to ensure ordered, reliable, and high‑throughput data loading.

Batch InsertDistributed SystemsKafka

0 likes · 19 min read

Efficient Strategies for Importing One Billion Records into MySQL

StarRocks

May 8, 2025 · Backend Development

How Grab Supercharged Spark Observability 10× with StarRocks – Inside the Iris Architecture

Grab replaced its fragmented Grafana‑Superset stack with a StarRocks‑backed Iris platform, achieving over ten‑fold query speedups, 40% lower resource usage, and a unified real‑time and historical data store for Spark observability across its Southeast Asian super‑app ecosystem.

Data PlatformKafkaMaterialized Views

0 likes · 16 min read

How Grab Supercharged Spark Observability 10× with StarRocks – Inside the Iris Architecture

Raymond Ops

May 7, 2025 · Operations

How Kafka Elects Leaders and Distributes Partitions: Inside the Mechanics

Kafka’s internal mechanisms for leader election, partition assignment, and file storage are explained, covering how the Controller uses ZooKeeper, the ISR-based leader selection process, partition distribution strategies, segment file structures, and the evolution of offset management from Zookeeper to the __consumer_offsets topic.

KafkaOffsetsPartition Assignment

0 likes · 5 min read

How Kafka Elects Leaders and Distributes Partitions: Inside the Mechanics

Su San Talks Tech

May 5, 2025 · Big Data

6 Proven Ways to Sync MySQL Data to Elasticsearch – Choose the Right Strategy

This article compares six mainstream MySQL‑to‑Elasticsearch synchronization methods—synchronous double‑write, asynchronous MQ, Logstash polling, Canal binlog listening, DataX batch sync, and Flink streaming—detailing scenarios, code samples, advantages, drawbacks, and practical selection guidance for developers.

CanalElasticsearchFlink

0 likes · 9 min read

6 Proven Ways to Sync MySQL Data to Elasticsearch – Choose the Right Strategy

Java Tech Enthusiast

May 4, 2025 · Fundamentals

Technical Interview Q&A: TCP, Redis, Kafka, CAP Theorem, Singleton, C++ STL, and Algorithms

This guide reviews common backend interview topics, explaining TCP TIME_WAIT behavior, multi‑port listening, full page load steps, Redis data types, Kafka consumer sizing and at‑most‑once semantics, the CAP theorem, Singleton usage, C++ std::map complexity, and an O(n) doubly‑linked list reversal algorithm.

AlgorithmsCCAP theorem

0 likes · 12 min read

Technical Interview Q&A: TCP, Redis, Kafka, CAP Theorem, Singleton, C++ STL, and Algorithms

Mike Chen's Internet Architecture

Apr 28, 2025 · Big Data

How Kafka Hits Million‑Message Throughput with Disk Writes, Caching & Zero‑Copy

Kafka attains million‑level message throughput by writing logs sequentially to disk, leveraging OS page cache for batch flushing, sending messages in large batches, and employing zero‑copy techniques that move data directly from disk to network sockets, dramatically reducing I/O overhead and CPU usage.

BatchingDisk I/OHigh Throughput

0 likes · 4 min read

How Kafka Hits Million‑Message Throughput with Disk Writes, Caching & Zero‑Copy

dbaplus Community

Apr 16, 2025 · Backend Development

How Ctrip’s Kafka Gatekeeper Boosts FinOps Data Quality and Automates Cost Governance

This article explains how Ctrip’s hybrid‑cloud FinOps billing system uses a custom Kafka Gatekeeper to detect, locate, and automatically remediate data‑quality issues across dozens of self‑built PaaS services, improving coverage, timeliness, and responsibility attribution while supporting high‑availability deployments.

BackendCloud NativeData Quality

0 likes · 19 min read

How Ctrip’s Kafka Gatekeeper Boosts FinOps Data Quality and Automates Cost Governance

Java Captain

Apr 14, 2025 · Backend Development

Project Initialization, Version Management, and Scaffold Setup for Spring Boot Backend Development

This guide walks through initializing a Spring Boot project, handling IDE setup challenges, managing version compatibility between Spring Cloud, Spring Boot, and Kafka, and provides practical code examples for global exception handling, logging, CORS configuration, Swagger integration, and useful development tools.

Exception HandlingKafkaSpring Boot

0 likes · 12 min read

Project Initialization, Version Management, and Scaffold Setup for Spring Boot Backend Development

Efficient Ops

Apr 13, 2025 · Backend Development

Master Kafka’s Core Concepts: A Visual Guide to Topics, Partitions, and Architecture

This article visually breaks down Kafka’s fundamental concepts—including producers, consumers, topics, consumer groups, partitions, and cluster architecture—explaining how messages flow, are stored, and achieve load balancing and ordering within a distributed streaming system.

Backend DevelopmentKafkaPartitions

0 likes · 7 min read

Master Kafka’s Core Concepts: A Visual Guide to Topics, Partitions, and Architecture

macrozheng

Apr 11, 2025 · Backend Development

How to Guarantee Exactly‑Once Message Consumption in High‑Concurrency Systems

This article explains common causes of duplicate message consumption in high‑traffic systems and presents a three‑layer defense—producer idempotence, broker de‑duplication, and consumer idempotent design—plus monitoring and reconciliation strategies to achieve reliable exactly‑once processing.

Exactly-OnceIdempotenceKafka

0 likes · 7 min read

How to Guarantee Exactly‑Once Message Consumption in High‑Concurrency Systems

Sanyou's Java Diary

Apr 10, 2025 · Backend Development

Why RocketMQ Beats Kafka: Architecture Simplified and Features Amplified

This article explains how RocketMQ, a Chinese‑origin message queue, simplifies Kafka’s architecture while adding powerful features such as tag‑based filtering, transactional messaging, delayed and dead‑letter queues, and a unified commit‑log storage model, making delayed processing and high‑throughput scenarios easier to implement.

Distributed SystemsKafkaMessage Queue

0 likes · 10 min read

Why RocketMQ Beats Kafka: Architecture Simplified and Features Amplified

Architecture Digest

Apr 6, 2025 · Backend Development

Design and Implementation of a General‑Purpose Asynchronous Processing SDK for Backend Systems

This article introduces a reusable asynchronous processing SDK built on Spring, Kafka, and MySQL that leverages @AsyncExec annotations, transactional event listeners, and configurable thread pools to ensure reliable, non‑blocking execution, data consistency, and fault‑tolerant handling of business logic in backend applications.

AsyncBackendKafka

0 likes · 8 min read

Design and Implementation of a General‑Purpose Asynchronous Processing SDK for Backend Systems

Ma Wei Says

Mar 30, 2025 · Fundamentals

How Kafka 4.0’s KRaft Replaces ZooKeeper with Raft Consensus

Kafka 4.0 introduces KRaft, a ZooKeeper‑free metadata layer built on the Raft consensus algorithm, detailing role transitions, leader election, log replication, controller and broker responsibilities, and fault‑tolerance mechanisms, enabling a more scalable and self‑managed architecture for large‑scale distributed streaming.

Consensus AlgorithmDistributed SystemsKRaft

0 likes · 13 min read

How Kafka 4.0’s KRaft Replaces ZooKeeper with Raft Consensus

Ma Wei Says

Mar 28, 2025 · Backend Development

Choosing the Right Message Queue: Kafka vs RocketMQ vs RabbitMQ Explained

This article compares Kafka, RocketMQ, and RabbitMQ, detailing their architectures, performance characteristics, strengths, and ideal use‑cases to help engineers select the most suitable message‑queue solution for high‑throughput, fault‑tolerant, and real‑time processing scenarios.

Distributed SystemsEvent StreamingHigh Throughput

0 likes · 11 min read

Choosing the Right Message Queue: Kafka vs RocketMQ vs RabbitMQ Explained

Linux Cloud Computing Practice

Mar 28, 2025 · Backend Development

200+ Essential Interview Questions for Linux, Nginx, MySQL, Kafka & Security

This article compiles a comprehensive collection of over two hundred interview questions covering Linux system administration, Nginx web server, MySQL databases, Redis caching, Kafka messaging, and core network security topics, providing a one‑stop resource for candidates preparing for major tech company interviews.

KafkaNginxinterview-questions

0 likes · 11 min read

200+ Essential Interview Questions for Linux, Nginx, MySQL, Kafka & Security

Didi Tech

Mar 27, 2025 · Operations

Performance Optimization and Architecture of iLogTail for High‑Scale Log Collection

Didi replaced its legacy agent with Alibaba’s open‑source iLogTail, re‑architected it to use a shared thread‑pool and SIMD‑accelerated parsing, rewrote critical plugins in C++ and added robust Kafka retry logic, achieving over twice the throughput while cutting CPU usage by more than half and maintaining near‑zero latency at massive scale.

C++KafkaPerformance Optimization

0 likes · 10 min read

Performance Optimization and Architecture of iLogTail for High‑Scale Log Collection

Linux Ops Smart Journey

Mar 27, 2025 · Cloud Native

Stream Kubernetes Events to Elasticsearch with Kafka & Logstash

This guide walks you through deploying the Kubernetes Event Exporter, packaging its Helm chart and Docker image, configuring Kafka TLS secrets, setting up Logstash to ingest events and forward them to Elasticsearch, creating an index template, and verifying the end‑to‑end pipeline.

ElasticsearchEvent ExporterKafka

0 likes · 10 min read

Stream Kubernetes Events to Elasticsearch with Kafka & Logstash

Mike Chen's Internet Architecture

Mar 27, 2025 · Backend Development

Kafka vs RabbitMQ: Which Messaging System Fits Your Needs?

This article compares Kafka and RabbitMQ across design philosophy, performance, data model, delivery semantics, ecosystem, and typical use cases, helping readers choose the right messaging middleware for high‑throughput or reliable delivery scenarios.

Backend ArchitectureComparisonKafka

0 likes · 5 min read

Kafka vs RabbitMQ: Which Messaging System Fits Your Needs?

iQIYI Technical Product Team

Mar 27, 2025 · Big Data

Cost‑Effective Real‑Time Data Warehouse 2.0: Migrating from Kafka to Iceberg

iQIYI transformed its real‑time data warehouse by replacing a costly Kafka‑based Lambda stack with a unified stream‑batch Iceberg lake, cutting storage expenses by 90%, halving compute costs, extending data retention, and delivering minute‑level freshness for 90% of use cases while preserving second‑level processing where needed.

Cost OptimizationFlinkIceberg

0 likes · 11 min read

Cost‑Effective Real‑Time Data Warehouse 2.0: Migrating from Kafka to Iceberg

Ma Wei Says

Mar 27, 2025 · Big Data

What’s New in Apache Kafka 4.0? A Deep Dive into KRaft, Java 17, and Next‑Gen Consumer Rebalance

Apache Kafka 4.0 eliminates Zookeeper with the KRaft consensus protocol, upgrades Java requirements to 17, introduces a next‑generation consumer rebalance protocol, adds Share Group queue semantics, and bundles numerous performance, security, and API improvements for modern streaming workloads.

Big DataConsumer RebalanceKRaft

0 likes · 7 min read

What’s New in Apache Kafka 4.0? A Deep Dive into KRaft, Java 17, and Next‑Gen Consumer Rebalance

Su San Talks Tech

Mar 27, 2025 · Operations

How to Ensure Data Consistency in Message Queues: 10 Hard‑Earned Lessons

This article explores why message queues can lose consistency, presents concrete solutions such as transactional two‑phase commits, persistence settings, replica configurations, unique IDs, idempotent designs, and dead‑letter queues, and shares ten practical lessons drawn from real‑world incidents.

Data ConsistencyKafkaRabbitMQ

0 likes · 12 min read

How to Ensure Data Consistency in Message Queues: 10 Hard‑Earned Lessons

Big Data Technology Architecture

Mar 25, 2025 · Big Data

Kafka 4.0 Release: KRaft Architecture, Consumer Group Optimizations, and New Queue Features

Kafka 4.0 marks a milestone release that replaces ZooKeeper with the KRaft consensus engine, improves scalability and performance, introduces a server‑side consumer‑group protocol, adds shared‑group queue capabilities, and updates Java requirements and documentation, delivering a more robust and flexible streaming platform.

Big DataDistributed StreamingJava11

0 likes · 6 min read

Kafka 4.0 Release: KRaft Architecture, Consumer Group Optimizations, and New Queue Features

Big Data Technology & Architecture

Mar 24, 2025 · Big Data

Apache Kafka 4.0: Major New Features – KRaft Architecture, Consumer Group Protocol, Queue Mode, Java Upgrade, API Simplifications and More

Apache Kafka 4.0 introduces a groundbreaking KRaft architecture that removes ZooKeeper, a revamped consumer‑group protocol that dramatically speeds up rebalancing, a new queue mode for point‑to‑point messaging, upgraded Java requirements, streamlined APIs, and numerous performance and security enhancements, reshaping both development and operations for large‑scale streaming workloads.

Java 11KRaftKafka

0 likes · 12 min read

Apache Kafka 4.0: Major New Features – KRaft Architecture, Consumer Group Protocol, Queue Mode, Java Upgrade, API Simplifications and More

Mike Chen's Internet Architecture

Mar 20, 2025 · Backend Development

Master Apache Kafka: Architecture, Setup, and Essential Commands Explained

This article provides a comprehensive overview of Apache Kafka, covering its core concepts, architecture, common use cases, installation steps, and essential command-line operations for managing topics and brokers in production environments and ensuring reliability.

Backend DevelopmentDistributed SystemsKafka

0 likes · 9 min read

Master Apache Kafka: Architecture, Setup, and Essential Commands Explained

Code Ape Tech Column

Mar 14, 2025 · Backend Development

Eight Common Use Cases of Message Queues in Backend Development

This article explores eight common scenarios for using message queues in backend development, covering asynchronous processing, service decoupling, traffic shaping, delayed tasks, log aggregation, distributed transactions, remote calls, and broadcast notifications, each illustrated with Java, RocketMQ, and Kafka code examples.

JavaKafkaMQ

0 likes · 15 min read

Eight Common Use Cases of Message Queues in Backend Development

Java Tech Enthusiast

Mar 9, 2025 · Backend Development

Handling Duplicate Messages in Message Queues

Message queues can deliver duplicates under at‑least‑once semantics, so to protect idempotent business logic such as orders and payments you should combine producer‑side idempotence (e.g., Kafka’s enable.idempotence), broker‑side deduplication (e.g., Pulsar), and a consumer‑side guard using unique IDs stored in a database or Redis.

IdempotenceKafkaMessage Queue

0 likes · 7 min read

Handling Duplicate Messages in Message Queues

IT Services Circle

Mar 8, 2025 · Backend Development

Handling Duplicate Messages in Message Queues: Semantics, Producer and Broker Deduplication, and Consumer Strategies

Message queues can cause duplicate messages that affect idempotent business processes, so this article explains the three delivery semantics (At Least Once, Exactly Once, At Most Once), the causes of duplication, and practical deduplication techniques for producers, brokers (Kafka, Pulsar), and consumers using code examples.

IdempotenceKafkaPulsar

0 likes · 8 min read

Handling Duplicate Messages in Message Queues: Semantics, Producer and Broker Deduplication, and Consumer Strategies

Selected Java Interview Questions

Mar 5, 2025 · Backend Development

Spring Boot Project Initialization, Version Management, and Common Backend Tools

This article walks through setting up a Spring Boot project, handling version compatibility with Spring Cloud, Kafka, and Maven, demonstrates global exception handling, logging, CORS and Swagger configurations, and recommends useful backend utilities such as embedded Redis, MyBatis Plus, and Redisson.

Backend DevelopmentCORSException Handling

0 likes · 9 min read

Spring Boot Project Initialization, Version Management, and Common Backend Tools

Linux Ops Smart Journey

Feb 27, 2025 · Information Security

Build Military‑Grade Kafka Security in Four Easy Steps

This guide walks you through four progressive stages—basic configuration, SSL encryption, SCRAM authentication, and combined SSL+SASL—showing how to harden Kafka with certificates, keystores, and client settings to achieve financial‑level protection.

Distributed MessagingKafkaSASL

0 likes · 9 min read

Build Military‑Grade Kafka Security in Four Easy Steps

dbaplus Community

Feb 26, 2025 · Operations

Choosing the Right Message Queue: A Deep Dive into Kafka, Pulsar, RocketMQ, RabbitMQ, and NSQ

This comprehensive guide explains the role of message queues in distributed systems, compares the architectures and core concepts of Kafka, Pulsar, RocketMQ, RabbitMQ, and NSQ, and provides practical criteria for selecting, configuring, and operating a message‑queue solution in high‑performance, high‑availability environments.

KafkaNSQPulsar

0 likes · 50 min read

Choosing the Right Message Queue: A Deep Dive into Kafka, Pulsar, RocketMQ, RabbitMQ, and NSQ

IT Services Circle

Feb 23, 2025 · Backend Development

DJI Java Backend Interview Experience, Salary Insights, and Core Concurrency Knowledge

The article shares DJI's impressive compensation data, details a 50‑minute Java backend interview covering concurrency tools, locks, CountDownLatch, ConcurrentHashMap, Kafka ordering, zero‑copy, Spring AOP, filters vs. interceptors, and provides code examples and explanations for each concept.

BackendDJIJava

0 likes · 28 min read

DJI Java Backend Interview Experience, Salary Insights, and Core Concurrency Knowledge

macrozheng

Feb 20, 2025 · Backend Development

Master Java Concurrency & SpringBoot: DJI Interview Insights & Advanced Locks

This article combines a DJI interview experience—including salary and bonus details—with an in‑depth guide to Java concurrency tools, lock implementations, SpringBoot AOP, filters vs. interceptors, Kafka ordering, zero‑copy techniques, and related backend development best practices.

Backend DevelopmentKafkaLocks

0 likes · 26 min read

Master Java Concurrency & SpringBoot: DJI Interview Insights & Advanced Locks

Linux Ops Smart Journey

Feb 18, 2025 · Cloud Native

Deploy Filebeat with Helm on Kubernetes: Automated Log Collection to Kafka

This step‑by‑step guide shows how to use a Helm chart to deploy Filebeat in a Kubernetes cluster, automatically collect container logs, and forward them to a Kafka cluster for reliable, scalable observability.

FilebeatKafkaKubernetes

0 likes · 7 min read

Deploy Filebeat with Helm on Kubernetes: Automated Log Collection to Kafka

Top Architect

Jan 19, 2025 · Backend Development

Understanding Kafka Consumer: Offset Management, Rebalance, Partition Assignment, and Thread Safety

This article provides a comprehensive technical walkthrough of KafkaConsumer, covering Java configuration code, delivery semantics (at‑most‑once, at‑least‑once, exactly‑once), offset commit strategies, rebalance mechanisms, partition assignment algorithms, thread‑safety concerns, and internal poll implementation, followed by unrelated promotional content.

Backend DevelopmentConsumerKafka

0 likes · 16 min read

Understanding Kafka Consumer: Offset Management, Rebalance, Partition Assignment, and Thread Safety

IT Architects Alliance

Jan 9, 2025 · Backend Development

Understanding Message Queues: Comparing Kafka, RabbitMQ, and RocketMQ

Message queues act as asynchronous communication bridges in distributed systems, and this article examines three leading solutions—Kafka, RabbitMQ, and RocketMQ—detailing their performance, scalability, reliability, strengths, drawbacks, and suitable use‑cases to guide developers in selecting the right technology.

KafkaMessage QueueRabbitMQ

0 likes · 13 min read

Understanding Message Queues: Comparing Kafka, RabbitMQ, and RocketMQ

dbaplus Community

Jan 5, 2025 · Big Data

How DeWu Halved Observability Costs Using AutoMQ and ClickHouse Storage‑Compute Separation

DeWu’s observability platform faced scalability, cost, and operational challenges from petabyte‑scale trace data, prompting a shift to a storage‑compute separated architecture that leverages AutoMQ’s Kafka‑compatible service and ClickHouse Enterprise’s SharedMergeTree engine, ultimately achieving up to 50% cost reduction and five‑fold cold‑read performance gains.

AutoMQBig DataClickHouse

0 likes · 20 min read

How DeWu Halved Observability Costs Using AutoMQ and ClickHouse Storage‑Compute Separation

IT Services Circle

Jan 3, 2025 · Backend Development

Eight Common Use Cases of Message Queues (MQ) with Code Examples

This article explains eight typical scenarios for using message queues—including asynchronous processing, service decoupling, traffic shaping, delayed tasks, log collection, distributed transactions, remote invocation, and broadcast notifications—providing clear explanations and Java code snippets for each case.

Distributed TransactionsKafkaMessage Queue

0 likes · 13 min read

Eight Common Use Cases of Message Queues (MQ) with Code Examples

Top Architect

Jan 3, 2025 · Backend Development

Optimizing Large-Scale Log Reporting in a Backend System Using Archival and Redis Caching

This article describes how a legacy SSM‑based backend reporting service was refactored by archiving log data, leveraging Redis for hourly counters, and scheduling synchronization tasks to dramatically reduce query time for millions of records without adding external processing frameworks.

JavaKafkaPerformance Optimization

0 likes · 13 min read

Optimizing Large-Scale Log Reporting in a Backend System Using Archival and Redis Caching

Ctrip Technology

Jan 3, 2025 · Big Data

Design and Implementation of a Kafka Gatekeeper for FinOps Billing Data Quality Governance

This article describes the challenges of data quality in Ctrip’s hybrid‑cloud FinOps billing system and presents the design, implementation, and high‑availability deployment of a custom Kafka Gatekeeper proxy that performs pre‑validation, configurable rules, self‑service dashboards, and automated alerts to improve coverage, timeliness, and responsibility attribution.

Big DataCloud NativeData Quality

0 likes · 17 min read

Design and Implementation of a Kafka Gatekeeper for FinOps Billing Data Quality Governance

Architect

Dec 30, 2024 · Backend Development

How to Speed Up Legacy Report Queries with Redis Archiving and Scheduled Tasks

This article details how a legacy reporting module suffering from slow page loads was refactored by archiving log data, caching hourly counts in Redis, and using scheduled synchronization to dramatically improve query performance without adding new middleware.

BackendJavaKafka

0 likes · 11 min read

How to Speed Up Legacy Report Queries with Redis Archiving and Scheduled Tasks

Bin's Tech Cabin

Dec 25, 2024 · Backend Development

Unveiling Netty and Kafka Time Wheels: High‑Performance Scheduling Explained

This article explores the design and implementation of time wheel algorithms in Netty and Kafka, comparing their single‑layer and multi‑layer approaches, analyzing performance trade‑offs, and detailing how these systems achieve O(1) scheduling for massive delayed tasks while avoiding empty ticks.

BackendKafkaNetty

0 likes · 76 min read

Unveiling Netty and Kafka Time Wheels: High‑Performance Scheduling Explained

MaGe Linux Operations

Dec 14, 2024 · Big Data

Master Kafka: From Core Concepts to Real-World Deployment

This comprehensive guide explains Kafka’s architecture, core APIs, topics and partitions, deployment steps, multi‑broker clustering, and practical use cases such as messaging, log aggregation, stream processing, and data import/export with Kafka Connect, providing a hands‑on tutorial for developers and engineers.

Distributed SystemsInstallationKafka

0 likes · 30 min read

Master Kafka: From Core Concepts to Real-World Deployment

Top Architect

Dec 12, 2024 · Backend Development

Using Spring Transaction Hooks to Send Kafka Messages After Transaction Commit

This article explains how to leverage Spring's TransactionSynchronizationManager to detect active transactions, register synchronization callbacks, and asynchronously send Kafka messages only after a transaction successfully commits, illustrated with a payment‑system example and complete Java code snippets, while also noting thread‑local considerations and promotional offers.

JavaKafkaSpringBoot

0 likes · 12 min read

Using Spring Transaction Hooks to Send Kafka Messages After Transaction Commit

JD Cloud Developers

Dec 11, 2024 · Operations

How to Prevent Message Queue Disorder in Distributed Systems

This article examines the causes of message queue disorder in distributed systems and presents practical solutions such as ordered messaging, pre‑consumption checks, state machines, and monitoring to improve system stability and data consistency.

KafkaMessage OrderingMessage Queue

0 likes · 9 min read

How to Prevent Message Queue Disorder in Distributed Systems

Alibaba Cloud Big Data AI Platform

Dec 9, 2024 · Big Data

Why Kafka Falls Short for Real‑Time Analytics and How Fluss Changes the Game

Flink Forward Asia 2024 highlighted the limitations of Kafka for real‑time analytics—lack of updates, poor data exploration, costly back‑tracking, and high network overhead—while introducing Fluss, a columnar streaming storage that offers low‑latency reads, CDC, lake‑stream integration, and efficient Delta Join for scalable, fast analytics.

Big DataDelta JoinFlink

0 likes · 15 min read

Why Kafka Falls Short for Real‑Time Analytics and How Fluss Changes the Game

Xiaohongshu Tech REDtech

Dec 5, 2024 · Big Data

Interview with Jianchen: Journey from Open Source Contributor to Data Engineer at Xiaohongshu

In this interview, Xiaohongshu data engineer Jianchen recounts his evolution from a computer‑science student discovering open‑source through MIT6.824 to contributing to SOFAJRaft and Apache RocketMQ, detailing his OSPP projects, the decision to join Xiaohongshu, and his work on a cloud‑native Kafka engine that cut storage and compute usage by half.

Apache RocketMQBig DataCareer Development

0 likes · 11 min read

Interview with Jianchen: Journey from Open Source Contributor to Data Engineer at Xiaohongshu

Java Architect Essentials

Dec 4, 2024 · Backend Development

Spring Boot Project Initialization, Version Management, and Common Scaffold Code Guide

This article walks through initializing a Spring Boot project, handling version compatibility between Spring Cloud, Spring Boot, and Kafka, demonstrates common scaffolding code such as global exception handling, logging, CORS configuration, and recommends useful tools like Maven, embedded Redis, MyBatis‑Plus, and MapStruct for backend development.

Backend DevelopmentJavaKafka

0 likes · 9 min read

Spring Boot Project Initialization, Version Management, and Common Scaffold Code Guide

Sanyou's Java Diary

Dec 2, 2024 · Big Data

Understanding Kafka: Core Architecture, Storage, and Reliability Explained

This article provides a comprehensive overview of Kafka, covering its overall structure, key components such as brokers, producers, consumers, topics, partitions, replicas, leader‑follower mechanics, logical and physical storage models, producer and consumer workflows, configuration parameters, partition assignment strategies, rebalancing, log retention and compaction, indexing, zero‑copy transmission, and the reliability concepts that ensure data durability.

Data StreamingDistributed SystemsKafka

0 likes · 18 min read

Understanding Kafka: Core Architecture, Storage, and Reliability Explained

dbaplus Community

Nov 27, 2024 · Backend Development

Designing a Robust Asynchronous Processing SDK with Spring, Kafka, and XXL‑Job

An extensible asynchronous processing SDK is presented, leveraging Spring transaction events, Kafka queues, XXL‑Job scheduling, and a non‑intrusive architecture to ensure high performance, data safety, and eventual consistency while simplifying integration across Java backend services.

JavaKafkaXXL-JOB

0 likes · 10 min read

Designing a Robust Asynchronous Processing SDK with Spring, Kafka, and XXL‑Job

Big Data Technology & Architecture

Nov 26, 2024 · Big Data

Understanding Full GC, Data Skew, and Parallelism in Flink Tasks

This article explains how to monitor and interpret Full GC in Flink TaskManagers, detect and address data skew through proper data distribution and parallelism settings, and recommends aligning consumer parallelism with Kafka partitions, while also providing practical tips for using tools like Prometheus and Arthas.

Data SkewFlinkFull GC

0 likes · 6 min read

Understanding Full GC, Data Skew, and Parallelism in Flink Tasks

Top Architect

Nov 25, 2024 · Backend Development

Using Spring Transaction Hooks to Send Kafka Messages After Transaction Commit

This article explains how to leverage Spring's TransactionSynchronizationManager to detect active transactions and register synchronization callbacks so that Kafka messages are sent asynchronously only after a transaction successfully commits, ensuring data consistency without impacting the main business flow.

JavaKafkaTransactionSynchronizationManager

0 likes · 12 min read

Aikesheng Open Source Community

Nov 25, 2024 · Big Data

Real-time Data Synchronization from OceanBase to Kafka Using ActionOMS and Flink

This article demonstrates how to use ActionOMS to capture incremental changes from OceanBase, stream them to Kafka in various formats, and employ Flink to deduplicate and aggregate transaction data into a daily summary, illustrating a complete real-time data pipeline for financial use cases.

ActionOMSFlinkKafka

0 likes · 10 min read

Real-time Data Synchronization from OceanBase to Kafka Using ActionOMS and Flink