Tagged articles

1273 articles

Page 8 of 13

Jul 23, 2021 · Backend Development

Preventing Message Loss in RabbitMQ and Kafka: Transactions, Confirm Mode, Persistence, and Configuration Best Practices

This article explains the common points where messages can be lost in RabbitMQ and Kafka, compares transaction and confirm mechanisms, describes how to enable persistence and proper acknowledgments, and provides concrete configuration recommendations for producers and consumers to ensure reliable message delivery.

KafkaMessage ReliabilityRabbitMQ

0 likes · 9 min read

Preventing Message Loss in RabbitMQ and Kafka: Transactions, Confirm Mode, Persistence, and Configuration Best Practices

360 Quality & Efficiency

Jul 23, 2021 · Big Data

Introduction to Apache Kafka: Architecture, APIs, Code Examples, and Optimization

This article provides a comprehensive overview of Apache Kafka, covering its definition, architecture, command‑line API usage, Java producer and consumer examples, core principles such as high availability and message ordering, configuration tuning, and a summary of its advantages as a high‑throughput, fault‑tolerant streaming platform.

ConfigurationConsumerKafka

0 likes · 12 min read

Introduction to Apache Kafka: Architecture, APIs, Code Examples, and Optimization

Java Interview Crash Guide

Jul 23, 2021 · Operations

How to Build a Scalable APM System: Inside the Dog Architecture

This article explains what an APM system is, compares logs, traces and metrics, reviews popular tools, and then details the design and implementation of the in‑house Dog APM platform—including client data models, Kafka pipelines, processing pipelines, storage in ClickHouse/Cassandra, and UI visualizations.

APMKafkaMetrics

0 likes · 28 min read

How to Build a Scalable APM System: Inside the Dog Architecture

Top Architect

Jul 22, 2021 · Backend Development

RabbitMQ vs Apache Kafka: In‑Depth Comparison and Guidance for Choosing the Right Messaging System

This article explains the fundamental differences between RabbitMQ and Apache Kafka, covering their architectures, message models, ordering guarantees, routing capabilities, timing controls, retention policies, fault‑tolerance mechanisms, scalability, and consumer complexity, and then provides practical criteria to help architects decide which solution best fits a given use case.

ComparisonKafkaMessage Queue

0 likes · 22 min read

RabbitMQ vs Apache Kafka: In‑Depth Comparison and Guidance for Choosing the Right Messaging System

Open Source Linux

Jul 18, 2021 · Operations

Why a Single Kafka Broker Failure Can Halt All Consumers – The HA Explained

This article explains Kafka's high‑availability mechanisms, covering multi‑replica design, ISR synchronization, leader election, acknowledgment settings, and the hidden risk of the __consumer_offset topic's single‑replica configuration that can cause an entire cluster to become unavailable when one broker fails.

Distributed SystemsISRKafka

0 likes · 9 min read

Why a Single Kafka Broker Failure Can Halt All Consumers – The HA Explained

Top Architect

Jul 16, 2021 · Fundamentals

Introduction to Message Queues, JMS, MQ, and Kafka

This article provides a comprehensive overview of message queues, explaining their purpose, usage scenarios, two communication models, and detailing Java Message Service (JMS) as well as popular MQ implementations such as RabbitMQ and Kafka, complete with diagrams and code examples.

JMSKafkaMessage Queue

0 likes · 15 min read

Introduction to Message Queues, JMS, MQ, and Kafka

Youzan Coder

Jul 15, 2021 · Backend Development

Message Queue Architecture Comparison: NSQ, Kafka, and RocketMQ in Distributed Systems

The article compares the architectures of NSQ (YouZan branch), Kafka, and RocketMQ—detailing their coordination mechanisms, storage models, consistency guarantees, and operational trade‑offs—while recommending Kafka for log‑big‑data workloads, RocketMQ for massive topic counts, and NSQ for extensibility and lightweight deployment.

Distributed SystemsKafkaMessage Queue

0 likes · 16 min read

Message Queue Architecture Comparison: NSQ, Kafka, and RocketMQ in Distributed Systems

IT Architects Alliance

Jul 12, 2021 · Big Data

Kafka Core Concepts, Architecture, Performance Optimization, and Operational Practices

This article explains Kafka's fundamental principles, cluster architecture, data performance techniques such as zero‑copy and log segmentation, resource planning for high‑throughput scenarios, and provides practical operational commands and custom partitioning examples for reliable, high‑availability deployments.

Cluster DeploymentKafkamessage queues

0 likes · 32 min read

Kafka Core Concepts, Architecture, Performance Optimization, and Operational Practices

DevOps

Jul 12, 2021 · Operations

The First Four Chaos Experiments to Run on Apache Kafka

This article explains how to use chaos engineering with Gremlin to design, execute, and analyze four experiments that test Kafka broker load, message loss, split‑brain scenarios, and ZooKeeper outages, helping improve the reliability and resilience of Kafka deployments.

Distributed SystemsGremlinKafka

0 likes · 18 min read

The First Four Chaos Experiments to Run on Apache Kafka

Full-Stack Internet Architecture

Jul 10, 2021 · Backend Development

Why Use Message Queues? Benefits, Drawbacks, High Availability, Idempotency, and Practical Tips

The article explains why message queues are essential for decoupling systems, improving latency, handling traffic spikes, ensuring high availability, maintaining order, and achieving idempotent consumption, while also discussing their disadvantages, configuration details for Kafka, RabbitMQ, RocketMQ, and practical troubleshooting strategies.

IdempotencyKafkaMQ

0 likes · 23 min read

Why Use Message Queues? Benefits, Drawbacks, High Availability, Idempotency, and Practical Tips

Spring Full-Stack Practical Cases

Jul 7, 2021 · Backend Development

Master Kafka Topic Management & Message Flow on CentOS with Java

This guide walks through setting up Kafka on CentOS, creating and configuring topics, inspecting topic and consumer group details, adjusting partitions, and implementing Java producer and consumer applications, complete with command-line examples, code snippets, and troubleshooting tips for remote server access.

CentOSConsumerJava Producer

0 likes · 9 min read

Master Kafka Topic Management & Message Flow on CentOS with Java

Top Architect

Jul 6, 2021 · Backend Development

Interview Guide: Why Use Message Queues, Their Pros and Cons, and Comparison of Kafka, ActiveMQ, RabbitMQ, and RocketMQ

This article explains why message queues are used in system architecture, outlines their advantages such as decoupling, asynchronous processing, and traffic shaping, discusses their drawbacks, and compares four popular MQ solutions—Kafka, ActiveMQ, RabbitMQ, and RocketMQ—to help candidates ace interview questions.

Backend ArchitectureInterview PreparationKafka

0 likes · 12 min read

Interview Guide: Why Use Message Queues, Their Pros and Cons, and Comparison of Kafka, ActiveMQ, RabbitMQ, and RocketMQ

JD Retail Technology

Jul 5, 2021 · Backend Development

Design and Implementation of JD's Real-Time Browsing Record System

The article describes JD's real-time browsing record system architecture, detailing its four modules—storage, query, real-time reporting, and offline reporting—along with hot‑cold data separation, use of Jimdb, HBase, Kafka, and Flink to achieve millisecond‑level latency and high throughput for billions of user records.

BrowsingFlinkHBase

0 likes · 12 min read

Design and Implementation of JD's Real-Time Browsing Record System

MaGe Linux Operations

Jul 3, 2021 · Backend Development

How to Build a Go Log Collector with etcd, Context, and Kafka Integration

This article walks through redesigning a Go‑based log‑collection framework, introducing etcd for distributed configuration, demonstrating context for timeout and data propagation, and showing how to integrate Kafka consumers while improving concurrency handling and adding rate‑limiting mechanisms.

BackendKafkacontext

0 likes · 16 min read

How to Build a Go Log Collector with etcd, Context, and Kafka Integration

Code Ape Tech Column

Jul 3, 2021 · Fundamentals

Introduction to Message Queues, JMS, and Popular MQ Technologies (RabbitMQ, Kafka)

This article explains the fundamentals of message queues, their role in asynchronous processing and system decoupling, introduces Java Message Service (JMS), compares point‑to‑point and publish‑subscribe models, and provides an overview of common MQ implementations such as RabbitMQ and Kafka.

AMQPDistributed SystemsJMS

0 likes · 14 min read

Introduction to Message Queues, JMS, and Popular MQ Technologies (RabbitMQ, Kafka)

dbaplus Community

Jun 30, 2021 · Backend Development

Unlock Kafka’s Speed: Deep Dive into Performance Secrets and Optimizations

This article provides a comprehensive technical guide to Kafka performance, covering the core bottlenecks of network, disk and complexity, detailing optimization techniques such as concurrency, compression, batching, caching and algorithms, and explaining how Kafka’s sequential write, zero‑copy, page cache, reactor‑based network model, batch handling, partition concurrency, and file structure contribute to high throughput.

KafkaZero Copyjava

0 likes · 17 min read

Unlock Kafka’s Speed: Deep Dive into Performance Secrets and Optimizations

Efficient Ops

Jun 28, 2021 · Backend Development

Why a Single Kafka Broker Failure Stops All Consumers – Understanding HA

This article explains Kafka's high‑availability mechanisms, covering multi‑replica design, ISR synchronization, leader election, the impact of the request.required.acks setting, and how the default __consumer_offset topic can become a single point of failure, with concrete steps to fix it.

KafkaReplicationconsumer-offset

0 likes · 9 min read

Why a Single Kafka Broker Failure Stops All Consumers – Understanding HA

Tencent Cloud Middleware

Jun 28, 2021 · Big Data

Getting Started with Kafka’s New KRaft Mode: A Step‑by‑Step Guide

This article introduces Apache Kafka’s KRaft (Kafka Raft) mode, explains its architectural differences from ZooKeeper‑based deployments, details essential configuration parameters, and provides a complete step‑by‑step procedure—including commands and utility tools—to set up and operate a KRaft cluster.

ConfigurationDeploymentDistributed Systems

0 likes · 14 min read

Getting Started with Kafka’s New KRaft Mode: A Step‑by‑Step Guide

DataFunTalk

Jun 26, 2021 · Big Data

Building a Scalable Big Data Service System at Didi: Practices and Lessons

Zhang Liang shares Didi's four-stage journey of constructing and governing large‑scale open‑source big‑data engine services—including engine selection, hardware sizing, PaaS platform building, proxy architecture, and governance—highlighting practical challenges, solutions, and ROI‑driven best practices for Kafka, Elasticsearch, Flink, and related technologies.

Big DataData InfrastructureElasticsearch

0 likes · 16 min read

Building a Scalable Big Data Service System at Didi: Practices and Lessons

Selected Java Interview Questions

Jun 25, 2021 · Backend Development

Message Queues for Interviews: Why Use MQ, Benefits, Drawbacks, and Comparison of Kafka, ActiveMQ, RabbitMQ, and RocketMQ

This article explains why message queues are used in modern systems, outlines common interview questions about MQ, discusses the advantages of decoupling, asynchronous processing, and traffic shaping, examines the pros and cons of MQ, and compares the four major MQ products—Kafka, ActiveMQ, RabbitMQ, and RocketMQ—to help candidates prepare for technical interviews.

ActiveMQKafkaMQ

0 likes · 11 min read

Message Queues for Interviews: Why Use MQ, Benefits, Drawbacks, and Comparison of Kafka, ActiveMQ, RabbitMQ, and RocketMQ

Java Interview Crash Guide

Jun 25, 2021 · Backend Development

Understanding Kafka Transactions: TC Service, Producer Flow, and Code Walkthrough

This article explains how Kafka implements transactions, detailing the role of the Transaction Coordinator (TC) service, the transaction flow diagram, producer initialization, partition handling, offset commits, commit and abort processes, and includes a complete Java code example with client‑side and server‑side components.

ConsumerKafkaProducer

0 likes · 22 min read

Understanding Kafka Transactions: TC Service, Producer Flow, and Code Walkthrough

dbaplus Community

Jun 23, 2021 · Big Data

How Ctrip Finance Built a Real‑Time Binlog‑Based Data Lake with MySQL‑Hive Sync

This article details Ctrip Finance's end‑to‑end data‑foundation architecture that uses MySQL binlog collection via Canal, Kafka streaming, Spark‑Streaming persistence to HDFS, and a merge process to produce timely MySQL‑Hive snapshots, addressing performance, consistency, and delete‑handling challenges.

BinlogKafkaSpark

0 likes · 17 min read

How Ctrip Finance Built a Real‑Time Binlog‑Based Data Lake with MySQL‑Hive Sync

DataFunTalk

Jun 21, 2021 · Big Data

Flink + Iceberg 0.11 Practices in Qunar Data Platform

This article shares Qunar's experience using Flink together with Apache Iceberg 0.11 to address real‑time data warehouse challenges, covering background pain points, Iceberg architecture, solutions for Kafka data loss and Hive latency, and optimization practices such as small‑file handling, sorting, and checkpoint management.

Big DataData LakeFlink

0 likes · 13 min read

Flink + Iceberg 0.11 Practices in Qunar Data Platform

Java Interview Crash Guide

Jun 21, 2021 · Backend Development

How to Prevent Message Loss, Duplicates, and Backlog in Distributed Queues

This article explains practical techniques for detecting lost messages, ensuring reliable delivery across production, storage, and consumption stages, handling duplicate deliveries with idempotent designs, managing message backlogs through performance tuning, and using transactional messages to achieve distributed transaction consistency.

IdempotencyKafkaMessage Loss

0 likes · 23 min read

How to Prevent Message Loss, Duplicates, and Backlog in Distributed Queues

Code Ape Tech Column

Jun 21, 2021 · Operations

Why Simple Kafka Retries Fail and How to Build a Robust Message‑Failure Strategy

This article analyzes common Kafka consumer failure scenarios, explains why naïve retry‑topic or message‑skip approaches can break ordering and data consistency, and presents practical patterns—including error classification, in‑consumer backoff, hidden topics, and DLQ handling—to design resilient asynchronous microservice communication.

Dead Letter QueueError HandlingKafka

0 likes · 21 min read

Why Simple Kafka Retries Fail and How to Build a Robust Message‑Failure Strategy

IT Architects Alliance

Jun 20, 2021 · Operations

Ensuring Message Reliability and Handling Duplicates in Distributed Message Queues

The article explains how to detect and prevent message loss, guarantee reliable delivery, handle duplicate consumption, manage message backlog, and implement distributed transaction messages in systems like Kafka, RocketMQ, and RabbitMQ, providing practical code examples and best‑practice guidelines.

Distributed SystemsIdempotencyKafka

0 likes · 23 min read

Ensuring Message Reliability and Handling Duplicates in Distributed Message Queues

IT Architects Alliance

Jun 20, 2021 · Backend Development

Kafka Architecture, Core Concepts, and Operational Best Practices

This article provides a comprehensive overview of Kafka's architecture, core concepts, high‑throughput design, replication, network model, capacity planning, producer and consumer tuning, custom partitioning, rebalance strategies, broker management, and operational tools for building and maintaining robust distributed messaging systems.

Kafkaperformance

0 likes · 29 min read

Kafka Architecture, Core Concepts, and Operational Best Practices

ITFLY8 Architecture Home

Jun 20, 2021 · Backend Development

How to Prevent Message Loss and Ensure Reliable Delivery in Distributed Systems

This article explains practical techniques for detecting lost messages, guaranteeing reliable production, storage, and consumption stages, handling duplicate deliveries with idempotent designs, managing message backlogs, and implementing distributed transactions using transactional messages in modern message queue systems.

IdempotencyKafkaReliability

0 likes · 18 min read

How to Prevent Message Loss and Ensure Reliable Delivery in Distributed Systems

Java Architect Essentials

Jun 15, 2021 · Big Data

Comprehensive Guide to Apache Kafka: Concepts, Installation, Configuration, and Usage

This article provides a thorough overview of Apache Kafka, covering its core streaming concepts, key components such as topics, partitions, producers and consumers, common use cases, step‑by‑step installation and multi‑broker configuration, fault‑tolerance testing, and an introduction to Kafka Connect for data import/export.

ConsumerDistributed StreamingInstallation

0 likes · 24 min read

Comprehensive Guide to Apache Kafka: Concepts, Installation, Configuration, and Usage

Programmer DD

Jun 14, 2021 · Databases

Master Real‑Time Change Data Capture with Debezium and Spring Boot

Learn how to capture and stream real‑time database changes using Debezium’s distributed CDC framework, configure MySQL binlog, integrate the embedded engine with Spring Boot, and process change events with sample code and Docker setup for robust data pipelines.

CDCChange Data CaptureDebezium

0 likes · 11 min read

Master Real‑Time Change Data Capture with Debezium and Spring Boot

DeWu Technology

Jun 12, 2021 · Backend Development

Design and Optimization of a High‑Throughput Messaging Platform

To handle exploding daily traffic, the team rebuilt the messaging platform with a unified API, concurrent consumption, MongoDB storage, and a priority scheme that uses separate Kafka topics and adjustable pull ratios, while employing a state‑machine‑driven thread pool, multi‑tier delay mechanisms, and MongoDB/Redis‑based fatigue control, delivering fast, traceable, hierarchical urgent delivery with visual metrics and content safety.

KafkaMessagingdelay queue

0 likes · 5 min read

Design and Optimization of a High‑Throughput Messaging Platform

IT Architects Alliance

Jun 11, 2021 · Backend Development

Understanding Message Queues: From Redis List to Kafka and Pulsar

This article explains the evolution of message‑queue middleware by comparing the basic double‑ended queue implementation, Redis list usage, Kafka’s partitioned log architecture with segment files and sparse indexes, and Pulsar’s compute‑storage separation using BookKeeper, highlighting their designs, strengths, and trade‑offs.

Distributed SystemsKafkaMessage Queue

0 likes · 28 min read

Understanding Message Queues: From Redis List to Kafka and Pulsar

Java High-Performance Architecture

Jun 10, 2021 · Backend Development

Why Does a Single Kafka Broker Failure Break the Whole Cluster?

This article explains why a single Kafka broker failure can render the whole cluster unusable, detailing Kafka’s multi‑replica design, leader election, ISR mechanism, acknowledgment settings, and how to configure replication factors and consumer offset topics to achieve true high availability.

ACKConsumer OffsetISR

0 likes · 11 min read

Why Does a Single Kafka Broker Failure Break the Whole Cluster?

ITFLY8 Architecture Home

Jun 10, 2021 · Backend Development

Why Kafka Beats Redis List: A Deep Dive into Message Queue Architecture

This article compares popular message middleware such as Redis, Kafka, and Pulsar, explaining their underlying data structures, strengths and weaknesses, and how concepts like partitions, replication, cursors, and storage segmentation enable high performance, scalability, and reliability in modern distributed messaging systems.

BackendDistributed SystemsKafka

0 likes · 29 min read

Why Kafka Beats Redis List: A Deep Dive into Message Queue Architecture

IT Architects Alliance

Jun 5, 2021 · Big Data

How to Build a Real‑Time Recommendation System with Flink, HBase, and Docker

This article walks through a complete real‑time recommendation system built on Apache Flink, detailing its v2.0 architecture, modules for user behavior, interest, and product profiling, the recommendation algorithms (hot‑list, collaborative filtering, item similarity), and step‑by‑step Docker deployment of MySQL, Redis, HBase, and Kafka.

DockerFlinkHBase

0 likes · 11 min read

How to Build a Real‑Time Recommendation System with Flink, HBase, and Docker

MaGe Linux Operations

Jun 3, 2021 · Big Data

Why Kafka Handles Billions of Messages: Architecture, Use Cases, and Fast Performance

This article introduces Kafka, LinkedIn’s high‑throughput distributed messaging system, explains its core concepts such as brokers, topics, partitions, offsets, producers, consumers, and consumer groups, outlines common use cases like asynchronous decoupling and data‑stream processing, and details its fast performance mechanisms, fault‑tolerance, installation, and configuration steps.

Big DataData StreamingInstallation

0 likes · 11 min read

Why Kafka Handles Billions of Messages: Architecture, Use Cases, and Fast Performance

ITFLY8 Architecture Home

Jun 3, 2021 · Big Data

Building a Real‑Time Flink Recommendation System: Architecture, Code & Deployment

This article walks through a complete Flink‑based recommendation system, detailing its v2.0 architecture, recommendation algorithms, front‑end and back‑end components, and step‑by‑step Docker deployment of MySQL, Redis, HBase, and Kafka services.

Big DataDockerFlink

0 likes · 10 min read

Building a Real‑Time Flink Recommendation System: Architecture, Code & Deployment

macrozheng

Jun 1, 2021 · Big Data

Master Kafka on Linux: Install, Visualize with Kafka‑Eagle, and Use Spring Boot

This guide walks you through installing Kafka on CentOS, configuring Zookeeper, using command‑line tools, setting up the Kafka‑Eagle visual interface, and integrating Kafka with a Spring Boot application, providing code snippets and step‑by‑step instructions for a quick start.

KafkaKafka EagleLinux

0 likes · 14 min read

Master Kafka on Linux: Install, Visualize with Kafka‑Eagle, and Use Spring Boot

Spring Full-Stack Practical Cases

May 28, 2021 · Big Data

How to Set Up a Zookeeper & Kafka Cluster on CentOS 8 with Shell Scripts

This guide walks you through preparing a CentOS 8 environment, installing JDK, Zookeeper and Kafka, configuring each service with proper data directories and ports, and automating startup using shell scripts for a three‑node Zookeeper cluster and a three‑node Kafka cluster.

CentOSCluster SetupKafka

0 likes · 7 min read

How to Set Up a Zookeeper & Kafka Cluster on CentOS 8 with Shell Scripts

Tencent Cloud Developer

May 27, 2021 · Big Data

An Introduction to Kafka: Architecture, Core Components, Service Governance, Performance Optimizations, and Installation Guide

Kafka is a high‑throughput distributed publish‑subscribe system that uses brokers, topics, partitions, offsets, producers, consumers, and Zookeeper for metadata and leader election, offering fast sequential disk writes, page‑cache zero‑copy transfers, ISR‑based replication, and includes step‑by‑step installation of JDK, Zookeeper, and Kafka.

Big DataDistributed MessagingInstallation

0 likes · 11 min read

An Introduction to Kafka: Architecture, Core Components, Service Governance, Performance Optimizations, and Installation Guide

Top Architect

May 26, 2021 · Big Data

Comprehensive Introduction to Apache Kafka: Concepts, Architecture, Installation, and Usage

This article provides a comprehensive guide to Apache Kafka, covering its core concepts, architecture, key APIs, topics and partitions, deployment steps, multi‑broker clustering, fault tolerance, and data integration using Kafka Connect, with detailed command‑line examples.

Big DataConsumerDistributed Streaming

0 likes · 26 min read

Comprehensive Introduction to Apache Kafka: Concepts, Architecture, Installation, and Usage

Full-Stack Internet Architecture

May 25, 2021 · Big Data

Deep Dive into Kafka Producer Configuration Parameters and Performance Tuning

This article provides a comprehensive overview of Kafka Producer configuration properties, categorizing them into basic and performance‑related parameters, explains their impact on producer behavior, and offers practical guidance for tuning throughput, latency, and reliability in real‑world deployments.

ConfigurationKafkaProducer

0 likes · 13 min read

Deep Dive into Kafka Producer Configuration Parameters and Performance Tuning

Top Architect

May 22, 2021 · Big Data

Kafka Basics: Topics, Partitions, Producers, Consumers, and Cluster Architecture

This article provides a comprehensive introduction to Kafka, covering its role as a message system, core concepts such as topics, partitions, producers, consumers, messages, the cluster architecture with replicas and controllers, performance optimizations, log segmentation, and network design, all illustrated with diagrams and code examples.

Big DataKafkaMessage Queue

0 likes · 13 min read

Kafka Basics: Topics, Partitions, Producers, Consumers, and Cluster Architecture

IT Architects Alliance

May 22, 2021 · Big Data

Flink-Based Real‑Time Recommendation System: Architecture, Logic, and Docker Deployment Guide

This article presents a comprehensive walkthrough of a Flink‑powered recommendation system, detailing its v2.0 architecture, module functions, recommendation algorithms (hotness, product similarity, collaborative filtering), front‑end and back‑end UI, and step‑by‑step Docker deployment of MySQL, Redis, HBase, and Kafka services.

Big DataDockerFlink

0 likes · 11 min read

Flink-Based Real‑Time Recommendation System: Architecture, Logic, and Docker Deployment Guide

Dada Group Technology

May 21, 2021 · Backend Development

Implementing Data Heterogeneity for JD Daojia Order Fulfillment: Architecture, Canal Integration, and Lessons Learned

This article examines JD Daojia's order fulfillment system, detailing the challenges of high‑volume prompt‑sound queries, the division of responsibilities among Redis, MySQL, and Elasticsearch, the adoption of Canal for asynchronous data replication, deployment practices with Kafka and Zookeeper, and the key operational lessons learned.

Backend ArchitectureCanalElasticsearch

0 likes · 14 min read

Implementing Data Heterogeneity for JD Daojia Order Fulfillment: Architecture, Canal Integration, and Lessons Learned

dbaplus Community

May 19, 2021 · Big Data

Why Kafka 2.8 Is Dropping Zookeeper: Architecture, Challenges, and the Path to KIP‑500

The article explains how Kafka 2.8 removes its dependency on Zookeeper, detailing Kafka's core concepts, the role of Zookeeper in broker registration, load balancing, and controller election, the operational drawbacks of this coupling, and how KIP‑500 with a Quorum Controller modernizes the architecture.

Distributed SystemsKIP-500Kafka

0 likes · 10 min read

Architect

May 19, 2021 · Big Data

Flink-Based Real-Time Recommendation System Architecture and Deployment Guide

This article presents a comprehensive overview of a Flink-powered real-time recommendation system, detailing its v2.0 architecture, module functions, recommendation algorithms, front‑end and back‑end interfaces, Docker‑based deployment of MySQL, Redis, HBase, Kafka, and step‑by‑step startup procedures.

DockerFlinkHBase

0 likes · 9 min read

Flink-Based Real-Time Recommendation System Architecture and Deployment Guide

Full-Stack Internet Architecture

May 19, 2021 · Backend Development

Understanding Message Queues: Benefits, Design Challenges, and Transactional Solutions

This article explores the role of message queues in microservice architectures, discussing their advantages such as decoupling, asynchronous processing, and load shedding, while also addressing design challenges like concurrency, ordering, duplicate handling, and transactional messaging with solutions including Kafka partitions, outbox patterns, CDC, and RocketMQ.

CDCKafkaMessage Queue

0 likes · 12 min read

Understanding Message Queues: Benefits, Design Challenges, and Transactional Solutions

Architecture Digest

May 17, 2021 · Big Data

Technical Architecture Overview of Toutiao: Data Pipeline, User Modeling, Recommendation System, and Microservices

The article provides a comprehensive technical overview of Toutiao's rapid growth, detailing its massive user base, data collection and processing pipelines, user modeling, cold‑start strategies, recommendation engines, storage solutions, push notification mechanisms, and the underlying microservice and PaaS architecture.

Big DataHadoopKafka

0 likes · 8 min read

Technical Architecture Overview of Toutiao: Data Pipeline, User Modeling, Recommendation System, and Microservices

Java High-Performance Architecture

May 17, 2021 · Backend Development

How to Tackle Message Queue Backlogs and Prevent Data Loss

This article explains why message queues accumulate, the risks of discarded messages, disk exhaustion, and massive pending loads, and provides practical strategies—including avoiding TTL, using monitoring alerts, temporary queues, and partition scaling—to quickly recover and process backlogged messages.

BacklogConsumerKafka

0 likes · 5 min read

How to Tackle Message Queue Backlogs and Prevent Data Loss

Code Ape Tech Column

May 17, 2021 · Backend Development

Ensuring Reliable Message Delivery and Idempotence in RabbitMQ and Kafka

This article explains common scenarios that cause message loss or non‑idempotent processing in RabbitMQ and Kafka, and presents practical solutions such as persistent delivery, confirm mechanisms, delayed delivery, and unique‑ID plus fingerprint strategies to achieve reliable and idempotent message transmission.

IdempotenceKafkaRabbitMQ

0 likes · 6 min read

Ensuring Reliable Message Delivery and Idempotence in RabbitMQ and Kafka

Java High-Performance Architecture

May 15, 2021 · Backend Development

How to Ensure Ordered Messaging with RabbitMQ and Kafka

This article explains how to achieve ordered message processing by coordinating both producers and consumers, covering the differences between RabbitMQ's simple queue ordering and Kafka's partition-based approach, and offering practical techniques for global and partial ordering.

ConsumerKafkaProducer

0 likes · 5 min read

How to Ensure Ordered Messaging with RabbitMQ and Kafka

MaGe Linux Operations

May 14, 2021 · Big Data

Build a Billion-Scale ELK Logging Platform with Filebeat, Kafka, Elasticsearch

This guide walks through the complete architecture and step‑by‑step deployment of a billion‑scale ELK logging system, covering Filebeat agents, Kafka buffering, Logstash processing, Elasticsearch indexing, and Kibana visualization, including configuration files, version details, and best‑practice tips for scaling and security.

ELKElasticsearchFilebeat

0 likes · 12 min read

Build a Billion-Scale ELK Logging Platform with Filebeat, Kafka, Elasticsearch

Programmer DD

May 14, 2021 · Backend Development

Why Simple Retries Fail in Kafka and How to Build Robust Failure Strategies

This article explains Kafka's core concepts, the challenges of consumer failures in microservice architectures, why naïve retry loops or message skipping are insufficient, and presents a nuanced approach that distinguishes recoverable from unrecoverable errors, using back‑off retries and hidden topics to preserve ordering and data integrity.

Distributed SystemsKafkaMicroservices

0 likes · 24 min read

Why Simple Retries Fail in Kafka and How to Build Robust Failure Strategies

Java High-Performance Architecture

May 13, 2021 · Backend Development

How to Prevent Message Loss in RabbitMQ and Kafka: Proven Strategies

This article explains why messages can be lost in RabbitMQ and Kafka, identifies three critical points—producer, broker storage, and consumer—and provides concrete configurations and acknowledgment mechanisms to ensure reliable, loss‑free message delivery.

AcknowledgmentConsumerKafka

0 likes · 7 min read

How to Prevent Message Loss in RabbitMQ and Kafka: Proven Strategies

MaGe Linux Operations

May 12, 2021 · Big Data

Visualizing Kafka: Core Concepts Explained with Diagrams

This article visually breaks down Kafka’s core concepts—including producers, consumers, topics, partitions, consumer groups, and cluster architecture—explaining how messages flow, are stored, and replicated across partitions and nodes, while highlighting the role of ZooKeeper in managing metadata.

Distributed SystemsKafkaMessaging

0 likes · 5 min read

Visualizing Kafka: Core Concepts Explained with Diagrams

vivo Internet Technology

May 12, 2021 · Big Data

Kafka at Trillion-Scale: Ensuring High Availability, Performance, and Operational Best Practices

The article presents a comprehensive guide for running Kafka at trillion‑record daily traffic, detailing version upgrades, data migration, traffic throttling, monitoring, load balancing, resource isolation, security, disaster recovery, Linux tuning, platform automation, performance evaluation, future roadmap, and community contribution practices.

Kafkaperformance

0 likes · 34 min read

Kafka at Trillion-Scale: Ensuring High Availability, Performance, and Operational Best Practices

IT Architects Alliance

May 11, 2021 · Big Data

Demystifying Kafka: Core Concepts of Topics, Partitions, and Architecture

This article provides a clear, visual walkthrough of Kafka’s fundamental architecture, explaining how producers and consumers interact, the role of topics and partitions, consumer groups, and ZooKeeper’s coordination, helping readers grasp message flow, storage, ordering, and fault‑tolerance in a distributed streaming system.

KafkaMessage QueuePartition

0 likes · 6 min read

Demystifying Kafka: Core Concepts of Topics, Partitions, and Architecture

Java High-Performance Architecture

May 10, 2021 · Backend Development

How Do RabbitMQ, Kafka, and RocketMQ Achieve High Availability?

This article compares RabbitMQ, Kafka, and RocketMQ from a high‑availability perspective, explaining each system's deployment modes, replication mechanisms, and the trade‑offs between scalability and fault tolerance.

KafkaMessagingRabbitMQ

0 likes · 6 min read

How Do RabbitMQ, Kafka, and RocketMQ Achieve High Availability?

Code Ape Tech Column

May 10, 2021 · Industry Insights

Why Kafka Beats Redis List: A Deep Dive into Modern Messaging Middleware

This article compares Redis list, Kafka, and Pulsar as messaging middleware, explaining their architectures, strengths, and weaknesses—including queue fundamentals, partitioning, cursor management, consumer groups, high‑availability mechanisms, storage strategies, and consumption models—to help readers choose the right solution for large‑scale systems.

Distributed SystemsKafkaMessaging

0 likes · 30 min read

Why Kafka Beats Redis List: A Deep Dive into Modern Messaging Middleware

macrozheng

May 8, 2021 · Big Data

Why Kafka 2.8 Drops Zookeeper: Architecture, Challenges, and KIP‑500

This article explains how Kafka 2.8 removes its dependency on Zookeeper, describes Kafka's core concepts and its interaction with Zookeeper, outlines the role of the Controller, discusses operational complexities and upgrade paths with KIP‑500, and highlights the benefits of the new KRaft‑based architecture.

Distributed SystemsKIP-500KRaft

0 likes · 10 min read

Why Kafka 2.8 Drops Zookeeper: Architecture, Challenges, and KIP‑500

Architect's Tech Stack

May 7, 2021 · Backend Development

Kafka 2.8 Introduces KRaft: Removing ZooKeeper with an Internal Quorum Controller

Kafka 2.8 replaces the external ZooKeeper dependency with an internal Quorum controller (KRaft), enabling ZooKeeper‑free deployments that reduce resource usage, improve performance, support larger clusters, but currently lack some features such as ACLs, transactions, and partition reassignment, making it unsuitable for production yet.

KRaftKafkaMessage Queue

0 likes · 4 min read

Kafka 2.8 Introduces KRaft: Removing ZooKeeper with an Internal Quorum Controller

Java High-Performance Architecture

May 7, 2021 · Big Data

Mastering Kafka Partitions: Boost Scalability, Fault Tolerance, and Ordering

Kafka partitions are the fundamental storage units that enable topics to scale horizontally, maintain message order within each partition, provide fault‑tolerance through replication, and support parallel consumption, with various write strategies such as key‑based, round‑robin, and custom rules.

KafkaMessage OrderingPartitions

0 likes · 9 min read

Mastering Kafka Partitions: Boost Scalability, Fault Tolerance, and Ordering

Full-Stack Internet Architecture

May 6, 2021 · Big Data

Why Kafka Dropped Zookeeper in Version 2.8: Design Philosophy and Alternatives

The article explains the design philosophy behind Kafka 2.8’s removal of Zookeeper, reviews Zookeeper’s classic leader‑election use cases, highlights its limitations, and shows how the Raft protocol provides a decentralized alternative for high‑availability leader selection in distributed messaging systems.

KafkaRaftbig-data

0 likes · 8 min read

Why Kafka Dropped Zookeeper in Version 2.8: Design Philosophy and Alternatives

Java High-Performance Architecture

May 6, 2021 · Big Data

Why Kafka 2.8’s Raft Integration Eliminates ZooKeeper – Benefits & Caveats

Kafka 2.8.0 introduces a major improvement by implementing the Raft consensus algorithm, allowing the platform to operate without ZooKeeper, which reduces system complexity and improves efficiency, though the ZooKeeper‑less mode is still early‑stage and not recommended for production environments.

Distributed SystemsKafkaRaft

0 likes · 3 min read

Why Kafka 2.8’s Raft Integration Eliminates ZooKeeper – Benefits & Caveats

Sohu Tech Products

May 5, 2021 · Big Data

Kafka Architecture and Implementation Principles – Part 2

This article provides an in‑depth, English‑language explanation of Kafka's overall architecture, including the roles of producers, consumers, topics, partitions, replication, Zookeeper coordination, controller election, and the NIO‑based network model, helping readers understand both concepts and practical configuration implications.

KafkaZooKeeper

0 likes · 17 min read

Kafka Architecture and Implementation Principles – Part 2

DataFunTalk

May 2, 2021 · Big Data

Continuous Optimization and Practice of Flink at Kuaishou

This article presents Kuaishou's comprehensive engineering practices for improving Flink's stability, task startup latency, and SQL performance, including high‑availability Kafka connectors, fault‑recovery mechanisms, I/O reductions, asynchronous job upgrades, aggregation optimizations, and future resource‑utilization plans.

Big DataFlinkKafka

0 likes · 10 min read

Continuous Optimization and Practice of Flink at Kuaishou

IT Architects Alliance

May 1, 2021 · Big Data

Comprehensive Guide to ELK Stack (Elasticsearch, Logstash, Kibana) Installation, Configuration, and Architecture

This article provides a detailed overview of the ELK stack—including Elasticsearch, Logstash, Kibana, and Beats—explaining its components, why to use it for centralized log management, various deployment architectures, system tuning, security setup, and step‑by‑step installation and configuration commands for a production‑grade environment.

Big DataELKElasticsearch

0 likes · 22 min read

Comprehensive Guide to ELK Stack (Elasticsearch, Logstash, Kibana) Installation, Configuration, and Architecture

Programmer DD

Apr 30, 2021 · Big Data

Kafka 2.8.0 Release: Say Goodbye to ZooKeeper with Raft Metadata Mode

Kafka 2.8.0, released on April 19, 2021, introduces the groundbreaking Raft Metadata mode that eliminates the need for ZooKeeper, alongside numerous new features, bug fixes, and enhancements such as API controls for stream threads, SASL_SSL mutual TLS, and IP rate limiting.

Big DataKafkaRaft

0 likes · 5 min read

Kafka 2.8.0 Release: Say Goodbye to ZooKeeper with Raft Metadata Mode

IT Architects Alliance

Apr 28, 2021 · Industry Insights

Why Kafka Beats Redis List: A Deep Dive into Modern Message Queue Architecture

This article compares the fundamentals and limitations of Redis list‑based queues with Kafka and Pulsar, explaining how partitions, cursors, replication, segment storage, and consumption models overcome Redis’s drawbacks and enable high‑performance, scalable, and highly available messaging systems.

Distributed SystemsKafkaMessage Queue

0 likes · 30 min read

Architect

Apr 27, 2021 · Fundamentals

Understanding Message Queue Architectures: Redis List, Kafka, and Pulsar

This article compares the fundamentals and design trade‑offs of popular message‑queue middleware—Redis list, Kafka, and Pulsar—explaining their data structures, partitioning, persistence, consumer models, high‑availability mechanisms, and scalability challenges for developers and architects.

KafkaPulsarredis

0 likes · 28 min read

Understanding Message Queue Architectures: Redis List, Kafka, and Pulsar

Java High-Performance Architecture

Apr 26, 2021 · Fundamentals

Visualizing Kafka: Core Concepts Explained with Diagrams

This article provides a visual walkthrough of Kafka's fundamental concepts—including producers, consumers, topics, partitions, and cluster architecture—illustrated with diagrams to help readers clearly understand how messages flow and are stored in a distributed streaming system.

Distributed SystemsKafkaPartitions

0 likes · 6 min read

Intelligent Backend & Architecture

Apr 23, 2021 · Backend Development

Why Message Queues Are Essential: Benefits, Pitfalls, and Best Practices

This article explains the role of message queues in handling traffic spikes, decoupling services, and ensuring reliability, compares popular MQ solutions such as RabbitMQ, Kafka, RocketMQ and ActiveMQ, and discusses their architectures, advantages, drawbacks, idempotency, ordering, high‑availability and scaling strategies.

IdempotencyKafkaMQ

0 likes · 38 min read

Why Message Queues Are Essential: Benefits, Pitfalls, and Best Practices

IT Architects Alliance

Apr 20, 2021 · Big Data

Real-time Log Processing System Based on Flink and Drools

This article describes a real-time log processing platform that integrates Kafka, Flink, Drools rule engine, Redis, and Elasticsearch to unify heterogeneous log formats, extract business metrics, and provide configurable, dynamic data processing for large‑scale logging scenarios.

DroolsElasticsearchFlink

0 likes · 6 min read

Real-time Log Processing System Based on Flink and Drools

ITFLY8 Architecture Home

Apr 19, 2021 · Backend Development

Mastering Distributed Systems: CAP, Redis, Zookeeper, Kafka and More

This comprehensive guide covers core distributed system theories, CAP consistency, transaction protocols, Redis features and persistence, Zookeeper coordination, message queue fundamentals, Kafka performance tricks, and practical solutions for caching, locking, and high‑concurrency scenarios.

Distributed SystemsKafkaZooKeeper

0 likes · 34 min read

Mastering Distributed Systems: CAP, Redis, Zookeeper, Kafka and More

dbaplus Community

Apr 17, 2021 · Big Data

How a Traditional Finance Firm Tackles Real‑Time Analytics with Flink

This article details a financial company's exploration of Apache Flink for real‑time processing, covering its unique business constraints, end‑to‑end data pipeline, single‑table and multi‑table use cases, implementation challenges, code snippets, data initialization, testing strategies, and lessons learned.

FinancialFlinkHBase

0 likes · 13 min read

How a Traditional Finance Firm Tackles Real‑Time Analytics with Flink

Tencent Cloud Developer

Apr 12, 2021 · Backend Development

Understanding Message Queues: From Basic Queues to Redis, Kafka, and Pulsar

The article compares basic in‑memory queues, Redis lists and streams, Kafka’s partitioned log architecture, and Pulsar’s compute‑storage separation, explaining each system’s core mechanisms, strengths, and limitations so readers can choose the most suitable message‑queue solution for their workloads.

KafkaMessage QueuePulsar

0 likes · 29 min read

Understanding Message Queues: From Basic Queues to Redis, Kafka, and Pulsar

Spring Full-Stack Practical Cases

Apr 12, 2021 · Big Data

Master Kafka’s Delivery Guarantees: At‑Most, At‑Least, and Exactly‑Once Explained

This article explains Kafka’s three delivery semantics—At most once, At least once, and Exactly once—from both producer and consumer perspectives, details the required configuration settings, and discusses how Kafka ensures idempotence, transaction support, and prevents data loss, duplication, and ordering issues.

Delivery GuaranteesExactly-OnceIdempotence

0 likes · 23 min read

Master Kafka’s Delivery Guarantees: At‑Most, At‑Least, and Exactly‑Once Explained

iQIYI Technical Product Team

Apr 9, 2021 · Big Data

Real-Time Data Warehouse at iQIYI Video Production Using Spark and ClickHouse

To meet iQIYI video production’s thousands‑QPS, petabyte‑scale, frequently‑updated data and large‑table join requirements, the team built a Spark‑plus‑ClickHouse real‑time warehouse that streams Kafka changes, joins HBase dimensions, and writes to ClickHouse, reducing reporting development time from days to hours while supporting both offline and real‑time analytics.

HBaseKafkaSpark

0 likes · 12 min read

Real-Time Data Warehouse at iQIYI Video Production Using Spark and ClickHouse

Code Ape Tech Column

Apr 9, 2021 · Backend Development

Comprehensive Comparison of Kafka, RabbitMQ, RocketMQ, and ActiveMQ Across 17 Dimensions

This article provides a detailed side‑by‑side analysis of four major distributed message‑queue systems—Kafka, RabbitMQ, RocketMQ, and ActiveMQ—examining them across seventeen criteria such as documentation, language support, protocols, storage, transactions, load balancing, clustering, management UI, availability, duplication, throughput, subscription models, ordering, acknowledgments, replay, retry, and concurrency.

ActiveMQDistributed SystemsKafka

0 likes · 22 min read

Comprehensive Comparison of Kafka, RabbitMQ, RocketMQ, and ActiveMQ Across 17 Dimensions

Full-Stack Internet Architecture

Apr 8, 2021 · Backend Development

Kafka Interview Guide: Concepts, Architecture, Configuration, and Performance

This article provides a comprehensive overview of Kafka, covering its role as a distributed messaging middleware, core concepts, architecture components, common interview questions, command‑line tools, producer and consumer configurations, high‑availability mechanisms, delivery semantics, and performance optimizations for backend developers.

ConsumerDistributed MessagingKafka

0 likes · 20 min read

Kafka Interview Guide: Concepts, Architecture, Configuration, and Performance

Aikesheng Open Source Community

Apr 3, 2021 · Databases

DTLE 3.21.03.0 Release Notes – New Features, Fixes, and Download Links

The DTLE 3.21.03.0 release introduces new MySQL data‑transfer configurations, performance improvements for Kafka full copies, and several bug fixes, while providing repository, documentation, and download instructions for users seeking a reliable open‑source replication component.

DTLEDatabase ToolsKafka

0 likes · 4 min read

DTLE 3.21.03.0 Release Notes – New Features, Fixes, and Download Links

Wukong Talks Architecture

Apr 3, 2021 · Backend Development

Two Years of Kafka in a Restaurant Order System: Problems, Solutions, and Lessons Learned

This article recounts the author's two‑year experience with Kafka in a high‑traffic restaurant ordering system, detailing why message ordering matters, the pitfalls of synchronous retries, message backlog, partition routing, primary‑key conflicts, database replication lag, and practical mitigation strategies for reliable backend processing.

Kafkadistributed-systemsmessage-queue

0 likes · 17 min read

Two Years of Kafka in a Restaurant Order System: Problems, Solutions, and Lessons Learned

dbaplus Community

Apr 1, 2021 · Backend Development

Avoid Kafka Pitfalls: Ensuring Message Order, Handling Retries, and Preventing Backlog

This article shares a two‑year journey of using Kafka in a high‑traffic restaurant ordering system, covering why message order matters, how network glitches and partition routing cause failures, and the practical retry, partition‑balancing, and database strategies that finally eliminated backlog and duplication issues.

KafkaMessage OrderingPartition Balancing

0 likes · 19 min read

Avoid Kafka Pitfalls: Ensuring Message Order, Handling Retries, and Preventing Backlog

Senior Brother's Insights

Mar 29, 2021 · Backend Development

Why Does a Single Kafka Broker Failure Break All Consumers?

A Kafka broker outage can halt consumer consumption despite remaining brokers, due to replication settings, ISR mechanics, and the internal __consumer_offsets topic’s default replication factor, which this article explains and resolves with practical configuration steps.

ACKBackendConsumer Offsets

0 likes · 11 min read

Why Does a Single Kafka Broker Failure Break All Consumers?

Programmer DD

Mar 29, 2021 · Big Data

Mastering Kafka: High‑Throughput Distributed Messaging Explained

This comprehensive guide introduces Kafka as a high‑throughput, distributed, publish‑subscribe messaging system, detailing its core concepts, architecture, features, replication, log management, reliability guarantees, and typical use cases such as log collection, real‑time analytics, and cross‑cluster mirroring.

Big DataDistributed MessagingKafka

0 likes · 15 min read

Mastering Kafka: High‑Throughput Distributed Messaging Explained

iQIYI Technical Product Team

Mar 26, 2021 · Big Data

Evolution of iQIYI's Real-Time Big Data Ecosystem

iQIYI transformed its data infrastructure from a traditional offline T+1 model to a comprehensive real‑time ecosystem—leveraging Kafka, Flink, a three‑layer Stream Data Service Platform, the Talos drag‑and‑drop pipeline, and a Druid‑based analytics platform—to enable low‑latency monitoring, personalized recommendations, ad targeting, and continuous machine‑learning workflows while planning future stream‑batch integration and lake‑warehouse convergence.

AnalyticsBig DataFlink

0 likes · 13 min read

Evolution of iQIYI's Real-Time Big Data Ecosystem

Kuaishou Tech

Mar 25, 2021 · Backend Development

Designing a High‑Availability Cache Consistency Solution for the Creator Red Packet System

This article explains how the creator red‑packet feature was engineered to guarantee idempotent, fault‑tolerant, and high‑throughput red‑packet claims by using multi‑level caching, empty placeholders, binlog‑driven synchronization, active cache invalidation, ordered Kafka consumption, and fallback strategies to resolve cache‑DB consistency issues.

BinlogCache ConsistencyIdempotency

0 likes · 11 min read

Designing a High‑Availability Cache Consistency Solution for the Creator Red Packet System

Architecture Digest

Mar 25, 2021 · Big Data

Uber's Multi-Region Kafka Architecture and Disaster Recovery

This article explains how Uber built a multi‑region Kafka infrastructure with disaster‑recovery capabilities, detailing its replication topology, active/active and active/passive consumption modes, offset‑management service, and the challenges of ensuring reliable, low‑latency data streaming across regions.

Data StreamingKafkaOffset Management

0 likes · 9 min read

Uber's Multi-Region Kafka Architecture and Disaster Recovery

Architecture Digest

Mar 14, 2021 · Big Data

Understanding Kafka Message Loss: Causes and Mitigation in Broker, Producer, and Consumer

This article explains why Kafka can lose messages at the broker, producer, and consumer levels, describes the underlying mechanisms such as page cache flushing and acknowledgment settings, and provides practical code examples and mitigation strategies to improve reliability.

ACKBrokerConsumer

0 likes · 12 min read

Understanding Kafka Message Loss: Causes and Mitigation in Broker, Producer, and Consumer

ITPUB

Mar 12, 2021 · Backend Development

How to Sync MySQL Data to Elasticsearch in Real-Time Using Binlog and Kafka

This article explains how a growing e‑commerce platform replaced a heavyweight MySQL intermediate table with a binlog‑driven pipeline that streams changes to Elasticsearch via Kafka, detailing the architecture, modules, customizations, monitoring, and performance results.

BackendBinlogElasticsearch

0 likes · 13 min read

How to Sync MySQL Data to Elasticsearch in Real-Time Using Binlog and Kafka

Top Architect

Mar 9, 2021 · Backend Development

Handling Kafka Consumer Failures and Retry Strategies in Microservices

This article explains how Apache Kafka is used for asynchronous microservice communication, identifies the common pitfall of consumer message failures, and evaluates retry‑topic patterns, their drawbacks, and alternative approaches such as back‑off retries and hidden topics while preserving message ordering and data consistency.

Consumer FailureKafkaMicroservices

0 likes · 23 min read

Handling Kafka Consumer Failures and Retry Strategies in Microservices

dbaplus Community

Mar 2, 2021 · Databases

How ByteDance Scaled Real‑Time Analytics with ClickHouse and Kafka Engine

This article details ByteDance's evolution from offline ClickHouse ingestion to a robust real‑time analytics pipeline, covering external transaction handling, risks of direct INSERTs, recommendation and ad‑delivery use cases, Kafka Engine design, multi‑threaded consumption, fault‑tolerance improvements, platform tooling, and future roadmap.

KafkaReal-time analyticsbackend-development

0 likes · 22 min read

How ByteDance Scaled Real‑Time Analytics with ClickHouse and Kafka Engine

Code Ape Tech Column

Feb 26, 2021 · Backend Development

Why Kafka Messages Get Lost and How to Prevent It

This article explains the three places where Kafka can lose messages—Broker, Producer, and Consumer—detailing the underlying mechanisms, the impact of flush and ack settings, and practical configuration and coding strategies to minimize data loss.

Ack SettingsBrokerConfiguration

0 likes · 15 min read

Why Kafka Messages Get Lost and How to Prevent It

dbaplus Community

Feb 23, 2021 · Big Data

How NetEase Game Teams Built a Scalable Flink‑Based Streaming ETL Platform

This article explains how NetEase games collect heterogeneous logs, design a Flink‑driven streaming ETL pipeline, handle schema‑free sources, implement Python UDFs with Jython, optimize HDFS writes, manage real‑time and offline warehouses, and share practical tuning and fault‑tolerance techniques.

ETLFlinkKafka

0 likes · 22 min read

How NetEase Game Teams Built a Scalable Flink‑Based Streaming ETL Platform

Programmer DD

Feb 20, 2021 · Big Data

How Uber Built a Multi‑Region Kafka Architecture for Disaster Recovery

Uber operates the world’s largest Kafka cluster, handling trillions of messages daily, and has engineered a multi‑region deployment with active/active and active/passive consumption modes, offset management, and uReplicator to ensure high‑availability and seamless disaster recovery across data centers.

Active-ActiveActive-PassiveKafka

0 likes · 10 min read

How Uber Built a Multi‑Region Kafka Architecture for Disaster Recovery

ITFLY8 Architecture Home

Feb 10, 2021 · Big Data

How We Migrated Over 2 Billion MySQL Records to Google BigQuery Using Kafka

This case study explains how a client with a 2‑billion‑row MySQL table avoided disk exhaustion and poor query performance by streaming data through Kafka to Google BigQuery, partitioning the data, and cleaning it to reclaim storage—all without downtime.

BigQueryData MigrationKafka

0 likes · 8 min read

How We Migrated Over 2 Billion MySQL Records to Google BigQuery Using Kafka

Alibaba Cloud Native

Feb 8, 2021 · Big Data

How Serverless Architecture Supercharges Game Data Collection and Scaling

This article explains how to build a highly scalable, cost‑effective game data collection pipeline using Serverless function compute, Kafka, and cloud services, covering architecture design, function implementation, deployment with Fun, Kafka configuration, and performance testing to handle massive traffic spikes.

Big Data IngestionFunction ComputeKafka

0 likes · 22 min read

How Serverless Architecture Supercharges Game Data Collection and Scaling

Practical DevOps Architecture

Feb 8, 2021 · Backend Development

Comparison of Common Message Queues: ActiveMQ, RocketMQ, and Kafka

This article compares ActiveMQ, RocketMQ, and Kafka across multiple dimensions such as messaging models, API completeness, language support, throughput, latency, availability, message loss risk, documentation, community activity, and commercial backing, helping readers choose the most suitable queue for their backend needs.

ActiveMQBackendComparison

0 likes · 4 min read

Comparison of Common Message Queues: ActiveMQ, RocketMQ, and Kafka

Open Source Linux

Feb 7, 2021 · Big Data

Mastering Kafka: Core Concepts, Architecture, and High‑Performance Deployment

This comprehensive guide explains Kafka's role as a message system, detailing topics, partitions, producers, consumers, replication, controller, ZooKeeper coordination, performance optimizations like sequential writes and zero‑copy, and practical recommendations for hardware, configuration, and cluster deployment.

Big DataCluster DeploymentKafka

0 likes · 22 min read

Mastering Kafka: Core Concepts, Architecture, and High‑Performance Deployment