Tagged articles
1273 articles
Page 7 of 13
Selected Java Interview Questions
Selected Java Interview Questions
Feb 5, 2022 · Backend Development

Message Queue Fundamentals: Use Cases, Product Comparison, High Availability, and Reliability Strategies

This article explains why message queues are used, outlines common scenarios such as decoupling, asynchronous processing and traffic shaping, compares major MQ products, and provides practical guidance on high availability, preventing loss, duplicate consumption, ordering, backlog handling, and expiration.

KafkaMessage QueueRabbitMQ
0 likes · 8 min read
Message Queue Fundamentals: Use Cases, Product Comparison, High Availability, and Reliability Strategies
Code Ape Tech Column
Code Ape Tech Column
Feb 4, 2022 · Backend Development

Ensuring Zero Message Loss in MQ Systems: Interview Strategies and Solutions

This article explains how to guarantee that messages are never lost when using MQ middleware such as Kafka, RabbitMQ, or RocketMQ, outlines the key interview points, and provides practical design patterns, detection mechanisms, idempotency, and scaling strategies for reliable message delivery.

Distributed SystemsKafkaMessage Queue
0 likes · 13 min read
Ensuring Zero Message Loss in MQ Systems: Interview Strategies and Solutions
IT Architects Alliance
IT Architects Alliance
Feb 3, 2022 · Backend Development

Common Issues in Message Queues and Distributed Transaction Solutions

This article explains the typical problems encountered with message queues, such as message loss, duplicate delivery, and distributed transaction handling, and details various solutions including local message tables, MQ‑based transactions, and the specific mechanisms used by RocketMQ, Kafka, and RabbitMQ to ensure reliability and consistency.

KafkaMQMessage Queue
0 likes · 20 min read
Common Issues in Message Queues and Distributed Transaction Solutions
DataFunTalk
DataFunTalk
Feb 1, 2022 · Big Data

Kafka at Meituan: Practices, Challenges, and Optimizations for Large‑Scale Data Platforms

This article presents Meituan's large‑scale Kafka deployment, describing the current state and challenges of massive data ingestion, detailing latency‑reduction techniques, cluster‑level optimizations, SSD‑based caching, isolation strategies, full‑link monitoring, lifecycle management, and future directions for high availability.

Cluster ManagementKafkaMeituan
0 likes · 22 min read
Kafka at Meituan: Practices, Challenges, and Optimizations for Large‑Scale Data Platforms
Top Architect
Top Architect
Jan 31, 2022 · Backend Development

Delayed Queue Technology Research and Implementation Overview

This article surveys various delayed queue implementations—including Kafka, RocketMQ, Redis (Redisson), and Netty’s HashedWheelTimer—examining their design principles, advantages, drawbacks, and integration strategies, and proposes a unified micro‑service architecture leveraging Kafka topics, Redis ZSETs, and thread‑pool optimizations for reliable message scheduling.

KafkaMicroservicesNetty
0 likes · 15 min read
Delayed Queue Technology Research and Implementation Overview
Aikesheng Open Source Community
Aikesheng Open Source Community
Jan 27, 2022 · Databases

DTLE 4.22.01.0 Release Notes – New Features and Bug Fixes

DTLE 4.22.01.0, an open‑source MySQL‑focused data transfer component, has been released, introducing Oracle‑MySQL incremental DDL/DML sync, enhanced WHERE clause support, initial Kafka schema‑change handling, and numerous bug fixes including binlog relay issues, character‑set handling, and CREATE TABLE … LIKE renaming problems.

DTLEData TransferKafka
0 likes · 4 min read
DTLE 4.22.01.0 Release Notes – New Features and Bug Fixes
HomeTech
HomeTech
Jan 26, 2022 · Operations

Design and Practice of Autohome's Performance Testing Platform PTS

The article details the architecture, key components, testing types, and operational results of Autohome's PTS platform, which uses Docker Swarm, gRPC, JMeter, Flume‑Kafka, and Flink to conduct large‑scale distributed load testing for the 818 event and outlines future improvements toward Kubernetes and direct Kafka logging.

Docker SwarmFlinkJMeter
0 likes · 8 min read
Design and Practice of Autohome's Performance Testing Platform PTS
ByteDance Data Platform
ByteDance Data Platform
Jan 17, 2022 · Big Data

How ByteHouse Scales Real‑Time Analytics on ClickHouse: Challenges & Solutions

This article details ByteHouse’s evolution from ClickHouse, presenting two real‑time analytics use cases, the technical selection process, performance bottlenecks such as write throughput and Kafka consumption, and the engineered solutions—including asynchronous indexing, multi‑threaded Kafka engines, and enhanced Buffer engines—that enable reliable, high‑throughput data processing at massive scale.

ByteHouseKafkaReal-time analytics
0 likes · 11 min read
How ByteHouse Scales Real‑Time Analytics on ClickHouse: Challenges & Solutions
Big Data Technology & Architecture
Big Data Technology & Architecture
Jan 12, 2022 · Big Data

Common Production Issues and Troubleshooting Guide for Apache Flink

This article compiles a comprehensive list of common production problems encountered with Apache Flink, covering cluster sizing, checkpoint failures, backpressure analysis, resource allocation, deployment errors, UDF definitions, data skew, Kafka configurations, and provides detailed troubleshooting steps and best‑practice recommendations.

Apache FlinkCheckpointKafka
0 likes · 39 min read
Common Production Issues and Troubleshooting Guide for Apache Flink
Java Interview Crash Guide
Java Interview Crash Guide
Jan 12, 2022 · Backend Development

Designing Scalable Delayed Queues: Kafka, RocketMQ, Redis & More

This article surveys delayed‑queue implementations, comparing Kafka, RocketMQ, and Redis (Redisson) designs, outlining their architectures, pros and cons, integration details, timing‑wheel mechanisms, and practical considerations for building a reliable distributed delayed‑queue service.

Distributed SystemsKafkaTime Wheel
0 likes · 13 min read
Designing Scalable Delayed Queues: Kafka, RocketMQ, Redis & More
Su San Talks Tech
Su San Talks Tech
Jan 7, 2022 · Big Data

Unlock Kafka’s Speed: Deep Dive into Performance Optimizations

This article explores Kafka’s performance architecture, covering network and disk bottlenecks, sequential writes, zero‑copy techniques, page cache usage, Reactor‑based networking, batch processing, compression, partition concurrency, and file structures, providing practical optimization methods for high‑throughput streaming applications.

Kafkadisk-iojava
0 likes · 17 min read
Unlock Kafka’s Speed: Deep Dive into Performance Optimizations
21CTO
21CTO
Dec 25, 2021 · Backend Development

Message Queue Mastery: Interview Q&A, Pros/Cons, and Choosing the Best MQ

This article explains why message queues are used, outlines core interview questions, analyzes interviewers' expectations, compares major MQ products, and provides practical guidance on selecting the right queue for different scenarios.

Backend ArchitectureKafkaMessage Queue
0 likes · 12 min read
Message Queue Mastery: Interview Q&A, Pros/Cons, and Choosing the Best MQ
21CTO
21CTO
Dec 23, 2021 · Backend Development

Choosing the Right Message Broker for Microservices: RabbitMQ vs Kafka vs Redis

This article explains the difference between synchronous and asynchronous microservice communication, outlines the benefits of async messaging, and compares three popular message brokers—RabbitMQ, Kafka, and Redis—based on scale, persistence, and consumer patterns, offering guidance on selecting the right broker for various use cases.

KafkaRabbitMQredis
0 likes · 8 min read
Choosing the Right Message Broker for Microservices: RabbitMQ vs Kafka vs Redis
Architecture Digest
Architecture Digest
Dec 20, 2021 · Backend Development

Understanding Kafka: Core Design, Architecture, and Performance

This article explains Kafka’s fundamental design concepts—including topics, partitions, replicas, consumer groups, and its network architecture—while highlighting performance features such as sequential writes, zero‑copy, log segmentation, and how the controller coordinates with ZooKeeper, providing a comprehensive overview for backend developers.

Big DataKafkaMessage Queue
0 likes · 12 min read
Understanding Kafka: Core Design, Architecture, and Performance
HelloTech
HelloTech
Dec 13, 2021 · Big Data

Smart Matching Engine for Ride-Sharing: Technical Implementation and Algorithms

The Smart Matching Engine for Haolo’s ride‑sharing service ingests driver and passenger orders via Kafka‑Flink pipelines into Elasticsearch, then applies multi‑stage matching—nearby search, itinerary‑based filtering using ETA, angle, distance, route‑similarity and shared‑mileage calculations—and finally ranks results with evolving pre‑sorting and algorithmic models, including PMML and deep‑learning, to optimize driver‑passenger pairing.

ElasticsearchFlinkKafka
0 likes · 9 min read
Smart Matching Engine for Ride-Sharing: Technical Implementation and Algorithms
Wukong Talks Architecture
Wukong Talks Architecture
Dec 8, 2021 · Big Data

Understanding Kafka Core Concepts: Architecture, Messaging Models, Partitioning, Consumer Groups, and Reliability

This article provides a comprehensive overview of Kafka, covering its layered architecture with Zookeeper, core concepts such as topics, partitions and consumer groups, communication workflow, partition selection strategies, rebalancing mechanisms, reliability configurations, replica synchronization, and reasons for moving away from Zookeeper, all explained in clear English.

Distributed SystemsKafkaReliability
0 likes · 19 min read
Understanding Kafka Core Concepts: Architecture, Messaging Models, Partitioning, Consumer Groups, and Reliability
DataFunSummit
DataFunSummit
Dec 6, 2021 · Big Data

Design and Performance Optimization of a Real‑Time Billion‑Scale Data Processing Pipeline

This article reviews the background, architecture, and a series of performance‑optimizing techniques—including consumption, batch, storage, and execution‑engine tweaks—applied to a real‑time pipeline that processes hundreds of billions of records daily, and presents the resulting resource savings and latency improvements.

KafkaReal-time ProcessingSparkSQL
0 likes · 9 min read
Design and Performance Optimization of a Real‑Time Billion‑Scale Data Processing Pipeline
IT Architects Alliance
IT Architects Alliance
Dec 3, 2021 · Backend Development

Design and Performance Optimization of a High‑Concurrency WebSocket Gateway (Version 2.0)

This article details the evolution from a Node.js‑based WebSocket gateway to a Go‑implemented, gRPC‑driven architecture, describing the redesign of connection handling, TLS off‑loading, socket ID generation, session management, custom Kafka headers, code‑level optimizations, and extensive performance testing that validates the new gateway’s scalability and resource efficiency.

GoKafkaWebSocket
0 likes · 22 min read
Design and Performance Optimization of a High‑Concurrency WebSocket Gateway (Version 2.0)
Top Architect
Top Architect
Dec 2, 2021 · Backend Development

Design and Performance Optimization of a High‑Scale WebSocket Gateway (Version 2.0)

This article describes the architectural evolution from Gateway 1.0 to Gateway 2.0 for a high‑traffic document collaboration platform, detailing the redesign of the WebSocket layer, resource‑usage optimizations, heartbeat mechanisms, custom Kafka headers, message serialization, load‑testing results, and the operational lessons learned to support millions of concurrent connections.

GoKafkaMicroservices
0 likes · 22 min read
Design and Performance Optimization of a High‑Scale WebSocket Gateway (Version 2.0)
Architecture Digest
Architecture Digest
Dec 2, 2021 · Backend Development

Design and Performance Optimization of a Million‑Scale WebSocket Gateway at Shimo Docs

This article details the redesign of Shimo Docs' WebSocket gateway—from a Node.js/Socket.IO 1.0 version to a Go‑based 2.0 architecture—covering handshake degradation, TLS memory savings, SnowFlake SocketID generation, Redis‑based session broadcasting, heartbeat tuning, custom Kafka headers, object pooling, MessagePack compression, extensive performance testing, and the resulting stability and scalability improvements for handling half‑a‑million concurrent connections.

GoKafkaMicroservices
0 likes · 23 min read
Design and Performance Optimization of a Million‑Scale WebSocket Gateway at Shimo Docs
21CTO
21CTO
Nov 30, 2021 · Backend Development

Building a Million-Scale WebSocket Gateway: Architecture, Optimization & Performance

This article details the design, refactoring, and performance testing of a high‑traffic WebSocket gateway for Shimo Docs, covering the evolution from a Node.js Socket.IO version to a Go‑based microservice architecture, TLS memory tuning, socket ID generation, heartbeat handling, custom Kafka headers, and resource‑efficient scaling to half‑a‑million concurrent connections.

GoKafkaWebSocket
0 likes · 25 min read
Building a Million-Scale WebSocket Gateway: Architecture, Optimization & Performance
High Availability Architecture
High Availability Architecture
Nov 30, 2021 · Backend Development

Design and Performance Optimization of a Million‑Scale WebSocket Gateway

This article details the evolution from a Node.js Socket.IO gateway to a Go‑based, gRPC‑enabled WebSocket gateway that supports millions of concurrent connections, covering architecture redesign, TLS handling, socket ID generation, session management, heartbeat optimization, custom Kafka headers, code‑level refinements, and extensive performance testing results.

GoKafkaWebSocket
0 likes · 21 min read
Design and Performance Optimization of a Million‑Scale WebSocket Gateway
dbaplus Community
dbaplus Community
Nov 22, 2021 · Databases

Transforming MySQL Monitoring: From Nagios to Kafka‑Powered Alerts

Qunar’s DBA team overhauled their MySQL monitoring and alert system—originally built on Nagios and NRPE—by integrating a Kafka‑based pipeline, a custom alarm service, and MySQL‑stored alert templates, achieving flexible thresholds, granular silencing, high‑availability processing, and early‑stage intelligent management of alerts, slow queries, and disk space.

AlertingDBAKafka
0 likes · 14 min read
Transforming MySQL Monitoring: From Nagios to Kafka‑Powered Alerts
Big Data Technology & Architecture
Big Data Technology & Architecture
Nov 22, 2021 · Big Data

Comprehensive Big Data Learning Path and Resource Guide

This article presents a detailed learning roadmap for aspiring big‑data experts, covering foundational programming languages, data structures, Linux basics, databases, distributed system theory, and essential frameworks such as Hadoop, Spark, Flink, Kafka, and provides curated B‑site video links and reference materials.

Big DataFlinkHadoop
0 likes · 9 min read
Comprehensive Big Data Learning Path and Resource Guide
IT Architects Alliance
IT Architects Alliance
Nov 22, 2021 · Backend Development

Understanding Message Queues, JMS, and Kafka: A Practical Guide

This article demystifies message queues by comparing them to parcel lockers, explains their role in asynchronous processing and system decoupling, details point‑to‑point and publish‑subscribe models, introduces JMS, AMQP, RabbitMQ, ActiveMQ, RocketMQ, and Kafka, and outlines their architectures and high‑availability mechanisms.

Backend ArchitectureJMSKafka
0 likes · 16 min read
Understanding Message Queues, JMS, and Kafka: A Practical Guide
Top Architect
Top Architect
Nov 19, 2021 · Big Data

Understanding Kafka’s Design: Topics, Partitions, Consumer Groups, and Cluster Architecture

This article explains Kafka’s core design concepts—including the role of a message system, topics, partitions, producers, consumers, consumer groups, replica management, controller coordination with Zookeeper, performance optimizations like sequential writes and zero‑copy, and its network thread model—illustrated with diagrams and code snippets.

Cluster ArchitectureKafkaconsumer groups
0 likes · 12 min read
Understanding Kafka’s Design: Topics, Partitions, Consumer Groups, and Cluster Architecture
Big Data Technology & Architecture
Big Data Technology & Architecture
Nov 15, 2021 · Operations

A Comprehensive Overview of Kafka Monitoring Tools

This article provides a comprehensive overview of popular Kafka monitoring solutions—including JMX, Kafka Manager (CMAK), Kafka Eagle, and Logi‑KafkaManager—detailing their features, installation steps, configuration examples, and comparative advantages, while also mentioning custom setups using JMXTrans, InfluxDB, and Grafana.

CMAKKafkaKafka Eagle
0 likes · 8 min read
A Comprehensive Overview of Kafka Monitoring Tools
IT Architects Alliance
IT Architects Alliance
Nov 15, 2021 · Industry Insights

Why Kafka’s Topic‑Partition Design Powers Scalable Messaging

This article explains Kafka’s core architecture—including topics, partitions, replication, consumer groups, controller coordination with Zookeeper, and performance tricks like sequential writes and zero‑copy—to show how it achieves high‑throughput, fault‑tolerant messaging for large‑scale systems.

Distributed SystemsKafkaMessage Queue
0 likes · 11 min read
Why Kafka’s Topic‑Partition Design Powers Scalable Messaging
dbaplus Community
dbaplus Community
Nov 10, 2021 · Databases

How HuoLaLa Built a Hybrid‑Cloud Database Governance Platform

This article outlines HuoLaLa's journey from a fragmented multi‑cloud environment to a unified, platform‑driven database governance system, detailing the technical challenges, architectural decisions, key components for MySQL, Redis, Kafka, and other middleware, and the measurable stability and cost improvements achieved after a year of operation.

Database GovernanceKafkahybrid cloud
0 likes · 44 min read
How HuoLaLa Built a Hybrid‑Cloud Database Governance Platform
IT Architects Alliance
IT Architects Alliance
Nov 8, 2021 · Big Data

Kafka Core Concepts, Architecture, Performance Optimization, and Operational Practices

This comprehensive guide explains Kafka's core value for decoupling and asynchronous processing, details its producer‑consumer model, cluster architecture, log segmentation, zero‑copy I/O, hardware sizing, network planning, operational commands, throughput tuning, exception handling, consumer group mechanics, offset management, rebalance strategies, and internal mechanisms such as LEO/HW, controllers, delayed tasks, and time‑wheel scheduling.

Cluster DeploymentKafkaMessage Queue
0 likes · 31 min read
Kafka Core Concepts, Architecture, Performance Optimization, and Operational Practices
Tencent Qidian Tech Team
Tencent Qidian Tech Team
Nov 1, 2021 · Backend Development

How to Build a Scalable Distributed Timer with Redis and Time Wheel

This article explains the design of a distributed timer service using a time‑wheel data structure stored in Redis, covering application scenarios, required features, architecture components such as access layer, scheduler, worker, and management center, and detailing reliability and performance techniques.

KafkaTime Wheeldistributed timer
0 likes · 13 min read
How to Build a Scalable Distributed Timer with Redis and Time Wheel
Top Architect
Top Architect
Oct 30, 2021 · Backend Development

RabbitMQ vs Kafka: Architectural Comparison and Throughput Considerations

This article compares RabbitMQ and Kafka by explaining their architectures, queue consumption and production mechanisms, highlighting the master‑mirror design of RabbitMQ versus Kafka's partitioned approach, and summarizing their throughput characteristics to help engineers choose the appropriate messaging middleware for their needs.

KafkaRabbitMQThroughput
0 likes · 7 min read
RabbitMQ vs Kafka: Architectural Comparison and Throughput Considerations
MaGe Linux Operations
MaGe Linux Operations
Oct 28, 2021 · Fundamentals

Demystifying Message Queues, JMS, and Kafka: A Beginner’s Guide

This article explains the fundamentals of message queues, their role in asynchronous processing and system decoupling, compares point‑to‑point and publish‑subscribe patterns, introduces Java Message Service (JMS) and its API, and outlines popular implementations such as ActiveMQ, RabbitMQ, RocketMQ, and Kafka with their architectures and high‑availability mechanisms.

AMQPJMSKafka
0 likes · 15 min read
Demystifying Message Queues, JMS, and Kafka: A Beginner’s Guide
High Availability Architecture
High Availability Architecture
Oct 28, 2021 · Cloud Native

Design and Implementation Differences between Pulsar, Kafka, RocketMQ, and InLong‑TubeMQ

This article compares Pulsar with Kafka, RocketMQ and InLong‑TubeMQ, focusing on cloud‑native multi‑tenant design, multi‑level flow control, compute‑storage separation, partitioning strategies, metadata storage, and message delivery mechanisms, highlighting each system's architectural advantages and trade‑offs.

Cloud NativeInLong‑TubeMQKafka
0 likes · 18 min read
Design and Implementation Differences between Pulsar, Kafka, RocketMQ, and InLong‑TubeMQ
Tencent Cloud Developer
Tencent Cloud Developer
Oct 26, 2021 · Cloud Native

Design and Implementation Differences between Pulsar, Kafka, RocketMQ, and InLong‑TubeMQ

Pulsar’s cloud‑native, multi‑tenant architecture with hierarchical flow control, stateless brokers separated from BookKeeper storage, global partitions supporting multiple consumers, push‑based dispatch, and precise acknowledgment outperforms Kafka, RocketMQ, and InLong‑TubeMQ, though its reliance on ZooKeeper and higher resource usage demand careful operational planning.

Flow ControlInLong‑TubeMQKafka
0 likes · 17 min read
Design and Implementation Differences between Pulsar, Kafka, RocketMQ, and InLong‑TubeMQ
MaGe Linux Operations
MaGe Linux Operations
Oct 16, 2021 · Operations

Why Does One Kafka Broker Failure Halt All Consumers? HA & Replication Explained

The article examines Kafka’s high‑availability mechanisms, detailing its multi‑replica design, ISR synchronization, leader election, and the critical role of the __consumer_offset topic, and explains why a single broker outage can render the entire cluster unusable unless replication factors are properly configured.

Consumer OffsetDistributed SystemsKafka
0 likes · 10 min read
Why Does One Kafka Broker Failure Halt All Consumers? HA & Replication Explained
Java High-Performance Architecture
Java High-Performance Architecture
Oct 14, 2021 · Operations

Build a Real‑Time Log Collection Pipeline with SpringBoot, Kafka, Filebeat, Logstash & Kibana

This guide walks through setting up a complete log‑collection and visualization pipeline—including SpringBoot log4j2 configuration, Kafka broker creation, Filebeat forwarding, Logstash processing, and Kibana dashboard setup—so you can capture, ship, and analyze application logs in real time.

ElasticsearchFilebeatKafka
0 likes · 17 min read
Build a Real‑Time Log Collection Pipeline with SpringBoot, Kafka, Filebeat, Logstash & Kibana
Architecture Digest
Architecture Digest
Oct 13, 2021 · Fundamentals

Introduction to Message Queues, JMS, and Kafka

This article explains the fundamentals of message queues, compares JMS and various MQ implementations such as RabbitMQ and Kafka, describes their architectures, usage scenarios, and key concepts like producers, consumers, topics, partitions, and high‑availability mechanisms.

AMQPJMSKafka
0 likes · 15 min read
Introduction to Message Queues, JMS, and Kafka
Big Data Technology Architecture
Big Data Technology Architecture
Oct 9, 2021 · Big Data

Apache Kafka 3.0 Release Highlights and New Features

Apache Kafka 3.0 introduces major enhancements including KRaft consensus, deprecation of Java 8 and Scala 2.12 support, stronger producer guarantees, updated APIs, improved Kafka Connect, MirrorMaker 2 flexibility, and numerous KIP-driven feature upgrades, marking a significant step forward for the distributed streaming platform.

KafkaKafka 3.0Streaming
0 likes · 13 min read
Apache Kafka 3.0 Release Highlights and New Features
Selected Java Interview Questions
Selected Java Interview Questions
Oct 9, 2021 · Backend Development

RocketMQ vs Kafka: Detailed Feature, Performance, and Reliability Comparison

This article provides a comprehensive comparison between RocketMQ and Kafka, covering data reliability, performance, queue capacity, real‑time delivery, retry mechanisms, ordering guarantees, scheduled messages, transactional support, query capabilities, message tracing, consumer parallelism, filtering, and commercial backing, helping engineers choose the right messaging middleware for their workloads.

Distributed SystemsKafkaMessage Queue
0 likes · 11 min read
RocketMQ vs Kafka: Detailed Feature, Performance, and Reliability Comparison
Tencent Cloud Developer
Tencent Cloud Developer
Oct 8, 2021 · Operations

Unveiling Kafka’s Controller: Architecture, Election, and Monitoring Deep Dive

This article provides a comprehensive technical analysis of Kafka’s Controller component, covering its background, core responsibilities, data storage, election process, version‑specific improvements, monitoring techniques, and key source‑code excerpts to help engineers understand and manage Kafka clusters effectively.

Cluster ManagementControllerDistributed Systems
0 likes · 27 min read
Unveiling Kafka’s Controller: Architecture, Election, and Monitoring Deep Dive
Efficient Ops
Efficient Ops
Oct 8, 2021 · Operations

Why a Single Kafka Broker Failure Can Halt the Entire Cluster

This article explains Kafka's high‑availability architecture, covering multi‑replica redundancy, ISR synchronization, producer ACK settings, and the critical role of the __consumer_offset topic, and shows how to configure replication factors to prevent a single‑node outage from stopping consumption.

Consumer OffsetKafkaReplication
0 likes · 11 min read
Why a Single Kafka Broker Failure Can Halt the Entire Cluster
21CTO
21CTO
Oct 6, 2021 · Big Data

Building a Real-Time TB-Scale Bill Query System with Kafka, Kudu, and Presto

This article details the design and implementation of a real‑time, TB‑scale bill‑detail query platform that leverages Kafka for streaming, Debezium and Confluent Platform for change capture, Kudu for low‑latency storage, and Presto/Kylin for fast OLAP queries, while outlining deployment, integration, and future enhancements.

KafkaKuduPresto
0 likes · 19 min read
Building a Real-Time TB-Scale Bill Query System with Kafka, Kudu, and Presto
Architect
Architect
Oct 6, 2021 · Big Data

Design and Implementation of a Real-time and Offline Integrated Query System

This article details the requirements, architecture, and implementation of a real-time and offline integrated query system, covering data ingestion via Debezium and Confluent Platform, storage in Kudu and HDFS, query engines Presto and Kylin, and strategies for data synchronization, partitioning, and scaling.

Big DataDebeziumKafka
0 likes · 19 min read
Design and Implementation of a Real-time and Offline Integrated Query System
Code Ape Tech Column
Code Ape Tech Column
Oct 6, 2021 · Operations

An Overview of Four Popular Distributed Message Queue Middleware: Kafka, ActiveMQ, RabbitMQ, and RocketMQ

This article introduces the concepts, core features, deployment environments, architectures, high‑availability solutions, advantages and disadvantages of four widely used open‑source distributed message queue systems—Kafka, ActiveMQ, RabbitMQ, and RocketMQ—and provides a comparative analysis to help practitioners choose the right middleware for their distributed applications.

ActiveMQDistributed SystemsKafka
0 likes · 37 min read
An Overview of Four Popular Distributed Message Queue Middleware: Kafka, ActiveMQ, RabbitMQ, and RocketMQ
Top Architect
Top Architect
Oct 2, 2021 · Backend Development

RabbitMQ vs Kafka: Architectural Comparison and Selection Guidance

This article compares the architectures of RabbitMQ and Kafka, explaining their queue models, master‑mirror and partition mechanisms, performance trade‑offs, and provides guidance on choosing between them based on throughput requirements and system design considerations.

KafkaMessage QueueRabbitMQ
0 likes · 9 min read
RabbitMQ vs Kafka: Architectural Comparison and Selection Guidance
Programmer DD
Programmer DD
Oct 1, 2021 · Backend Development

How Wix Scales 1400+ Microservices with Event‑Driven Kafka Patterns

This article explains how Wix engineers built a robust, event‑driven messaging infrastructure on Kafka that serves over 1,400 microservices, detailing six key patterns—including consumption‑projection, end‑to‑end event flows, in‑memory KV stores, scheduling, transactional events, and aggregation—that improve scalability, resilience, and operational simplicity.

Event-Driven ArchitectureKafkabackend design
0 likes · 19 min read
How Wix Scales 1400+ Microservices with Event‑Driven Kafka Patterns
macrozheng
macrozheng
Sep 22, 2021 · Backend Development

Build a Real-Time Log Pipeline with SpringBoot, Kafka, Filebeat, Logstash and Kibana

This guide walks through setting up a complete log‑collection and visualization pipeline—preparing servers, configuring a SpringBoot project with Log4j2, deploying Kafka, installing Filebeat, creating Logstash pipelines, and visualizing logs in Elasticsearch and Kibana—so you can monitor application logs in real time.

ElasticsearchFilebeatKafka
0 likes · 17 min read
Build a Real-Time Log Pipeline with SpringBoot, Kafka, Filebeat, Logstash and Kibana
Java Architect Essentials
Java Architect Essentials
Sep 21, 2021 · Big Data

Interview on Kuaishou's Billion‑Scale Big Data Architecture Evolution and Practices

The interview with Kuaishou senior architect Zhao Jianbo details the three‑phase evolution of its trillion‑scale big data platform, covering foundational Hadoop services, real‑time and OLAP extensions, deep customizations, Spring Festival Gala challenges, scheduling innovations, Hadoop usage, and the relationship between big data and cloud architectures.

Big DataFlinkHadoop
0 likes · 19 min read
Interview on Kuaishou's Billion‑Scale Big Data Architecture Evolution and Practices
Top Architect
Top Architect
Sep 17, 2021 · Backend Development

Kafka Storage Mechanism and Reliability Guarantees

This article explains Kafka's storage architecture, including segment files and indexing, and details the reliability mechanisms such as ISR, OSR, LEO, HW, producer acknowledgment levels, and leader election strategies to ensure data consistency and availability.

KafkaMessage QueueReliability
0 likes · 9 min read
Kafka Storage Mechanism and Reliability Guarantees
Top Architect
Top Architect
Sep 16, 2021 · Backend Development

Building a Log Collection and Visualization Pipeline with SpringBoot, Log4j2, Kafka, Filebeat, Logstash, Elasticsearch, and Kibana

This tutorial walks through the end‑to‑end setup of a logging pipeline that starts with a SpringBoot application using Log4j2, forwards logs to Kafka, collects them with Filebeat, processes them via Logstash, and finally visualizes them in Elasticsearch and Kibana, covering server preparation, configuration files, and essential code snippets.

ElasticsearchFilebeatKafka
0 likes · 17 min read
Building a Log Collection and Visualization Pipeline with SpringBoot, Log4j2, Kafka, Filebeat, Logstash, Elasticsearch, and Kibana
Open Source Linux
Open Source Linux
Sep 15, 2021 · Fundamentals

How DMA and Zero‑Copy Boost Linux I/O Performance

This article explains how DMA and zero‑copy techniques reduce the four memory copies and context switches typical of Linux I/O, detailing their mechanisms, implementations such as sendfile, mmap and Direct I/O, and real‑world usage in Kafka and MySQL to boost performance.

DMAI/O optimizationKafka
0 likes · 18 min read
How DMA and Zero‑Copy Boost Linux I/O Performance
Tencent Cloud Developer
Tencent Cloud Developer
Sep 14, 2021 · Backend Development

A Comparative Overview of Transactional Messaging in RocketMQ, Kafka, and Pulsar

The article compares how RocketMQ, Apache Kafka, and Apache Pulsar implement transactional messaging—each using a two‑phase commit with half‑messages or transaction buffers, distinct coordinators, and idempotent producers—to provide atomicity and either strong consistency (RocketMQ) or exactly‑once delivery for high‑throughput stream processing (Kafka and Pulsar).

Distributed SystemsExactly-OnceKafka
0 likes · 22 min read
A Comparative Overview of Transactional Messaging in RocketMQ, Kafka, and Pulsar
Liangxu Linux
Liangxu Linux
Sep 11, 2021 · Fundamentals

How DMA and Zero‑Copy Transform Linux I/O Performance

This article explains the costly four‑copy, four‑context‑switch data path in traditional Linux I/O, introduces DMA as a co‑processor that offloads memory transfers, describes zero‑copy techniques such as sendfile, mmap and Direct I/O, and shows how Kafka and MySQL leverage these methods to reduce CPU overhead and improve throughput.

DMADirect I/OKafka
0 likes · 16 min read
How DMA and Zero‑Copy Transform Linux I/O Performance
Ctrip Technology
Ctrip Technology
Sep 9, 2021 · Big Data

Building Data Lineage at Ctrip: Architecture, Implementation, and Real‑World Applications

This article describes how Ctrip built a data lineage system for its big data platform, covering the concept of data lineage, collection methods, open‑source tools such as Apache Atlas and DataHub, the in‑house table‑level and field‑level solutions, implementation details for Hive, Spark and Presto, storage in JanusGraph, and practical applications in data governance, metadata management, scheduling and sensitivity labeling.

Big DataJanusGraphKafka
0 likes · 16 min read
Building Data Lineage at Ctrip: Architecture, Implementation, and Real‑World Applications
dbaplus Community
dbaplus Community
Sep 8, 2021 · Operations

Why Does a Single Kafka Broker Crash Bring Down All Consumers?

An in‑depth look at Kafka’s high‑availability mechanisms reveals how multi‑replica design, ISR leader election, and the request.required.acks setting interact, why a single broker failure—especially of the __consumer_offset topic—can halt consumption, and how to configure replication factors to prevent such outages.

Distributed SystemsISRKafka
0 likes · 10 min read
Why Does a Single Kafka Broker Crash Bring Down All Consumers?
YunZhu Net Technology Team
YunZhu Net Technology Team
Sep 8, 2021 · Backend Development

Design and Performance Evaluation of a Scalable Like System

This article analyzes common like‑system business scenarios, derives functional requirements, proposes three architectural solutions (Redis‑based priority queue, relational database, and a hybrid cache‑MQ‑DB design), presents detailed implementation code, and evaluates each approach with extensive performance tests to guide practical backend design choices.

Backend ArchitectureKafkaLike System
0 likes · 18 min read
Design and Performance Evaluation of a Scalable Like System
Alibaba Cloud Native
Alibaba Cloud Native
Sep 7, 2021 · Cloud Native

Scaling Millions of IoT Vehicles: RocketMQ & Cloud‑Native Architecture in Action

Facing over a million concurrent connected vehicles, Chinese IoT leader ZhongRui chose RocketMQ over Kafka, leveraging Alibaba Cloud’s managed service to achieve low‑latency, high‑throughput, fault‑tolerant messaging, while adopting cloud‑native microservices, containerization, and serverless techniques to streamline operations and reduce costs.

Cloud NativeIoTKafka
0 likes · 13 min read
Scaling Millions of IoT Vehicles: RocketMQ & Cloud‑Native Architecture in Action
Volcano Engine Developer Services
Volcano Engine Developer Services
Sep 6, 2021 · Databases

How ByteDance Optimized ClickHouse for Real‑Time Recommendation and Ad Analytics

ByteDance’s ByteHouse, an enterprise‑grade ClickHouse, powers real‑time recommendation and ad‑delivery analytics at massive scale, detailing two case studies, technical selections, architectural designs, and performance optimizations such as asynchronous indexing, multi‑threaded Kafka consumption, and enhanced buffer engines to ensure data integrity.

Big DataByteHouseKafka
0 likes · 10 min read
How ByteDance Optimized ClickHouse for Real‑Time Recommendation and Ad Analytics
Open Source Linux
Open Source Linux
Aug 30, 2021 · Big Data

Why Kafka’s Message System Is Essential for High‑Throughput Applications

This article explains why a message system like Kafka is crucial for decoupling services, handling asynchronous workflows such as e‑commerce flash sales, controlling traffic, and achieving high concurrency, high availability, and high performance through sequential disk writes, zero‑copy reads, replication, and careful resource planning.

Distributed SystemsKafkaMessage Queue
0 likes · 35 min read
Why Kafka’s Message System Is Essential for High‑Throughput Applications
Xiaolei Talks DB
Xiaolei Talks DB
Aug 30, 2021 · Backend Development

Unlocking TiCDC: Efficient Incremental Data Sync for TiDB in Real‑World Scenarios

This article explains how TiCDC, a change‑data‑capture tool for TiDB, addresses incremental extraction, cross‑region hot‑standby, and stream processing needs, outlines its architecture, discusses early‑version issues, and provides best‑practice recommendations for stable, high‑performance data synchronization.

Change Data CaptureKafkaTiCDC
0 likes · 13 min read
Unlocking TiCDC: Efficient Incremental Data Sync for TiDB in Real‑World Scenarios
Programmer DD
Programmer DD
Aug 30, 2021 · Big Data

Why Is Kafka So Fast? Unveiling the Secrets Behind Its High Throughput

This article explains how Kafka achieves remarkable speed and massive throughput by using sequential disk I/O, OS page cache, zero‑copy transfers, partitioned log segments with indexes, batch processing, and efficient compression, making it a cornerstone of modern big‑data pipelines.

Big DataHigh ThroughputKafka
0 likes · 9 min read
Why Is Kafka So Fast? Unveiling the Secrets Behind Its High Throughput
Ops Development Stories
Ops Development Stories
Aug 28, 2021 · Operations

Inside Kafka's Topic Deletion: Code Walkthrough & Process Explained

This article explains the complete Kafka topic deletion workflow, from the client’s deleteTopics request through Zookeeper node creation, controller coordination, broker StopReplica handling, log renaming, delayed file removal, and final cleanup, while providing code excerpts and practical Q&A for common pitfalls.

BrokerKafkaScala
0 likes · 17 min read
Inside Kafka's Topic Deletion: Code Walkthrough & Process Explained
360 Tech Engineering
360 Tech Engineering
Aug 25, 2021 · Big Data

Cross‑IDC Kafka Hot‑Standby with MirrorMaker 2: Architecture, Design, and Productization

This article explains how 360 Commercialization implements cross‑IDC hot‑standby for Kafka using MirrorMaker 2, covering MM2 fundamentals, architecture, internal topics, deployment on Kubernetes, design goals, solution details, challenges such as dynamic configuration and offset reverse‑mapping, and productized risk mitigation.

KafkaMirrorMaker2cross-IDC
0 likes · 11 min read
Cross‑IDC Kafka Hot‑Standby with MirrorMaker 2: Architecture, Design, and Productization
Top Architect
Top Architect
Aug 21, 2021 · Backend Development

RabbitMQ vs Kafka: Architectural Comparison and Performance Considerations

This article compares RabbitMQ and Kafka by explaining their architectures, queue consumption and production mechanisms, highlighting RabbitMQ's single‑master limitation and Kafka's partition‑based design for higher throughput, and provides guidance on selecting the appropriate middleware based on performance needs.

KafkaMessage QueueRabbitMQ
0 likes · 7 min read
RabbitMQ vs Kafka: Architectural Comparison and Performance Considerations
Tencent Cloud Middleware
Tencent Cloud Middleware
Aug 19, 2021 · Backend Development

Fast Kafka Cluster Expansion: Practical Strategies to Reduce Data Migration

When a Kafka cluster reaches load limits or experiences sudden traffic spikes, urgent expansion is needed, but data migration can be time‑consuming and risky; this guide outlines several practical techniques—including adjusting retention, adding partitions, leader switching, and single‑replica operation—to quickly scale clusters while minimizing data movement and service disruption.

Data MigrationKafkaPartition Reassignment
0 likes · 21 min read
Fast Kafka Cluster Expansion: Practical Strategies to Reduce Data Migration
Tencent Cloud Developer
Tencent Cloud Developer
Aug 19, 2021 · Backend Development

Message Queue Design Principles and Implementation Analysis

The article explains message queues, their primary use cases—async processing, system decoupling, and traffic peak shaving—compares five mainstream products (ActiveMQ, RabbitMQ, Kafka, RocketMQ, Pulsar), and examines core design principles such as architecture, storage choices, push/pull models, and consumer groups, highlighting broader relevance to distributed systems.

Backend ArchitectureDistributed SystemsKafka
0 likes · 17 min read
Message Queue Design Principles and Implementation Analysis
Top Architect
Top Architect
Aug 17, 2021 · Backend Development

Introduction to Message Queues, JMS, RabbitMQ, and Kafka

This article explains the fundamentals of message queues, compares point-to-point and publish‑subscribe models, introduces Java Message Service (JMS) and its programming model, and provides an overview of popular MQ implementations such as RabbitMQ, ActiveMQ, RocketMQ, and Kafka, highlighting their architectures and use cases.

AMQPJMSKafka
0 likes · 17 min read
Introduction to Message Queues, JMS, RabbitMQ, and Kafka
Spring Full-Stack Practical Cases
Spring Full-Stack Practical Cases
Aug 12, 2021 · Backend Development

Master Kafka Streams in Spring Boot: Real‑Time Data Processing with Code Samples

This guide walks through setting up Kafka Streams with Spring Boot 2.3, covering environment configuration, core concepts, topology design, and multiple practical examples—including message sending, listening, transformations, aggregations, filtering, branching, and multi‑field grouping—complete with full code snippets and execution results.

KafkaSpring Bootjava
0 likes · 13 min read
Master Kafka Streams in Spring Boot: Real‑Time Data Processing with Code Samples
Top Architect
Top Architect
Aug 7, 2021 · Backend Development

Understanding Message Queues, JMS, RabbitMQ, and Kafka

This article explains the fundamentals of message queues, compares JMS with popular implementations such as RabbitMQ and Kafka, describes their architectures, usage patterns, and how they improve system performance, scalability, and decoupling in backend applications.

JMSKafkaMessage Queue
0 likes · 15 min read
Understanding Message Queues, JMS, RabbitMQ, and Kafka
Architects Research Society
Architects Research Society
Jul 30, 2021 · Fundamentals

RabbitMQ vs Apache Kafka: Messaging Patterns, Guarantees and Design Trade‑offs

This article compares RabbitMQ and Apache Kafka, explaining their core architectures, exchange and routing mechanisms, delivery guarantees, push vs pull models, dead‑letter handling, partitioning, consumer groups, log compaction, and the strengths and limitations of each system for scalable, ordered message processing.

KafkaMessaging SystemsRabbitMQ
0 likes · 21 min read
RabbitMQ vs Apache Kafka: Messaging Patterns, Guarantees and Design Trade‑offs
Java Architect Essentials
Java Architect Essentials
Jul 27, 2021 · Backend Development

Kafka Overview: Architecture, Core Features, and Operational Details

This article provides a comprehensive technical overview of Apache Kafka, covering its distributed messaging architecture, key features such as high‑throughput read/write, replication, partitioning, consumer group mechanics, offset management, rebalance processes, and practical code examples for synchronous and asynchronous offset commits.

Consumer OffsetsDistributed MessagingKafka
0 likes · 22 min read
Kafka Overview: Architecture, Core Features, and Operational Details