Tagged articles
Java Architect Essentials
Jun 15, 2021 · Big Data

Comprehensive Guide to Apache Kafka: Concepts, Installation, Configuration, and Usage

This article provides a thorough overview of Apache Kafka, covering its core streaming concepts, key components such as topics, partitions, producers and consumers, common use cases, step‑by‑step installation and multi‑broker configuration, fault‑tolerance testing, and an introduction to Kafka Connect for data import/export.

Consumer · Distributed Streaming · Installation
0 likes · 24 min read
Big Data Technology & Architecture
Mar 2, 2021 · Big Data

An Introduction to Kafka Connect: Architecture, Components, and Hands‑On Setup

This article introduces Kafka Connect, explaining its purpose as a scalable and reliable tool for moving data between Apache Kafka and external systems, detailing its core concepts, architecture, deployment modes, configuration files, and a step‑by‑step example that streams data from a file source to a file sink.

Data Integration · ETL · Streaming
0 likes · 12 min read
Beike Product & Technology
Dec 10, 2020 · Big Data

Overview and Practical Guide to Debezium MongoDB Source Connector

This article explains how Debezium's MongoDB Source Connector captures change events from replica sets or sharded clusters, streams them to Kafka topics, and provides detailed configuration, deployment, monitoring, and troubleshooting steps for building reliable change‑data‑capture pipelines.

Change Data Capture · Connector · Debezium
0 likes · 11 min read
DataFunTalk
Nov 27, 2020 · Big Data

Evolution of Kafka‑Based Data Pipeline at Chehaoduo Group: Architecture, Scaling, and Best Practices

This article chronicles the four‑year evolution of Chehaoduo Group’s Kafka ecosystem, from its initial role as a simple data‑ingestion layer to its position as the core of the company’s large‑scale data pipeline. It covers cluster management, upgrade strategies, multi‑cluster deployment, Avro schema handling, SDK development, and operational lessons learned.

Avro · Cluster Management · Kafka
0 likes · 21 min read
Architects Research Society
Aug 31, 2020 · Databases

What Is Debezium? Overview, Architecture, and Features

Debezium is an open‑source distributed platform built on Apache Kafka that captures row‑level changes from databases via change data capture, providing source connectors, an optional embedded engine, and features like low‑latency streaming, snapshots, filtering, masking, and integration with various sink systems.

CDC · Change Data Capture · Database Streaming
0 likes · 8 min read
Architects Research Society
Jul 29, 2020 · Big Data

Static Members and Incremental Cooperative Rebalancing in Apache Kafka

Apache Kafka 2.3 introduced static members and incremental cooperative rebalancing to reduce disruptive global rebalances. Workers can retain their assignments during failures and rebalances can be delayed, which improves scalability for Kafka Connect clusters while balancing availability and fault tolerance.

Apache Kafka · Distributed Systems · Incremental Rebalancing
0 likes · 12 min read
Architects Research Society
Oct 13, 2019 · Databases

What is Debezium? Overview, Architecture, and Features

Debezium is an open‑source distributed platform built on Apache Kafka that turns existing databases into real‑time event streams by capturing row‑level changes via change data capture, offering source and embedded connectors, flexible topic routing, and features such as snapshots, filtering, masking, and monitoring.

CDC · Change Data Capture · Debezium
0 likes · 7 min read
JavaEdge
Aug 25, 2019 · Big Data

Which Kafka Distribution Fits Your Needs? A Detailed Comparison

This article compares the main Kafka distributions (Apache Kafka, Confluent Kafka, and CDH/HDP Kafka), examining their origins, feature sets, ecosystem support, and trade‑offs to help you choose the most suitable version for your streaming workloads.

Streaming · big-data · confluent
0 likes · 10 min read