Tagged articles
122 articles
Page 2 of 2
DataFunTalk
DataFunTalk
Feb 17, 2021 · Big Data

Apache Iceberg 0.11.0: New Partition Support, SortOrder, Flink Streaming Reader, and Ecosystem Integrations

The article details Apache Iceberg 0.11.0's core enhancements—including partition changes, SortOrder, extensive Flink and Spark integrations, CDC/Upsert support, hash‑based write distribution to reduce small files, and upcoming 0.12.0 roadmap—while providing practical SQL and API examples for data‑lake practitioners.

Apache IcebergBig DataCDC
0 likes · 13 min read
Apache Iceberg 0.11.0: New Partition Support, SortOrder, Flink Streaming Reader, and Ecosystem Integrations
Architect's Journey
Architect's Journey
Jan 26, 2021 · Backend Development

Three Storage Solutions for Cross-Database Aggregated Full-Text Search

The article compares three approaches—synchronous dual write, asynchronous dual write with a message queue, and CDC via Canal—to keep Elasticsearch and a relational database consistent for cross‑database aggregated full‑text search, outlining their steps, advantages, and drawbacks.

Backend ArchitectureCDCData Consistency
0 likes · 6 min read
Three Storage Solutions for Cross-Database Aggregated Full-Text Search
Architects Research Society
Architects Research Society
Aug 31, 2020 · Databases

What Is Debezium? Overview, Architecture, and Features

Debezium is an open‑source distributed platform built on Apache Kafka that captures row‑level changes from databases via change data capture, providing source connectors, an optional embedded engine, and features like low‑latency streaming, snapshots, filtering, masking, and integration with various sink systems.

CDCChange Data CaptureDatabase Streaming
0 likes · 8 min read
What Is Debezium? Overview, Architecture, and Features
Architects Research Society
Architects Research Society
Mar 6, 2020 · Backend Development

Ensuring Data Consistency Across Microservices: Saga, Reconciliation, Event Sourcing, and Change Data Capture

The article explains why achieving data consistency across multiple microservices is challenging, reviews the limitations of two‑phase commit, and presents practical techniques such as the Saga pattern, reconciliation, event logs, orchestration vs. choreography, and change‑data‑capture to keep distributed systems eventually consistent.

CDCData ConsistencyDistributed Transactions
0 likes · 12 min read
Ensuring Data Consistency Across Microservices: Saga, Reconciliation, Event Sourcing, and Change Data Capture
Architects Research Society
Architects Research Society
Oct 13, 2019 · Databases

What is Debezium? Overview, Architecture, and Features

Debezium is an open‑source distributed platform built on Apache Kafka that turns existing databases into real‑time event streams by capturing row‑level changes via change data capture, offering source and embedded connectors, flexible topic routing, and features such as snapshots, filtering, masking, and monitoring.

CDCChange Data CaptureDebezium
0 likes · 7 min read
What is Debezium? Overview, Architecture, and Features
JD Retail Technology
JD Retail Technology
Sep 20, 2019 · Databases

Follower Reads, Closed Timestamp, and Minimum Proposal Tracker in CB‑SQL

This article explains how CB‑SQL implements follower reads by using safe (closed) timestamps, describes the CT update mechanism with a Minimum Proposal Tracker, and discusses routing, replica read validation, timestamp forwarding, range split/merge handling, and recovery strategies for consistent distributed reads.

CB-SQLCDCClosed Timestamp
0 likes · 15 min read
Follower Reads, Closed Timestamp, and Minimum Proposal Tracker in CB‑SQL
Alibaba Cloud Developer
Alibaba Cloud Developer
Sep 4, 2019 · Big Data

How Structured Big Data Storage Powers Modern Data Systems

This article explores the core components of data systems, the evolution toward lightweight, intelligent big data architectures, the distinction between primary and secondary storage, challenges of data replication, and how Alibaba Cloud's Tablestore implements advanced features such as storage‑compute separation, CDC, and multi‑model indexing for scalable, cost‑effective structured big data storage.

Big DataCDCCloud Services
0 likes · 24 min read
How Structured Big Data Storage Powers Modern Data Systems
JD Retail Technology
JD Retail Technology
Jul 3, 2019 · Databases

CB‑SQL: JD Retail’s Elastic Distributed Database Built on CockroachDB – Architecture and Implementation Details

The article introduces CB‑SQL, JD Retail’s next‑generation elastic database based on CockroachDB, and comprehensively explains its overview, layered architecture, storage replication, cluster management, MVCC, hybrid logical clocks, lock‑free distributed transactions, CDC changefeeds, and SQL interface support.

CDCCockroachDBMVCC
0 likes · 14 min read
CB‑SQL: JD Retail’s Elastic Distributed Database Built on CockroachDB – Architecture and Implementation Details
Aikesheng Open Source Community
Aikesheng Open Source Community
Dec 30, 2018 · Databases

Introducing DTLE: An Open‑Source MySQL Data Transfer Middleware for CDC, Replication, and Cloud Synchronization

The article presents DTLE, an open‑source MySQL data‑transfer middleware that extends replication capabilities with high‑performance CDC, multi‑topology support, cloud‑to‑cloud synchronization, and robust cluster management, while comparing it with other open‑source solutions and showcasing real‑world demos.

CDCDTLEcloud sync
0 likes · 14 min read
Introducing DTLE: An Open‑Source MySQL Data Transfer Middleware for CDC, Replication, and Cloud Synchronization
dbaplus Community
dbaplus Community
Apr 6, 2016 · Databases

Seamless DB2 Major Version Upgrade Using CDC Replication

This article explains why DB2 customers must upgrade to avoid EOS risks, compares replication options, and provides a detailed CDC‑based step‑by‑step process to achieve minute‑level downtime, zero performance impact, and fast rollback for large‑scale DB2 version migrations.

CDCDB2Version Upgrade
0 likes · 10 min read
Seamless DB2 Major Version Upgrade Using CDC Replication