Big Data Technology Architecture
Author

Big Data Technology Architecture

Exploring Open Source Big Data and AI Technologies

290
Articles
0
Likes
602
Views
0
Comments
Recent Articles

Latest from Big Data Technology Architecture

100 recent articles max
Big Data Technology Architecture
Big Data Technology Architecture
Nov 13, 2021 · Big Data

Case Study: Migrating Baicaowei's On‑Premise Hadoop Data Platform to Alibaba Cloud Native Data Lake

This article details Baicaowei's migration from an IDC‑hosted Hadoop cluster to a cloud‑native data lake on Alibaba Cloud, outlining the business drivers, pain points of the legacy platform, architectural goals, design principles, solution selection, implementation steps, and future outlook for the new big‑data ecosystem.

Alibaba CloudDelta LakeETL
0 likes · 16 min read
Case Study: Migrating Baicaowei's On‑Premise Hadoop Data Platform to Alibaba Cloud Native Data Lake
Big Data Technology Architecture
Big Data Technology Architecture
Nov 2, 2021 · Big Data

Comprehensive Guide to FlinkSQL and Table API: Background, Dependencies, Planners, and Usage

This article provides a detailed introduction to FlinkSQL, covering its background, the Table API, required dependencies, differences between old and Blink planners, various API usage patterns, connector configurations for CSV, Kafka, Elasticsearch, MySQL, and how to convert between DataStream and Table in Flink's unified batch‑stream processing model.

ConnectorDataStreamFlinkSQL
0 likes · 23 min read
Comprehensive Guide to FlinkSQL and Table API: Background, Dependencies, Planners, and Usage
Big Data Technology Architecture
Big Data Technology Architecture
Oct 15, 2021 · Backend Development

Key Concepts and Internal Mechanisms of Apache Kafka

This article explains Kafka's internal topics, preferred replicas, partition assignment processes, log directory layout, index files, offset and timestamp lookup, log retention and compaction policies, storage architecture, delayed operations, controller role, legacy consumer design flaws, rebalance workflow, and producer idempotence, providing a comprehensive overview of Kafka's backend architecture.

Consumer OffsetsIdempotenceLog Retention
0 likes · 16 min read
Key Concepts and Internal Mechanisms of Apache Kafka
Big Data Technology Architecture
Big Data Technology Architecture
Oct 14, 2021 · Backend Development

Effective Code Review and Software Design Principles for Go Backend Development

This article shares practical insights on why engineers and leaders must conduct code reviews, identifies common causes of poor code such as duplication and premature decisions, and presents concrete design principles—including simplicity, composition, transparency, and disciplined logging—illustrated with Go code examples to help teams build maintainable, high‑quality backend systems.

ArchitectureBackend EngineeringGolang
0 likes · 39 min read
Effective Code Review and Software Design Principles for Go Backend Development
Big Data Technology Architecture
Big Data Technology Architecture
Oct 14, 2021 · Databases

Performance Evaluation and Optimization of HBase 2.x Write Operations

This article presents a detailed performance test of HBase 2.x write throughput on a five‑node SSD cluster, identifies latency spikes caused by MemStore flush and ConcurrentSkipListMap size() overhead, and demonstrates how fixing the bug and applying in‑memory compaction dramatically reduces P999 and P9999 latency while preserving throughput.

HBaseIn-Memory CompactionMemStore
0 likes · 10 min read
Performance Evaluation and Optimization of HBase 2.x Write Operations
Big Data Technology Architecture
Big Data Technology Architecture
Oct 9, 2021 · Big Data

Apache Kafka 3.0 Release Highlights and New Features

Apache Kafka 3.0 introduces major enhancements including KRaft consensus, deprecation of Java 8 and Scala 2.12 support, stronger producer guarantees, updated APIs, improved Kafka Connect, MirrorMaker 2 flexibility, and numerous KIP-driven feature upgrades, marking a significant step forward for the distributed streaming platform.

KafkaKafka 3.0Streaming
0 likes · 13 min read
Apache Kafka 3.0 Release Highlights and New Features