Tagged articles
8 articles
Big Data Technology & Architecture
Jan 7, 2021 · Databases

Comprehensive HBase Optimization Guide: Table Design, RowKey, JVM Tuning, Cache Settings, and Read/Write Performance

This article provides a detailed, practical guide to optimizing HBase in production, covering table pre‑splitting, RowKey design, JVM memory and GC settings, MSLAB and BucketCache configuration, read‑side client and server tuning, write‑side strategies, and additional tips such as compression and scan caching.

Cache · Database Tuning · HBase
29 min read
Big Data Technology & Architecture
Aug 30, 2020 · Big Data

Kylin Cube Construction Principles and Optimization Techniques

This article explains the fundamentals of Kylin Cube construction, including dimensions, measures, Cuboid generation, the layer-by-layer and in‑memory build algorithms, and storage mechanisms. It also covers optimization strategies such as derived dimensions, aggregation groups, row‑key design, and concurrency granularity, as a guide for big‑data OLAP practitioners.

Big Data · Cube · Kylin
14 min read
DataFunTalk
Oct 25, 2019 · Big Data

Migrating Data from HBase to Kafka Using MapReduce

This article explains how to reverse the typical data flow: a MapReduce job extracts the RowKeys from HBase in bulk and stores them on HDFS, then batch Get operations retrieve the full records and write them into Kafka, with retry handling and progress monitoring along the way.

Big Data · Data Migration · HBase
9 min read
Sohu Tech Products
Oct 9, 2019 · Databases

HBase Table Design Strategies: Data Model, Column Descriptors, RowKey, Region and Performance Optimization

This article explains HBase's data model and provides table‑design strategies, including column‑descriptor options, RowKey best practices, tall‑vs‑wide table trade‑offs, and region splitting and pre‑splitting techniques, to help achieve good performance and scalability in large‑scale NoSQL workloads.

Big Data · Column Family · HBase
16 min read
Big Data Technology & Architecture
May 5, 2019 · Databases

Designing Effective RowKeys in HBase

This article explains why HBase RowKey design is critical for performance, outlines common interview expectations, and provides visual guidelines to help developers create efficient RowKeys for production workloads, including best‑practice tips on key length, salting, and ordering to avoid hotspotting.

Big Data · Database design · rowKey
1 min read
DataFunTalk
Dec 6, 2018 · Databases

HBase RowKey and Index Design: Principles, Practices, and Case Studies

This article introduces HBase fundamentals, explores RowKey and secondary index design principles, discusses requirements analysis, presents techniques such as reversing, salting, and hashing, and reviews real-world case studies for OpenTSDB, JanusGraph, and GeoMesa, offering practical guidance for scalable NoSQL data modeling.

Database Architecture · HBase · NoSQL
19 min read
21CTO
Apr 16, 2016 · Databases

Optimizing HBase Log Queries: Index Design and RowKey Strategies

This article examines the challenges of storing and querying log data in HBase, outlines the drawbacks of custom indexing, and presents practical RowKey design, filter usage, and integration with external search engines to improve query performance.

Big Data · HBase · NoSQL
15 min read