HBase Ecosystem Introduction
This article introduces the HBase ecosystem, including OpenTSDB for time-series data, Kylin for cube analysis, Phoenix for SQL access, and GeoMesa for spatio-temporal data, along with the author's experience deploying these components at Beike (贝壳找房).
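1.1 Time-series data: OpenTSDB is built on HBase and provides real-time reads, writes, and aggregation over massive data volumes. It is commonly used for monitoring.
1.2 Cube analysis: Kylin provides sub-second multidimensional analysis.
1.3 SQL on HBase: Phoenix supports standard SQL and JDBC, and can build indexes to speed up queries.
1.4 Spatio-temporal data: GeoMesa provides querying and analysis of large-scale geospatial data.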
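As a rough illustration of the Phoenix item above, the sketch below builds the SQL text for a table and a covering secondary index. The table `METRICS`, its columns, and the helper `create_index_sql` are hypothetical and not taken from the original article; only the `CREATE INDEX ... ON ... (...) INCLUDE (...)` shape follows Phoenix's SQL grammar. Sending the statements through JDBC or another Phoenix client is not shown.

```python
def create_index_sql(index_name, table, indexed_cols, included_cols=()):
    """Build a Phoenix CREATE INDEX statement (hypothetical helper).

    indexed_cols become the index row key; included_cols are copied into
    the index so queries that only need them can skip the data table.
    """
    if not indexed_cols:
        raise ValueError("at least one indexed column is required")
    sql = f"CREATE INDEX {index_name} ON {table} ({', '.join(indexed_cols)})"
    if included_cols:
        sql += f" INCLUDE ({', '.join(included_cols)})"
    return sql


# Hypothetical table: one row per host and timestamp.
CREATE_TABLE_SQL = (
    "CREATE TABLE IF NOT EXISTS METRICS ("
    "HOST VARCHAR NOT NULL, TS BIGINT NOT NULL, "
    "METRIC VARCHAR, VAL DOUBLE "
    "CONSTRAINT PK PRIMARY KEY (HOST, TS))"
)

# Index on METRIC that also carries VAL, so a lookup by metric name
# does not have to scan the (HOST, TS) primary key.
INDEX_SQL = create_index_sql("IDX_METRIC", "METRICS", ["METRIC"], ["VAL"])
```

Without such an index, a filter on `METRIC` alone would have to scan the table, because the row key starts with `HOST`.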
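2. Beike's HBase ecosystem
Beike has deployed Kylin for real-time analysis, OpenTSDB for storing cluster monitoring data, and Phoenix as the SQL component. GeoMesa has not been built out yet.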
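3. Current state of HBase time-series storage
Since August 2018, OpenTSDB has been used to store metrics from the Hadoop and HBase clusters. The HBase cluster handles an average of 30,000 requests per second.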
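4. New features in HBase 2.0
Version 2.0 improves performance through features such as AssignmentManager V2, offheaping, and In-Memory Compaction, while using only half of the memory previously required.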
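5. Introduction to OpenTSDB
OpenTSDB is a scalable time-series read/write service built on HBase. It is deployed with reads and writes separated, and its setup includes initializing the tables and enabling random mapping (random UID assignment for metrics).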
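To make the read/write separation concrete, here is a minimal sketch that prepares requests for OpenTSDB's HTTP API, sending data points to one set of TSD nodes and queries to another. The host names are hypothetical placeholders, and the functions only build the URL and JSON body. They do not perform any network call.

```python
import json

# Hypothetical endpoints: one group of TSDs takes writes, another serves reads.
WRITE_TSD = "http://tsd-write.example.com:4242"
READ_TSD = "http://tsd-read.example.com:4242"


def build_put_request(metric, timestamp, value, tags):
    """Return (url, body) for an OpenTSDB /api/put call on the write nodes."""
    if not tags:
        raise ValueError("OpenTSDB requires at least one tag per data point")
    point = {"metric": metric, "timestamp": timestamp, "value": value, "tags": tags}
    return f"{WRITE_TSD}/api/put", json.dumps(point, sort_keys=True)


def build_query_request(metric, start, aggregator="sum"):
    """Return (url, body) for an OpenTSDB /api/query call on the read nodes."""
    query = {"start": start, "queries": [{"aggregator": aggregator, "metric": metric}]}
    return f"{READ_TSD}/api/query", json.dumps(query, sort_keys=True)
```

Keeping the two paths apart means a heavy dashboard query cannot slow down metric ingestion on the same TSD process.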
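6.1 HBase configuration
Configure the off-heap cache, G1 garbage collection, the off-heap memstore, and related settings to improve read and write performance.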
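The following sketch shows what such settings might look like when rendered into `hbase-site.xml`. The property names are standard HBase 2.x settings, but every value is an illustrative assumption rather than the configuration used in the article, and the helper `render_hbase_site` is hypothetical.

```python
from xml.sax.saxutils import escape

# Illustrative values only; sizes are in megabytes and must be tuned per host.
OFFHEAP_PROPS = {
    "hbase.bucketcache.ioengine": "offheap",                    # off-heap block cache
    "hbase.bucketcache.size": "8192",                           # bucket cache size (MB)
    "hbase.regionserver.offheap.global.memstore.size": "4096",  # off-heap memstore (MB)
}

# G1 is a JVM option (for example in HBASE_REGIONSERVER_OPTS in hbase-env.sh),
# not an hbase-site.xml property. Direct memory must cover cache plus memstore.
REGIONSERVER_JVM_OPTS = ["-XX:+UseG1GC", "-XX:MaxDirectMemorySize=16g"]


def render_hbase_site(props):
    """Render a dict of properties as an hbase-site.xml document."""
    lines = ["<configuration>"]
    for name, value in sorted(props.items()):
        lines.append("  <property>")
        lines.append(f"    <name>{escape(name)}</name>")
        lines.append(f"    <value>{escape(value)}</value>")
        lines.append("  </property>")
    lines.append("</configuration>")
    return "\n".join(lines)


HBASE_SITE_XML = render_hbase_site(OFFHEAP_PROPS)
```

In this example the direct memory limit (16 GB) is deliberately larger than the bucket cache plus off-heap memstore (12 GB), leaving headroom for other direct buffers.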
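6.2 In-Memory Compaction configuration
Enable the BASIC, EAGER, or ADAPTIVE policy either globally or per table to optimize the in-memory index and remove redundant data.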
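As a sketch of the two scopes, the helpers below produce a global `hbase-site.xml` setting and an HBase shell `alter` command for a single column family. The table and family names in the usage lines are examples, and the helper names are hypothetical.

```python
# Policies accepted by HBase 2.x for in-memory compaction.
VALID_POLICIES = ("NONE", "BASIC", "EAGER", "ADAPTIVE")

# Cluster-wide default, set in hbase-site.xml.
GLOBAL_PROPERTY = "hbase.hregion.compacting.memstore.type"


def _check(policy):
    if policy not in VALID_POLICIES:
        raise ValueError(f"unknown policy {policy!r}, expected one of {VALID_POLICIES}")
    return policy


def global_setting(policy):
    """Return the hbase-site.xml property that sets the default policy."""
    return {GLOBAL_PROPERTY: _check(policy)}


def alter_table_command(table, family, policy):
    """Return an HBase shell command that sets the policy on one column family."""
    return (
        f"alter '{table}', "
        f"{{NAME => '{family}', IN_MEMORY_COMPACTION => '{_check(policy)}'}}"
    )


# Example: BASIC cluster-wide, EAGER on one column family.
DEFAULT = global_setting("BASIC")
OVERRIDE = alter_table_command("tsdb", "t", "EAGER")
```

A table-level setting overrides the global default, so the more aggressive policy can be limited to tables where many cells are overwritten while still in memory.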
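6.3 OpenTSDB configuration
In opentsdb.conf, configure settings such as the characters allowed in tags and random mapping to improve read and write performance.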
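A minimal sketch of such an `opentsdb.conf` follows, generated for a write node and a read-only node. The keys are standard OpenTSDB options; the values (extra tag characters, ZooKeeper address) are assumptions for illustration, and `render_opentsdb_conf` is a hypothetical helper.

```python
def render_opentsdb_conf(settings):
    """Render settings as `key = value` lines for opentsdb.conf."""
    lines = []
    for key, value in settings.items():
        if isinstance(value, bool):
            value = "true" if value else "false"
        lines.append(f"{key} = {value}")
    return "\n".join(lines)


# Write node: accepts data points and may assign new UIDs.
WRITE_NODE = {
    "tsd.mode": "rw",
    "tsd.core.auto_create_metrics": True,
    # Assign metric UIDs randomly instead of sequentially, which spreads
    # new metrics across regions rather than piling them onto one.
    "tsd.core.uid.random_metrics": True,
    # Extra characters allowed in metric and tag names (example value).
    "tsd.core.tag.allow_specialchars": ":@",
    # Hypothetical ZooKeeper quorum of the HBase cluster.
    "tsd.storage.hbase.zk_quorum": "zk1.example.com:2181",
}

# Read node: same storage settings, but it rejects writes.
READ_NODE = dict(WRITE_NODE, **{"tsd.mode": "ro"})

WRITE_CONF = render_opentsdb_conf(WRITE_NODE)
READ_CONF = render_opentsdb_conf(READ_NODE)
```

Random metric UIDs generally need to be enabled before data is written, since existing UIDs are not reassigned.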
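7. Lessons learned
A backlog in the Netty buffers was resolved by adjusting the buffer size. Regions stuck in transition (RIT) can be repaired with hbck2 or by restarting the Master. Rebalancing the cluster required switching the HMaster node.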
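For the RIT case, one common HBCK2 operation is `assigns`, which asks the Master to assign the listed regions. The sketch below only assembles the command line. The jar path and the encoded region name in the usage line are placeholders, and the article does not say which HBCK2 command was actually used.

```python
def hbck2_assigns_argv(hbck2_jar, encoded_regions):
    """Build the argv for `hbase hbck -j <jar> assigns <region>...`.

    hbck2_jar is the path to the separately built HBCK2 jar; encoded_regions
    are encoded region names as shown in the Master UI for stuck regions.
    """
    if not encoded_regions:
        raise ValueError("at least one encoded region name is required")
    return ["hbase", "hbck", "-j", hbck2_jar, "assigns", *encoded_regions]


# Placeholder jar path and region name, for illustration only.
ARGV = hbck2_assigns_argv("/path/to/hbase-hbck2.jar", ["de00010733901a05f5a2a3a382e27dd4"])
```

The resulting list can be passed to `subprocess.run` on a host with the HBase client installed. Printing it first makes it easy to check before anything touches the cluster.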
Beike Product & Technology
As Beike's official product and technology account, we are committed to building a platform for sharing Beike's product and technology insights, targeting internet/O2O developers and product professionals. We share high-quality original articles, tech salon events, and recruitment information weekly. Welcome to follow us.
