调研:HBase性能 郝伟 2021/10/13 [TOC]

1. 结论

  • HBase适合存储PB级别的海量数据(百亿千亿量级条记录),如果根据记录主键Rowkey来查询,能在几十到百毫秒内返回数据。
  • Hive功能更强大,但查询效率较低,只适合做离线分析,不合适做实时查询。

2. 参考资料

  • IBM Apache HBase https://www.ibm.com/topics/hbase HBase is a column-oriented non-relational database management system that runs on top of Hadoop Distributed File System (HDFS). HBase provides a fault-tolerant way of storing sparse data sets, which are common in many big data use cases. It is well suited for real-time data processing or random read/write access to large volumes of data.

  • Apache HBase ™ Reference Guide http://hbase.apache.org/1.4/book.html

  • Apache HBase Performance Tuning HBase1.2官方文档——Apache HBase Performance Tuning

  • 面对百亿数据,Hbase为什么查询速度依然非常快? https://blog.csdn.net/shb_derek1/article/details/114274121 第一句话:“HBase适合存储PB级别的海量数据(百亿千亿量级条记录),如果根据记录主键Rowkey来查询,能在几十到百毫秒内返回数据。”

results matching ""

    No results matching ""