Differences between Hive and HBase

  • Hive

    • Data warehouse. In essence, Hive maintains a one-to-one mapping in MySQL between its tables and files already stored in HDFS, so that the data can be managed and queried with HQL

    • Used for data analysis and cleaning. Hive is suited to offline data analysis and cleaning, and has high latency

    • Based on HDFS and MapReduce. Data stored in Hive still resides on the DataNodes, and HQL statements are ultimately converted to MapReduce jobs for execution

  • HBase

    • Database. HBase is a column-oriented, non-relational database

    • Used to store structured and unstructured data. It is suited to single-table, non-relational data storage and not suited to relational queries such as JOIN

    • Based on HDFS. Data is persisted as HFiles, which are stored on the DataNodes and managed by RegionServers in units of regions

    • Low latency, suited to online business access. When facing large volumes of enterprise data, HBase can store a large amount of data directly in a single table while still providing fast data access
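
The two access patterns above can be contrasted with a small toy model. This is only a sketch in plain Python and does not use the Hive or HBase APIs: the names `map_phase`, `reduce_phase`, and `ToyRegionTable`, the sample records, and the `info:city` column are all hypothetical. The first half imitates how an HQL aggregation is run as a map step and a reduce step over every record in a file; the second half imitates how rows sorted by row key are split into regions so that a single row can be fetched without scanning the rest.

```python
from bisect import bisect_right
from collections import defaultdict


# Hive-style access: the query becomes a batch job that reads every record.
def map_phase(lines):
    """Emit (city, 1) for each CSV line of the form 'user_id,city'."""
    for line in lines:
        _user_id, city = line.split(",")
        yield city, 1


def reduce_phase(pairs):
    """Sum the values per key, as a reducer would after the shuffle."""
    totals = defaultdict(int)
    for key, value in pairs:
        totals[key] += value
    return dict(totals)


# HBase-style access: rows are ordered by row key and split into regions.
class ToyRegionTable:
    def __init__(self, split_keys):
        # Each split key is the first row key of the next region, so
        # N split keys produce N + 1 regions.
        self.split_keys = sorted(split_keys)
        self.regions = [dict() for _ in range(len(self.split_keys) + 1)]

    def _region_for(self, row_key):
        # Binary search over the split keys picks the one region
        # whose key range contains row_key.
        return self.regions[bisect_right(self.split_keys, row_key)]

    def put(self, row_key, column, value):
        # Columns are addressed as 'family:qualifier' within a row.
        self._region_for(row_key).setdefault(row_key, {})[column] = value

    def get(self, row_key):
        # Only one region is consulted; other rows are never touched.
        return self._region_for(row_key).get(row_key, {})


# Roughly what SELECT city, COUNT(*) ... GROUP BY city would compute.
hdfs_file = ["u1,Beijing", "u2,Shanghai", "u3,Beijing"]
city_counts = reduce_phase(map_phase(hdfs_file))

# A point lookup by row key, the typical online access pattern.
table = ToyRegionTable(split_keys=["u2"])
table.put("u1", "info:city", "Beijing")
table.put("u3", "info:city", "Beijing")
profile = table.get("u3")
```

In this model the aggregation has to pass over the whole input, which is why such work fits offline analysis, whereas the lookup only touches the region that owns the row key, which is what keeps reads fast on a very large single table.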

