-
-
Data warehouse. In essence, Hive keeps a mapping (stored in MySQL) onto files that already exist in HDFS, so that the data can be managed and queried with HQL
-
Used for data analysis and cleaning. Hive suits offline data analysis and cleaning, and its latency is high
-
Based on HDFS and MapReduce. Data stored in Hive still resides on the DataNodes, and HQL statements are ultimately converted into MapReduce jobs for execution
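
To make the HQL-to-MapReduce conversion more concrete, here is a minimal pure-Python sketch of how a query such as `SELECT dept, COUNT(*) FROM emp GROUP BY dept` could be broken into map, shuffle, and reduce steps. It is an illustration only, not Hive's actual execution plan: the `emp` table, the `ROWS` data, and all function names are hypothetical.

```python
from collections import defaultdict

# Hypothetical rows of an `emp` table (name, dept). In Hive these would be
# records read from files stored in HDFS.
ROWS = [
    ("alice", "sales"),
    ("bob", "sales"),
    ("carol", "ops"),
]


def map_phase(rows):
    """Emit one (dept, 1) pair per input row, as a mapper would."""
    for _name, dept in rows:
        yield dept, 1


def shuffle(pairs):
    """Group values by key, standing in for the framework's shuffle step."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(groups):
    """Sum the values collected for each key, as a reducer would."""
    return {key: sum(values) for key, values in groups.items()}


def count_by_dept(rows):
    """Run the three steps in order and return {dept: row_count}."""
    return reduce_phase(shuffle(map_phase(rows)))


if __name__ == "__main__":
    print(count_by_dept(ROWS))
```

Every step scans the whole input, which is one way to see why this style of processing suits offline batch work rather than low-latency lookups.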
-
-
HBase
-
Database. HBase is a column-oriented, non-relational database
-
Used to store structured and unstructured data. It suits non-relational data kept in a single table and is not suited to relational queries such as JOIN
-
Based on HDFS. Data is persisted as HFiles, which are stored on the DataNodes and managed by RegionServers in units of regions
-
Low latency, suitable for online business use. When faced with large volumes of enterprise data, HBase can store a large amount of data directly in a single table while still providing efficient data access
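
The HBase points above (a single sparse table addressed by row key, split into regions served by RegionServers) can be pictured with a small toy model in Python. This is a sketch under simplifying assumptions, not the HBase client API: `ToyTable`, `locate_region_server`, the region boundaries, and the server names `rs1`/`rs2` are all hypothetical, and real HBase details such as cell timestamps and the region lookup table are left out.

```python
from bisect import bisect_right

# Hypothetical region layout. Each region serves row keys from its start key
# (inclusive) up to the next region's start key (exclusive).
REGION_START_KEYS = ["", "m"]    # two regions: ["", "m") and ["m", end)
REGION_SERVERS = ["rs1", "rs2"]  # hypothetical RegionServer names


def locate_region_server(row_key):
    """Return the server whose region's key range contains row_key."""
    idx = bisect_right(REGION_START_KEYS, row_key) - 1
    return REGION_SERVERS[idx]


class ToyTable:
    """Sparse single table: row key -> {"family:qualifier": value}."""

    def __init__(self):
        self._rows = {}

    def put(self, row_key, column, value):
        """Store one cell; rows may have completely different columns."""
        self._rows.setdefault(row_key, {})[column] = value

    def get(self, row_key):
        """Fetch a whole row by key; a missing row yields an empty dict."""
        return dict(self._rows.get(row_key, {}))


if __name__ == "__main__":
    table = ToyTable()
    table.put("alice", "info:email", "alice@example.com")
    table.put("zoe", "stats:logins", 7)
    print(table.get("alice"), locate_region_server("alice"))
    print(table.get("zoe"), locate_region_server("zoe"))
```

A read goes straight from the row key to one region and one row, which is what keeps access fast. The toy model has no operation that combines rows across tables, which mirrors why JOIN-style queries are a poor fit.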
-
Hive