January 2019
Beginner to intermediate
670 pages
18h 32m
English
You can also use Impala. It is also an open-source tool in the Hadoop ecosystem. Instead of using MapReduce, as Hive does, it is using MPP (similar to the analytical data warehouse), which gives us a better query performance but requires additional steps. Hive and Impala both use Hive Metastore, but Hive requires more time to process the query and it isn’t very efficient for improvised analyses. Impala requires data to be stored on HDFS. Moreover, it depends on the memory resource of clusters. As a result, every tool has its own pros and cons. You can read more here:
Read now
Unlock full access