This recipe shows how to use Hive to perform a join across two datasets. The first dataset is the book details dataset of the Book-Crossing database, and the second is the reviewer ratings for those books. The recipe uses Hive to find the authors with the largest number of ratings above 3 stars.
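As a rough preview, a join of this kind could look like the HiveQL sketch below. It is only an illustration under stated assumptions: the table names `books` and `ratings`, the columns `isbn`, `author`, and `rating`, and the `LIMIT 10` cutoff are hypothetical and may not match the tables created in the earlier recipe. Only the `bookcrossing` database name comes from this recipe.

```sql
-- Assumed (hypothetical) schema, not confirmed by the recipe:
--   books(isbn STRING, title STRING, author STRING, ...)
--   ratings(user_id INT, isbn STRING, rating INT)
USE bookcrossing;

-- Join each rating to its book on ISBN, keep ratings above 3,
-- then count the qualifying ratings per author.
SELECT b.author AS author,
       COUNT(*) AS rating_count
FROM books b
JOIN ratings r
  ON (b.isbn = r.isbn)
WHERE r.rating > 3
GROUP BY b.author
ORDER BY rating_count DESC   -- most-rated authors first
LIMIT 10;                    -- arbitrary cutoff chosen for illustration
```

The filter is applied in the `WHERE` clause so the join condition stays a plain equality on ISBN. `ORDER BY` gives a total ordering of the results, which matters here because only the top authors are of interest.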
Before you begin, complete the previous recipe, Hive batch mode – using a query file.
This section demonstrates how to perform a join using Hive. Proceed with the following steps:
$ hive
hive> USE bookcrossing;
create-book-crossing.hql Hive query file after referring to the previous ...