April 2016
Beginner
268 pages
5h 32m
English
In previous chapters, you learned different types of joins in Hive and optimizations available in Hive joins.
In this chapter, we will cover the following recipes in detail:
Statistics in terms of the number of records in a table or partitions or histograms of a column is important. Also, it could help in query optimization. Statistical data is required as an input to many functions so that it can compare different plans. Statistics also help users by storing answers to some of the most frequently queried data and prevent long-running execution plans each time a query is ...