© Scott Shaw, Andreas Francois Vermeulen, Ankur Gupta, David Kjerrumgaard 2016

Scott Shaw, Andreas François Vermeulen, Ankur Gupta and David Kjerrumgaard, Practical Hive, 10.1007/978-1-4842-0271-5_7

7. Querying Semi-Structured Data

Scott Shaw, Andreas François Vermeulen2, Ankur Gupta3 and David Kjerrumgaard4

(1)Saint Louis, Missouri, USA

(2)West Kilbride North Ayrshire, UK

(3)Uxbridge, UK

(4)Henderson, Nevada, USA

Hive would not be much of a useful data warehouse tool without the ability to query data. Luckily, querying and providing schema-on-read capabilities at scale is the core foundation for Hive use cases. The power Hive provides is the ability to translate a large variety of data formats as well as the ability to customize translations to fit ...

Get Practical Hive: A Guide to Hadoop's Data Warehouse System now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.