on-demand course

From 0 to 1: Hive for Processing Big Data

with Loonycorn Ravi

December 2017

Beginner to intermediate

15h 16m

English

Packt Publishing

Closed Captioning available in English

Course outline

You, Us & This Course
2m 3s
Hive: An Open-Source Data Warehouse
12m 59s
Hive and Hadoop
9m 19s
Hive vs Traditional Relational DBMS
13m 52s
HiveQL and SQL
7m 21s
Hadoop Install Modes
8m 33s
Hadoop Install Step 1: Standalone Mode
15m 47s
Hadoop Install Step 2: Pseudo-Distributed Mode
11m 45s
Hive install
12m 5s
Code-Along: Getting started
6m 25s
What is Hadoop?
7m 25s
HDFS or the Hadoop Distributed File System
11m 1s
Primitive Datatypes
17m 8s
Collections: Arrays and Maps
9m 29s
Structs and Unions
5m 58s
Create Table
13m 15s
Insert Into Table
12m 5s
Insert into Table 2
6m 51s
Alter Table
7m 22s
HDFS
9m 25s
HDFS CLI - Interacting with HDFS
10m 59s
Code-Along: Create Table
9m 54s
Code-Along: Hive CLI
3m 7s
Three types of Hive functions
6m 46s
The Case-When statement, the Size function, the Cast function
10m 10s
The Explode function
13m 7s
Code-Along: Hive Built-in functions
4m 28s
Quirky Sub-Queries
7m 14s
More on subqueries: Exists and In
15m 14s
Inserting via subqueries
5m 23s
Code-Along: Use Subqueries to work with Collection Datatypes
5m 57s
Views
12m 18s
Indices
6m 41s
Partitioning Introduced
6m 37s
The Rationale for Partitioning
6m 16s
How Tables are partitioned
9m 53s
Using Partitioned Tables
5m 27s
Dynamic Partitioning: Inserting data into partitioned tables
12m 44s
Code-Along: Partitioning
4m 4s
Introducing Bucketing
11m 57s
The Advantages of Bucketing
4m 55s
How Tables are bucketed
12m 37s
Using Bucketed Tables
7m 22s
Sampling
11m 13s
Windowing Introduced
12m 59s
Windowing - A Simple Example: Cumulative Sum
9m 39s
Windowing - A More Involved Example: Partitioning
11m 55s
Windowing - Special Aggregation Functions
15m 8s

Overview

In this 15-hour course, you'll learn to use Apache Hive for big data processing, from the basics of the SQL-like HiveQL language to advanced topics such as query optimization and user-defined functions. Whether you're analyzing data or tuning Hive for performance, this hands-on course walks step by step through real-world use.

What I will be able to do after this course

Master the core features of HiveQL and SQL syntax for big data analysis.
Understand and implement Hive query optimizations like partitioning and bucketing.
Develop custom User Defined Functions (UDFs) in Java and Python for tailored analyses.
Utilize advanced Hive functionalities including subqueries, joins, and windowing.
Gain a comprehensive understanding of Hive's interaction with Hadoop and MapReduce.

Course Instructor(s)

Ravi Loonycorn is an experienced technology professional with a rich background in software development and data analysis. With a passion for teaching, Ravi has created numerous in-depth courses that make complex topics clear and practical. His goal is to empower learners with skills they can directly apply in their careers.

Who is it for?

This course is ideal for data analysts seeking to harness Hive for complex data queries and engineers looking to optimize and extend Hive in a data warehousing setup. Beginners in SQL will find the included primer helpful, while professionals will appreciate the advanced content to scale their skillset.

Publisher Resources

ISBN: 9781788995054

From 0 to 1: Hive for Processing Big Data

Chapter 1: You, Us & This Course

Chapter 2: Introducing Hive

Chapter 3: Hadoop and Hive Install

Chapter 4: Hadoop and HDFS Overview

Chapter 5: Hive Basics

Chapter 6: Built-in Functions

Chapter 7: Sub-Queries

Chapter 8: Partitioning

Chapter 9: Bucketing

Chapter 10: Windowing

Chapter 11: Understanding MapReduce

Chapter 12: MapReduce Logic for Queries: Behind the Scenes

Chapter 13: Join Optimizations in Hive

Chapter 14: Custom Functions in Python

Chapter 15: Custom Functions in Java

Chapter 16: SQL Primer - Select Statements

Chapter 17: SQL Primer - Group By, Order By and Having

Chapter 18: SQL Primer - Joins

Chapter 19: Appendix

You might also like

CCA 159: Expert in Big Data Analytics - Advance Hive and Sqoop

Practical Hive: A Guide to Hadoop's Data Warehouse System

Creating Big Data Solutions with Impala

Azure Storage, Streaming, and Batch Analytics
