Video description
In this Learning Apache Pig training course, expert author Tom Hanlon will teach you how to explore, manipulate, and analyze data stored on a Hadoop cluster. This course is designed for the absolute beginner, meaning no experience with Pig is required.
You will start by learning how to use Pig, then move on to Pig and HCatalog. From there, Tom will teach you about advanced Pig, including Pig scripts, parameters in Pig scripts, and Pig and Oozie. Finally, this video tutorial will teach you about Pig user-defined functions and streaming.
Once you have completed this computer-based training course, you will have learned how to explore, manipulate, and analyze big data in the Hadoop ecosystem. Working files are included, allowing you to follow along with the author throughout the lessons.
Table of contents
- Introduction
  - What Is Apache Pig And Who Uses It 00:02:52
  - What You Should Expect From This Video 00:02:56
  - About The Author 00:01:15
- Using Pig
  - Pig CLI, The Grunt Shell 00:06:27
  - Pig Latin Order, Join 00:11:06
  - Pig Latin Group By, For Each And Parallel 00:10:57
  - Hue With Pig 00:03:48
- Pig And HCatalog
  - What Is HCatalog 00:02:38
  - Using Catalog With Pig 00:11:01
- Advanced Pig
  - Pig Scripts 00:06:40
  - Parameters In Pig Scripts 00:06:47
  - Pig And Oozie 00:08:13
- Programming With Pig
  - Embedded Pig 00:03:18
- Pig UDF's And Streaming
  - User Defined Functions 00:04:13
  - Streaming Pig Data Through Custom Scripts 00:09:22
  - Using Hive UDFs In Pig 00:04:58
- Conclusion
  - Wrap Up 00:03:01
Product information
- Title: Learning Apache Pig
- Author(s): Tom Hanlon
- Release date: March 2016
- Publisher(s): Infinite Skills
- ISBN: 9781771375818