In this Analytic Data Storage in Hadoop training course, expert author Ryan Blue will teach you about typical storage and ingest patterns in Hadoop. This course is designed for users that are already familiar with Hadoop.
You will start by learning how to create the dataset, load sample data, and query a dataset. From there, Ryan will teach you about partitioning, formats, and Avro. This video tutorial also covers parquet, bulk data drops, and database snapshots and mirroring. Finally, you will learn about event stream processing, including how to build a test pipeline and move to production.
Once you have completed this computer based training course, you will have gained a solid understanding of typical storage and ingest patterns in Hadoop. Working files are included, allowing you to follow along with the author throughout the lessons.
Table of contents
- Getting Started With Hadoop
- Bulk Data Drops
- Database Snapshots And Mirroring
- Event Stream Processing
- Wrap Up 00:00:51
- Title: Analytic Data Storage in Hadoop
- Release date: October 2015
- Publisher(s): Infinite Skills
- ISBN: 9781771375214
You might also like
Microsoft Power BI - The Complete Masterclass [2023 EDITION]
Microsoft Power BI is an interactive data visualization software primarily focusing on business intelligence, part of …
Complete Git Guide: Understand and Master Git and GitHub
Complete with practical activities, this comprehensive Git and GitHub guide will help you understand how Git …
Introduction to Apache NiFi (Hortonworks DataFlow - HDF 2.0)
Apache NiFi was initially used by the NSA so they could move data at scale and …
Statistics and Mathematics for Data Science and Data Analytics
If you aim for a career in data science or data analytics, this course will equip …