Video description
Apache Spark 2.0 has become a leading tool for processing large datasets. This course, designed for learners with basic Python programming experience, is an introduction to big data analysis using Spark 2.0, Python, and the Spark DataFrame API.
The course begins with an overview of Spark 2.0 and Python, then moves into a detailed examination of DataFrames. You'll learn how to use SQL with DataFrames, work with DataFrame dates and timestamps, run DataFrame aggregate operations, and handle missing data in DataFrames. The course includes a hands-on data analysis exercise using real stock data. Learners should have Python and Spark installed on their computers before starting the class.
- Gain a core understanding of Spark 2.0 and Spark DataFrames
- Learn how to use Python with Spark DataFrames
- Gain big data experience analyzing stock data with Python and Spark DataFrames
Product information
- Title: Analyzing Data Using Spark 2.0 DataFrames With Python
- Author(s):
- Release date: May 2017
- Publisher(s): Infinite Skills
- ISBN: 9781491986844
You might also like
book
Beginning Data Analysis with Python And Jupyter
Use powerful industry-standard tools to unlock new, actionable insight from your existing data. About This Book: Get …
book
Frank Kane's Taming Big Data with Apache Spark and Python
Frank Kane’s hands-on Spark training course, based on his bestselling Taming Big Data with Apache Spark …
video
Jupyter Notebook for Data Science Teams
In this Jupyter Notebook for Data Science Teams training course, expert author Jonathan Whitmore will teach …
video
Creating Big Data Solutions with Impala
In this Creating Big Data Solutions with Impala training course, expert author Jesse Anderson will teach …