O'Reilly logo

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

Data Wrangling with Python 3.x

Video Description

Extract, transform, and load data for effective data analysis

About This Video

  • Perform effective data wrangling to achieve your analytical goals by working with real-world problems.
  • A step-by-step guide to acquiring and then pre-processing datasets to draw useful insights from them.
  • Use the in-built features of Python to acquire, clean, analyze, and present data efficiently.

In Detail

You might be working in an organization, or have your own business, where data is being generated continuously (structured or unstructured) and you are looking to develop your skillset so you can jump into the field of Data Science. This hands-on guide shows programmers how to process information.

In this course, you will gather data, prepare data for analysis, perform simple statistical analyses, create meaningful data visualizations, and more! This course will equip us with the tools and technologies, also we need to analyze the datasets using Python so that we can confidently jump into the field and enhance our skill set. The best part of this course is the takeaway code templates generated using the real-life dataset.

Towards the end of the course, we will build an intuitive understanding of all the aspects available in Python for Data Wrangling.

All codes and supporting files are placed on GitHub at this link: https://github.com/PacktPublishing/-Data-Wrangling-with-Python-3.x

Downloading the example code for this course: You can download the example code files for all Packt video courses you have purchased from your account at http://www.PacktPub.com. If you purchased this course elsewhere, you can visit http://www.PacktPub.com/support and register to have the files e-mailed directly to you.

Table of Contents

  1. Chapter 1 : Gathering and Parsing Data
    1. The Course Overview 00:04:11
    2. Installing Anaconda Navigator on Windows/Linux 00:05:15
    3. Importing and Parsing CSV in Python 00:07:47
    4. Importing and Parsing JSON in Python 00:05:56
    5. Scraping Data from Public Web – Part 1 00:04:50
    6. Scraping Data from Public Web – Part 2 00:11:50
  2. Chapter 2 : Working with Data from Excel and PDF Files
    1. Importing and Parsing Excel Files – Part 1 00:05:18
    2. Importing and Parsing Excel Files – Part 2 00:05:24
    3. Manipulating PDF Files in Python – Part 1 00:04:51
    4. Manipulating PDF Files in Python – Part 2 00:05:53
  3. Chapter 3 : Storing Data in Persistent Storage
    1. Difference between Relational and Non-Relational Databases 00:03:46
    2. Storing Data in SQLite Databases 00:08:27
    3. Storing Data in MongoDB 00:06:26
    4. Storing Data in Elasticsearch 00:07:18
    5. Comparative Study of Databases for Storage 00:02:29
  4. Chapter 4 : Cleaning Structured Data
    1. The Most Important Step in Data Analysis 00:02:36
    2. Viewing/Inspecting DataFrames 00:06:44
    3. Renaming/Adding/Removing the DataFrame Columns 00:06:05
    4. Dropping Duplicate Rows 00:06:42
    5. Indexing DataFrame to Retrieve Specific Columns and Rows 00:07:08
    6. Merging/Concatenating/Joining DataFrames 00:08:04
    7. Dealing with Missing Values 00:08:37
  5. Chapter 5 : More Data Cleaning and Transformation
    1. Filtering and Sorting of DataFrame 00:06:38
    2. Encoding/Mapping Existing Values – Part 1 00:04:54
    3. Encoding/Mapping Existing Values – Part 2 00:04:42
    4. Rescale/Standardize Column Values 00:07:33
    5. Common Cleaning Operations 00:06:59
    6. Exporting Datasets for Future Use 00:05:35
  6. Chapter 6 : Performing Statistical Analysis
    1. Different Uses of Packages (Pandas, NumPy, SciPy, and Matplotlib) 00:01:59
    2. Types of Column Names/Features/Attributes in Structured Data 00:01:53
    3. Split-Apply-Combine (Performing Group By Operation) 00:05:37
    4. Descriptive Statistics Using Python – Part 1 00:05:32
    5. Descriptive Statistics Using Python – Part 2 00:05:13
  7. Chapter 7 : Let the Visualizations Tell the Story
    1. Using Visualizations 00:01:49
    2. Cool Visualization of Real-World Datasets of World Population Evolution 00:02:35
    3. Visualizations in Python – Part 1 00:09:33
    4. Visualizations in Python – Part 2 00:05:55
    5. Exploring an Online Visualization Tool (RAWGraphs) 00:03:46