Chapter 6

Learning the Hidden Secrets of Data Wrangling

Learning Objectives

By the end of this chapter, you will be able to:

  • Clean and handle real-life messy data
  • Prepare data for data analysis by formatting data in the format required by downstream systems
  • Identify and remove outliers from data

In this chapter, you will learn about data issues that happen in real-life. You will also learn how to solve these issues.

Introduction

In this chapter, we will learn about the secret sauce behind creating a successful data wrangling pipeline. In the previous chapters, we were introduced to the basic data structures and building blocks of Data Wrangling, such as pandas and NumPy. In this chapter, we will look at the data handling section of data ...

Get Data Wrangling with Python now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.