© Hisham El-Amir and Mahmoud Hamdy 2020
H. El-Amir, M. Hamdy, Deep Learning Pipeline, https://doi.org/10.1007/978-1-4842-5349-6_6

6. Data Wrangling and Preprocessing

Hisham El-Amir (1) and Mahmoud Hamdy (1)
(1) Jizah, Egypt

In the previous chapter, we defined what data means and discussed the types and levels of data. Now it is time to get hands-on with data: in this chapter, you’ll learn how to understand and clean your dataset.

Some books and references give the topic of this chapter a different name: data munging.

Munging means manipulating or changing a piece of original data, through a series of well-specified and reversible steps, into a completely different (and hopefully more useful) form. You might see some data scientist ...
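The idea of "well-specified and reversible steps" can be sketched in plain Python. This is only a minimal illustration, not the authors' method: the field names (`weight_g`, `weight_kg`), the helpers (`divide_by`, `rename`, `munge`, `unmunge`), and the sample values are all hypothetical, chosen so that each step has an exact inverse.

```python
from typing import Callable, Dict, List, Tuple

Row = Dict[str, float]
# A step is a (forward, inverse) pair of row-level functions.
Step = Tuple[Callable[[Row], Row], Callable[[Row], Row]]


def divide_by(field: str, divisor: float) -> Step:
    """Rescale one field; the inverse multiplies it back."""
    def forward(row: Row) -> Row:
        return {**row, field: row[field] / divisor}

    def inverse(row: Row) -> Row:
        return {**row, field: row[field] * divisor}

    return forward, inverse


def rename(old: str, new: str) -> Step:
    """Rename one field; the inverse restores the old name."""
    def forward(row: Row) -> Row:
        out = dict(row)
        out[new] = out.pop(old)
        return out

    def inverse(row: Row) -> Row:
        out = dict(row)
        out[old] = out.pop(new)
        return out

    return forward, inverse


def munge(rows: List[Row], steps: List[Step]) -> List[Row]:
    """Apply every forward step, in order, to every row."""
    for forward, _ in steps:
        rows = [forward(r) for r in rows]
    return rows


def unmunge(rows: List[Row], steps: List[Step]) -> List[Row]:
    """Undo the pipeline by applying the inverses in reverse order."""
    for _, inverse in reversed(steps):
        rows = [inverse(r) for r in rows]
    return rows


# Hypothetical raw data: weights recorded in grams.
raw = [{"weight_g": 2500.0}, {"weight_g": 500.0}]

# Assumed pipeline: convert grams to kilograms, then rename the field.
steps = [divide_by("weight_g", 1000.0), rename("weight_g", "weight_kg")]

cleaned = munge(raw, steps)
restored = unmunge(cleaned, steps)
```

Keeping each transformation paired with its inverse makes the pipeline easy to audit: the original records can be recovered from the munged ones at any point. Real cleaning steps such as dropping rows or trimming whitespace discard information, so they would need the removed values stored separately to remain reversible.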
