H. El-Amir, M. HamdyDeep Learning Pipelinehttps://doi.org/10.1007/978-1-4842-5349-6_6

6. Data Wrangling and Preprocessing

Hisham El-Amir¹ and Mahmoud Hamdy¹

(1)

Jizah, Egypt

In the previous chapter, we defined what data means; we also discussed types and levels of data. So, we are now just getting into action with data! In this chapter, you’ll learn how to understand and clean your dataset.

In some books or references you will find the topic of this chapter has a different name; they might call it data munging.

Munging means to manipulate or change, in a series of well-specified and reversible steps, a piece of original data to a completely different—and hopefully more useful—one. You might see some data scientist ...

Get Deep Learning Pipeline: Building a Deep Learning Model with TensorFlow now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.

Start your free trial

Deep Learning Pipeline: Building a Deep Learning Model with TensorFlow by Hisham El-Amir, Mahmoud Hamdy

6. Data Wrangling and Preprocessing

Don’t leave empty-handed

It’s yours, free.

Check it out now on O’Reilly