4 Preparing the data, part 2: Transforming the data

This chapter covers

Dealing with more incorrect values
Mapping complex, multiword values to single tokens
Fixing type mismatches
Dealing with rows that still contain bad values after cleanup
Creating new columns derived from existing columns
Preparing categorical and text columns to train a deep learning model
Reviewing the end-to-end solution introduced in chapter 2

In chapter 3, we corrected a set of errors and anomalies in the input dataset. There’s still more cleanup and preparation to be done in the dataset, and that’s what we’ll do in this chapter. We’ll deal with remaining issues (including multiword tokens and type mismatches) and go over your choices about how to deal with the bad ...

Get Deep Learning with Structured Data now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.

Start your free trial

Deep Learning with Structured Data by Mark Ryan

4 Preparing the data, part 2: Transforming the data

Don’t leave empty-handed

It’s yours, free.

Check it out now on O’Reilly