4 Preparing the data, part 2: Transforming the data
This chapter covers
- Dealing with more incorrect values
- Mapping complex, multiword values to single tokens
- Fixing type mismatches
- Dealing with rows that still contain bad values after cleanup
- Creating new columns derived from existing columns
- Preparing categorical and text columns to train a deep learning model
- Reviewing the end-to-end solution introduced in chapter 2
In chapter 3, we corrected a set of errors and anomalies in the input dataset. There’s still more cleanup and preparation to be done in the dataset, and that’s what we’ll do in this chapter. We’ll deal with remaining issues (including multiword tokens and type mismatches) and go over your choices about how to deal with the bad ...
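As a rough illustration of three of the items listed above (mapping multiword values to single tokens, fixing type mismatches, and handling rows that are still bad after cleanup), here is a minimal sketch in plain Python. The rows, column names, and helper functions are hypothetical and are not taken from the book's code; dropping bad rows is only one of several possible choices.

```python
# Hypothetical raw rows; the column names and values are illustrative only.
rows = [
    {"route": "501 queen", "delay_minutes": "12"},
    {"route": "504 king", "delay_minutes": "n/a"},
    {"route": "501  Queen ", "delay_minutes": "7"},
]


def to_single_token(value: str) -> str:
    """Collapse a multiword value into one lowercase, underscore-joined token."""
    return "_".join(value.lower().split())


def to_int_or_none(value: str):
    """Fix a type mismatch: return an int, or None when the value is not numeric."""
    try:
        return int(value)
    except ValueError:
        return None


cleaned = []
for row in rows:
    delay = to_int_or_none(row["delay_minutes"])
    if delay is None:
        # One possible choice (an assumption here): drop rows that are
        # still bad after cleanup instead of imputing a value.
        continue
    cleaned.append(
        {"route": to_single_token(row["route"]), "delay_minutes": delay}
    )

print(cleaned)
```

Normalizing case and whitespace before joining means that variants such as "501 queen" and "501  Queen " map to the same token, which keeps the set of distinct categorical values small.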