Chapter 3. Feature Extraction and Preprocessing

The examples discussed in the previous chapter used simple numeric explanatory variables, such as the diameter of a pizza. Many machine learning problems require learning from observations of categorical variables, text, or images. In this chapter, you will learn basic techniques for preprocessing data and creating feature representations of these observations. These techniques can be used with the regression models discussed in Chapter 2, Linear Regression, as well as the models we will discuss in subsequent chapters.

Extracting features from categorical variables

Many machine learning problems have categorical, or nominal, rather than continuous features. For example, an application that predicts ...

Get scikit-learn : Machine Learning Simplified now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.

Start your free trial

scikit-learn : Machine Learning Simplified by Raúl Garreta, Guillermo Moncecchi, Trent Hauck, Gavin Hackeling

Chapter 3. Feature Extraction and Preprocessing

Extracting features from categorical variables

Don’t leave empty-handed

It’s yours, free.

Check it out now on O’Reilly