Technical requirements

In this chapter, we will use the following Python libraries: pandas, NumPy, SciPy, and scikit-learn. I recommend installing the free Anaconda Python distribution, which contains all of these packages. For details on how to install the Anaconda Python distribution, see the Technical requirements section in Chapter 1, Foreseeing Variable Problems in Building ML Models.

In this chapter, we will also use the open source Python library Feature-engine, which I created and which can be installed using pip:

pip install feature-engine

To find out more about Feature-engine, see its documentation.
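After installing, it can be useful to confirm that the libraries listed above are available in the active environment. The following is a minimal sketch, not part of the original recipe: it uses only the standard library's importlib.metadata (Python 3.8 or later), and the helper name installed_versions and the REQUIRED list are hypothetical names introduced here for illustration. The package names are assumed to be the ones published on PyPI.

```python
from importlib.metadata import PackageNotFoundError, version

# Distribution names as published on PyPI (an assumption; adjust if your
# environment uses different package names).
REQUIRED = ["pandas", "numpy", "scipy", "scikit-learn", "feature-engine"]


def installed_versions(packages):
    """Return a dict mapping each package name to its installed version
    string, or to None if the package is not installed."""
    found = {}
    for name in packages:
        try:
            found[name] = version(name)
        except PackageNotFoundError:
            # The package is missing from the active environment.
            found[name] = None
    return found


if __name__ == "__main__":
    for name, ver in installed_versions(REQUIRED).items():
        print(f"{name}: {ver or 'NOT INSTALLED'}")
```

Any package reported as not installed can be added with pip, as shown above for Feature-engine, or through Anaconda.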
