Exploring data preprocessing tools and methods

  1. Start the NotebookApp from the project directory by executing jupyter notebook. Navigate to the chapter-2 directory and open up the chapter-2-workbook.ipynb file. Find the cell near the top where the packages are loaded, and run it.

We are going to start by showing off some basic tools from Pandas and scikit-learn. Then, we'll take a deeper dive into methods for rebuilding missing data.

  1. Scroll down to Subtopic Preprocessing data for machine learning and run the cell containing pd.merge? to display the docstring for the merge function in the notebook:

As we can see, the function accepts left ...

Get Applied Deep Learning with Python now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.