Understanding the task by understanding the data

What is always the first step in tackling a new machine learning problem?

You are absolutely right: to get a sense of the data. The better we understand the data, the better we understand the problem we are trying to solve. In our future endeavors, this will also help us choose an appropriate machine learning algorithm.

The first thing to realize is that the 'drug' column is actually not a feature value like all the other columns. Since it is our goal to predict which drug will be prescribed based on a patient's blood values, the 'drug' column effectively becomes the target labels. In other words, the inputs to our machine learning algorithm will be all blood values, age, and gender of a patient. ...

Get Machine Learning for OpenCV now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.