Finding the best ML pipeline for product line prediction

Let's first write a small wrapper function that prepares a dataset by encoding its categorical variables:

# Importing necessary libraries
import numpy as np
import pandas as pd
from autosklearn.classification import AutoSklearnClassifier
from autosklearn.regression import AutoSklearnRegressor
from sklearn.model_selection import train_test_split
from sklearn.metrics import accuracy_score
from sklearn.preprocessing import LabelEncoder
import wget

# Machine learning algorithms work with numerical inputs, so you need to transform all non-numerical inputs into numerical ones
# The following snippet encodes the categorical variables
link_to_data = 'https://apsportal.ibm.com/exchange-api/v1/entries/8044492073eb964f46597b4be06ff5ea/data?accessKey=9561295fa407698694b1e254d0099600'
...
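The listing above is cut off before the encoding step itself. As a minimal sketch of what such a step could look like, the following applies `LabelEncoder` to every non-numeric column of a DataFrame. The function name `encode_categorical` and the toy column names are assumptions made for illustration; they are not taken from the book's code or from the dataset behind the link.

```python
import pandas as pd
from pandas.api.types import is_numeric_dtype
from sklearn.preprocessing import LabelEncoder


def encode_categorical(df):
    """Return a label-encoded copy of df and the fitted encoders.

    Hypothetical helper: every non-numeric column is replaced by
    integer codes; numeric columns are left untouched.
    """
    encoded = df.copy()
    encoders = {}
    for column in encoded.columns:
        if is_numeric_dtype(encoded[column]):
            continue
        encoder = LabelEncoder()
        # Cast to str so mixed or missing values are encoded consistently
        values = encoded[column].astype(str).tolist()
        encoded[column] = encoder.fit_transform(values)
        # Keep the encoder so codes can be mapped back to labels later
        encoders[column] = encoder
    return encoded, encoders


# Toy data with made-up column names, for illustration only
sample = pd.DataFrame({
    "GENDER": ["M", "F", "F"],
    "AGE": [27, 39, 45],
    "PRODUCT_LINE": ["Camping Equipment", "Golf Equipment", "Camping Equipment"],
})
encoded_sample, fitted_encoders = encode_categorical(sample)
print(encoded_sample)
```

Keeping the fitted encoders around means predicted codes can later be turned back into readable labels with `fitted_encoders["PRODUCT_LINE"].inverse_transform(...)`. Note that `LabelEncoder` assigns codes in sorted label order, which imposes an arbitrary ordering on the categories; whether that is acceptable depends on the models being searched.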
