January 2020
Beginner to intermediate
372 pages
10h
English
To complete this recipe, we need to import the necessary tools and load the data:
import pandas as pd
from sklearn.model_selection import train_test_split
from feature_engine.missing_data_imputers import EndTailImputer
data = pd.read_csv('creditApprovalUCI.csv')
The values at the end of the distribution should be calculated from the variables in the train set only, so we first separate the data into train and test sets:
X_train, X_test, y_train, y_test = train_test_split(
    data.drop('A16', axis=1), data['A16'], test_size=0.3, random_state=0)
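To make the point about learning the tail values from the train set concrete, here is a minimal sketch in plain pandas rather than with EndTailImputer. It assumes a Gaussian-style tail value of mean plus three standard deviations, and it uses a small synthetic DataFrame (the `toy` frame and its random values are hypothetical stand-ins, not the credit approval data) so that it runs on its own.

```python
import numpy as np
import pandas as pd
from sklearn.model_selection import train_test_split

# Hypothetical toy data standing in for the real dataset (assumption).
rng = np.random.default_rng(0)
toy = pd.DataFrame({
    "A2": rng.normal(30, 10, 200),
    "A3": rng.normal(5, 2, 200),
    "A16": rng.integers(0, 2, 200),
})
# Introduce some missing values in the numerical variables.
toy.loc[toy.sample(20, random_state=0).index, "A2"] = np.nan
toy.loc[toy.sample(20, random_state=1).index, "A3"] = np.nan

X_train, X_test, y_train, y_test = train_test_split(
    toy.drop("A16", axis=1), toy["A16"], test_size=0.3, random_state=0)

# Learn the end-of-tail value per variable from the train set only.
# Assumption: right tail, mean + 3 * standard deviation.
tail_values = X_train.mean() + 3 * X_train.std()

# Apply the same learned values to both train and test sets.
X_train_imputed = X_train.fillna(tail_values)
X_test_imputed = X_test.fillna(tail_values)
```

Computing `tail_values` from `X_train` alone and reusing it for `X_test` keeps information from the test set out of the imputation step.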