November 2019
Intermediate to advanced
346 pages
9h 36m
English
In the following steps, we will read in a featurized dataset of URLs and train a classifier on it.
import pandas as pdimport ostrain_CSV = os.path.join("phishing-dataset", "train.csv")test_CSV = os.path.join("phishing-dataset", "test.csv")train_df = pd.read_csv(train_CSV)test_df = pd.read_csv(test_CSV)
y_train = train_df.pop("target").valuesy_test = test_df.pop("target").values
X_train = train_df.valuesX_test = test_df.values
from sklearn.ensemble import RandomForestClassifierfrom sklearn.metrics import ...