November 2019
Intermediate to advanced
346 pages
9h 36m
English
In the following steps, we will see a demonstration of how scikit-learn's K-means clustering algorithm performs on a toy PE malware classification:
import pandas as pdimport plotly.express as pxdf = pd.read_csv("file_pe_headers.csv", sep=",")fig = px.scatter_3d( df, x="SuspiciousImportFunctions", y="SectionsLength", z="SuspiciousNameSection", color="Malware",)fig.show()
The following screenshot shows the output:

y = df["Malware"]X = df.drop(["Name", "Malware"], axis=1).to_numpy()