July 2018
Intermediate to advanced
474 pages
13h 37m
English
The dataframe, df, has some additional columns, _c3, _c4, _c5, and _c6 that will not be used in the model and therefore, can be excluded from the dataset using the following script:
df = df.select('id', 'label', 'chat')df.show()
The output of the script can be seen in the following screenshot:

Read now
Unlock full access