Let's now look at clustering the Headlines dataset:
- We will import the required functions:
- We also want to construct silhouette plots, so we need to compute the Jaccard similarity. For that, we will use the following lines of code:
This results in the following output:
- Now, let's go ahead and perform this clustering, using the following function:
- We will then print out the clusters, using the following ...