O'Reilly logo

Practical Predictive Analytics by Ralph Winters

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

Examining cluster 2

After tabling cluster 2, look at a portion of the file. When looking at both the first and last words together, it seems like this cluster has something to do with hanging holders:

> head(cluster2[10:13]) Desc2 lastword firstword Cluster 6 GlassStarFrostedT-lightHolder HOLDER GLASS 2 57 HangingHeartT-lightHolder HOLDER HANGING 2 62 GlassStarFrostedT-lightHolder HOLDER GLASS 2 70 HangingHeartT-lightHolder HOLDER HANGING 2 81 HangingHeartT-lightHolder HOLDER HANGING 2 156 ColourGlassT-lightHolderHanging HANGING COLOUR 2

Rather than looking just at records, look at the frequencies of the most popular words. Heart, hanging, and folder are the three most frequently occurring words in cluster 2:

 tail(sort(table(cluster2$lastword)), ...

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, interactive tutorials, and more.

Start Free Trial

No credit card required