Chapter 5
Grouping Your Way into Accurate Predictions
IN THIS CHAPTER
Understanding the basics of clustering, classification, and other grouping algorithms
Clustering your data with the k-means algorithm and kernel density estimation
Choosing between decision trees and random forests
Getting to know hierarchical and neighborhood clustering algorithms
Working through nearest neighbor algorithms
When it comes to making predictions from data, grouping techniques can be a simple and powerful way to generate valuable insights quickly. Although the individual methods tend to be relatively straightforward, you can choose from quite a few approaches. In this chapter, I introduce you to classification and clustering algorithms, as well as decision trees and random forests.
Data scientists use clustering to help them divide their unlabeled data into subsets. If they’re starting with labeled data, they can use classification methods to build predictive models that they can then use to forecast the classification ...
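To make the contrast between clustering and classification concrete, here is a minimal sketch in plain Python. It assumes a handful of made-up two-dimensional points, a bare-bones k-means routine for the unlabeled case, and a one-nearest-neighbor rule for the labeled case. The function names, the starting centers, and the "small"/"large" labels are all illustrative assumptions, not anything prescribed by a particular library.

```python
import math


def nearest(point, centers):
    """Return the index of the center closest to point (Euclidean distance)."""
    return min(range(len(centers)), key=lambda i: math.dist(point, centers[i]))


def kmeans(points, centers, iterations=10):
    """Cluster unlabeled points by alternating assignment and center updates."""
    centers = list(centers)
    for _ in range(iterations):
        groups = [[] for _ in centers]
        for p in points:
            groups[nearest(p, centers)].append(p)
        # Move each center to the mean of its group; keep it if the group is empty.
        centers = [
            tuple(sum(col) / len(g) for col in zip(*g)) if g else centers[i]
            for i, g in enumerate(groups)
        ]
    return [nearest(p, centers) for p in points]


def classify_1nn(labeled, query):
    """Classify a query by copying the label of the closest labeled point."""
    _, label = min(labeled, key=lambda pair: math.dist(pair[0], query))
    return label


# Toy data, invented for illustration only.
points = [(1.0, 1.0), (1.5, 2.0), (1.0, 1.5), (8.0, 8.0), (9.0, 8.5), (8.5, 9.0)]

# Clustering: no labels are supplied. Starting centers are an arbitrary assumption.
cluster_ids = kmeans(points, centers=[points[0], points[-1]])

# Classification: the labels are known up front and used to predict a new point.
labeled = list(zip(points, ["small", "small", "small", "large", "large", "large"]))
prediction = classify_1nn(labeled, (2.0, 1.0))
```

In this sketch, the clustering step only ever sees the coordinates and returns group numbers that carry no meaning of their own, whereas the classification step needs the labels in advance and returns one of them for the new point.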