Chapter 5

Grouping Your Way into Accurate Predictions

IN THIS CHAPTER

check Understanding the basics of clustering, classification, and other grouping algorithms

check Clustering your data with the k-means algorithm and kernel density estimation

check Choosing between decision trees and random forests

check Getting to know hierarchical and neighborhood clustering algorithms

check Working through nearest neighbor algorithms

When it comes to making predictions from data, grouping techniques can be a simple and powerful way to generate valuable insights quickly. Although grouping methods tend to be relatively simple, you can choose from quite a few approaches. In this chapter, I introduce you to classification, and clustering algorithms, as well as decision trees and random forests.

Data scientists use clustering to help them divide their unlabeled data into subsets. If they’re starting with labeled data, they can use classification methods to build predictive models that they can then use to forecast the classification ...

Get Data Science For Dummies, 3rd Edition now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.