Skip to Content
Data Science For Dummies, 2nd Edition
book

Data Science For Dummies, 2nd Edition

by Lillian Pierson, Jake Porway
March 2017
Beginner
384 pages
9h 24m
English
For Dummies
Audiobook available
Content preview from Data Science For Dummies, 2nd Edition

Chapter 6

Using Clustering to Subdivide Data

IN THIS CHAPTER

check Understanding the basics of clustering

check Clustering your data with the k-means algorithm and kernel density estimation

check Getting to know hierarchical and neighborhood clustering algorithms

check Checking out decision tree and random forest algorithms

Data scientists use clustering to help them divide their unlabeled data into subsets. The basics behind clustering are relatively easy to understand, but things get tricky fast when you get into using some of the more advanced algorithms. In this chapter, I introduce the basics behind clustering. I follow that by introducing several nuanced algorithms that offer clustering solutions to meet your requirements, based on the specific characteristics of your feature dataset.

Introducing Clustering Basics

To grasp advanced methods for use in clustering your data, you should first take a few moments to make sure you have a firm understanding of the basics that underlie all forms of clustering. Clustering is a form of machine learning — the machine in this case is your computer, and learning ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

AirBnbBlueOriginElectronic ArtsHomeDepotNasdaqRakutenTata Consultancy Services

QuotationMarkO’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.
Julian F.
Head of Cybersecurity
QuotationMarkI wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.
Addison B.
Field Engineer
QuotationMarkI’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.
Amir M.
Data Platform Tech Lead
QuotationMarkI'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.
Mark W.
Embedded Software Engineer

You might also like

Data Science For Dummies, 3rd Edition

Data Science For Dummies, 3rd Edition

Lillian Pierson
Data Science, 2nd Edition

Data Science, 2nd Edition

Vijay Kotu, Bala Deshpande
Python for Data Science For Dummies, 2nd Edition

Python for Data Science For Dummies, 2nd Edition

John Paul Mueller, Luca Massaron

Publisher Resources

ISBN: 9781119327639Purchase book