Chapter 9

Identifying Clustered and Related Data

IN THIS CHAPTER

Finding clusters of data

Exposing relationships between dataset items

Choosing the right algorithm for your data

Assessing the value of your model with diagnostics

Writing Python code to find clusters of data

You find out about some of the most common analytics models in Chapter 7 and how to help ensure that you have the data your models need in Chapter 8. Now you’re ready to dig into specific models to discover how to choose the best one for your goals, write the code in Python to implement the model, and assess its accuracy and relevance.

One of the first steps in analyzing any dataset is to determine its structure and the relationships among its data items. Allow your data to tell any stories that lie just below the surface. Some of these stories will take work to uncover, but other findings will almost jump out at you.

Cluster analysis and association rules analysis are both straightforward methods of letting your data ...
