7.4 Implementing Cluster Analysis on Simple Data

We now turn our attention to the fundamental steps of the cluster analysis technique. To concentrate on the algorithm, we use a very simple data set. Later, we will apply these same ideas to more complex data. Our example data in this section consists of exam scores for a group of 21 students taking an introduction to computer science course. These scores represent the percentage of correct answers. The range of scores is between 0 and 100.

Table 7.1 provides a simple listing of the scores. Our initial observation does not reveal any patterns. We could certainly use our previous work to compute descriptive statistics that might yield a bit more information, but right now we are interested in identifying ...

Get Python Programming in Context, 4th Edition now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.