book

Hands-On Unsupervised Learning with Python

by Giuseppe Bonaccorso

February 2019

Intermediate to advanced

386 pages

9h 54m

English

Packt Publishing

Read now

Unlock full access

Content preview from Hands-On Unsupervised Learning with Python

Contingency matrix

A very simple and powerful tool that can show the performance of a clustering algorithm when the ground truth is known is the contingency matrix C_m. If there are m classes, C_m ∈ ℜ^{m × m} and each element C_m(i, j) represents the number of samples with Y_true = i that have been assigned to the cluster j. Hence, a perfect contingency matrix is diagonal, while the presence of elements in all the other cells indicates a clustering error.

In our case, we obtain the following:

from sklearn.metrics.cluster import contingency_matrixcm = contingency_matrix(kmdff['diagnosis'].apply(lambda x: 0 if x == 'B' else 1), kmdff['prediction'])

The output of the previous snippet can be visualized as a heat map (the variable cm is a (2 × 2) matrix): ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Start your free trial

Hands-On Unsupervised Learning Using Python

Publisher Resources

ISBN: 9781789348279Supplemental Content

Hands-On Unsupervised Learning with Python

by Giuseppe Bonaccorso

Contingency matrix

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

You might also like

Hands-On Unsupervised Learning Using Python

Deep Learning with Python

Deep Learning with Python, Second Edition

Introduction to Machine Learning with Python

Publisher Resources