book

Python: Real World Machine Learning

by Prateek Joshi, John Hearty, Bastiaan Sjardin, Luca Massaron, Alberto Boschetti

November 2016

Beginner to intermediate

941 pages

21h 55m

English

Packt Publishing

Read now

Unlock full access

Content preview from Python: Real World Machine Learning

Clustering data using the k-means algorithm

The k-means algorithm is one of the most popular clustering algorithms. This algorithm is used to divide the input data into k subgroups using various attributes of the data. Grouping is achieved using an optimization technique where we try to minimize the sum of squares of distances between the datapoints and the corresponding centroid of the cluster. If you need a quick refresher, you can learn more about k-means at http://www.onmyphd.com/?p=k-means.clustering&ckattempt=1.

How to do it…

The full code for this recipe is given in the kmeans.py file already provided to you. Let's look at how it's built. Create a new Python file, and import the following packages:
```
import numpy as np import matplotlib.pyplot ...
```

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Start your free trial

Interpretable Machine Learning with Python

Serg Masís

Large Scale Machine Learning with Python

Luca Massaron, Alberto Boschetti, Bastiaan Sjardin

Hands-On Machine Learning with scikit-learn and Scientific Python Toolkits

Tarek Amr

Python Machine Learning Cookbook - Second Edition

Giuseppe Ciaburro, Prateek Joshi

Publisher Resources

ISBN: 9781787123212Supplemental Content Purchase Link