Chapter 4: Unsupervised machine learning: clustering algorithms

Abstract

In this chapter, various unsupervised machine learning algorithms such as k-means clustering, hierarchical clustering, density-based spatial clustering of applications with noise (DBSCAN), isolation forest, and local outlier factor (LOF) are discussed. This chapter starts with a step-by-step explanation of how k-means clustering functions, the math behind the clustering technique, and the step-by-step codes to apply k-means clustering to a geologic data set using scikit-learn in Python. Determining the number of clusters when using k-means clustering using the elbow point is also discussed and illustrated in Python. Another method for determining the effectiveness of the clusters ...

Get Machine Learning Guide for Oil and Gas Using Python now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.