Advanced analysis - undirected methods

Data mining and machine learning techniques are divided into two main classes:

  • The directed, or supervised approach: You use known examples and apply information to unknown examples to predict selected target variable(s)
  • The undirected, or unsupervised approach: You discover new patterns inside the dataset as a whole

The most common undirected techniques are clustering, dimensionality reduction, and affinity grouping, also known as basket analysis or association rules. An example of clustering is looking through a large number of initially undifferentiated customers and trying to see if they fall into natural groupings based on similarities or dissimilarities of their features. This is a pure example of "undirected ...

Get SQL Server 2016 Developer's Guide now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.