CHAPTER 26 Unsupervised Machine Learning: The Apriori Algorithm

The chapters on Linear Regression, Decision Trees, Random Forests, AdaBoost, Neural Networks and k-Nearest Neighbours all cover algorithms from the supervised machine learning family. In such cases, the data set contains two types of variables: a vector of features and a vector of dependent variables (usually of dimension one). Given a data set of observations of both the features and the corresponding dependent variables, we calibrate the chosen model using both types of data. The relationship between the features and the dependent variable is then learnt.

In this chapter, we assume that the data set contains only features. The objective is to understand the relationships between the features themselves. This can be formalised as follows. The set of features is generated by a joint density distribution ...
