August 2018
Intermediate to advanced
438 pages
12h 3m
English
Data and ML are the best of friends, yet a lot of issues come with more and bigger data. A large number of attributes or a bloated-up feature space is one common problem. A large feature space poses problems in analyzing and visualizing the data along with issues related to training, memory, and space constraints. This is also known as the curse of dimensionality. Since unsupervised methods help us extract insights and patterns from unlabeled training datasets, they are also useful in helping us reduce dimensionality.
In other words, unsupervised methods help us reduce feature space by helping us select a representative set of features from the complete available list: