Mastering Machine Learning with R - Second Edition
by Cory Lesmeister, Doug Ortiz, Vikram Dhillon, Miroslav Kopecky
Summary
In this chapter, we took a second stab at unsupervised learning techniques by exploring PCA, examining what it is, and applying it in a practical fashion. We explored how it can be used to reduce the dimensionality and improve the understanding of the dataset when confronted with numerous highly correlated variables. Then, we applied it to real data from the National Hockey League, using the resulting principal components in a regression analysis to predict total team points. Additionally, we explored ways to visualize the data and the principal components.
As an unsupervised learning technique, it requires some judgment along with trial and error to arrive at an optimal solution that is acceptable to business partners. Nevertheless, ...
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Read now
Unlock full access