7

Factorization

One way to think about almost everything we do in data science is as dimension reduction. We are trying to learn from high-dimensional x some low-dimensional summaries that contain the information necessary to make good decisions.

Dimension reduction can be supervised or unsupervised. In supervised learning, an outside “response” variable y dictates the direction of dimension reduction. In regression, a high-dimensional x is projected through coefficients β to create the low-dimensional (univariate) summary ŷ. Chapters 2–4 were all about supervised learning.
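To make the regression case concrete, here is a minimal sketch in Python with NumPy, which is a choice made for illustration and not something the text prescribes. The simulated data, the dimensions `n` and `p`, and the names `beta_true` and `beta_hat` are all assumptions. It shows a 10-dimensional x being collapsed, through fitted coefficients, into a single number ŷ per observation.

```python
import numpy as np

# Simulated data (an assumption for illustration): n observations of a p-dimensional x.
rng = np.random.default_rng(0)
n, p = 200, 10
X = rng.normal(size=(n, p))
beta_true = rng.normal(size=p)
y = X @ beta_true + rng.normal(scale=0.1, size=n)

# Ordinary least squares: the response y determines the direction beta_hat.
beta_hat, *_ = np.linalg.lstsq(X, y, rcond=None)

# Projecting each row of X through beta_hat gives a univariate summary y_hat.
y_hat = X @ beta_hat

print(X.shape, "->", y_hat.shape)
```

Each row of `X` has ten entries, while `y_hat` has one value per row: the response has picked out the single direction in x-space that the regression keeps.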

In contrast, for unsupervised learning there is no response or outcome. ...

