Latent Semantic Analysis

The idea behind LSA is to factorize M_dw in order to extract a set of latent variables (variables whose existence we can assume, but which cannot be observed directly) that act as connectors between documents and terms. As discussed in Chapter 11, Introducing Recommendation Systems, a very common decomposition method is Singular Value Decomposition (SVD).
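In its standard form, SVD can be written as follows. This is a sketch under the assumption that M_dw is a real m × n matrix with one row per document and one column per term; the dimension names m and n and the factor names U, Σ, and V are introduced here for illustration and may differ from the book's notation:

```latex
M_{dw} = U \Sigma V^{T}, \qquad
U \in \mathbb{R}^{m \times m}, \quad
\Sigma \in \mathbb{R}^{m \times n}, \quad
V \in \mathbb{R}^{n \times n}
```

Here U and V are orthogonal matrices, and Σ is a rectangular diagonal matrix whose non-negative diagonal entries are the singular values, conventionally sorted in descending order.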

However, we're not interested in a full decomposition; we are interested only in the subspace defined by the top k singular values.
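Keeping only the top k singular values amounts to retaining the first k columns of U, the first k singular values, and the first k rows of V^T. The following is a minimal NumPy sketch of that truncation, not the book's own code: the toy matrix, the value k = 2, and the helper name truncated_svd are all hypothetical, chosen only to make the idea concrete.

```python
import numpy as np

# Hypothetical toy document-term matrix: 4 documents (rows) x 5 terms (columns).
# The counts are made up purely for illustration.
M_dw = np.array([
    [2.0, 1.0, 0.0, 0.0, 0.0],
    [1.0, 2.0, 0.0, 0.0, 1.0],
    [0.0, 0.0, 3.0, 1.0, 0.0],
    [0.0, 0.0, 1.0, 2.0, 0.0],
])


def truncated_svd(M, k):
    """Return the factors of the rank-k truncation of M (illustrative helper)."""
    # np.linalg.svd returns the singular values sorted in descending order,
    # so slicing the first k entries keeps the k largest ones.
    U, s, Vt = np.linalg.svd(M, full_matrices=False)
    return U[:, :k], s[:k], Vt[:k, :]


k = 2  # number of latent variables to keep (an arbitrary choice here)
U_k, s_k, Vt_k = truncated_svd(M_dw, k)

# Rank-k approximation of the original matrix.
M_k = U_k @ np.diag(s_k) @ Vt_k

# Coordinates of each document in the k-dimensional latent space.
doc_topics = U_k * s_k

# Coordinates of each term in the same latent space.
term_topics = Vt_k.T * s_k

print("Approximation shape:", M_k.shape)
print("Frobenius reconstruction error:", np.linalg.norm(M_dw - M_k))
```

In this sketch, each row of doc_topics describes a document in terms of the k latent variables, and each row of term_topics does the same for a term, which is the connector role the latent variables play between documents and terms. For large sparse matrices, a dedicated truncated solver is usually preferable to computing the full decomposition and slicing it.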

This approximation has the ...
