Latent semantic analysis

The idea behind latent semantic analysis is to factorize $M_{dw}$ so as to extract a set of latent variables (variables whose existence we can assume but which cannot be observed directly) that act as connectors between documents and terms. As discussed in Chapter 11, Introduction to Recommendation Systems, a very common decomposition method is SVD:

$$M_{dw} = U \Sigma V^T$$

However, we're not interested in a full decomposition; we are interested only in the subspace defined by the top k singular values:

$$M_k = U_k \Sigma_k V_k^T$$

This approximation has the reputation ...
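
The truncation to the top k singular values can be sketched with plain NumPy. This is a minimal illustration, not the book's own code: the toy document-term matrix, the helper name `truncated_svd`, and the choice k = 2 are all assumptions made for the example.

```python
import numpy as np

# Hypothetical toy document-term matrix (4 documents x 5 terms).
# The counts are invented purely for illustration.
M_dw = np.array([
    [2, 1, 0, 0, 0],
    [1, 2, 0, 0, 0],
    [0, 0, 1, 2, 1],
    [0, 0, 2, 1, 1],
], dtype=float)


def truncated_svd(M, k):
    """Return (U_k, s_k, Vt_k) for the k largest singular values of M."""
    # np.linalg.svd returns the singular values sorted in descending order,
    # so slicing the first k entries keeps the top-k subspace.
    U, s, Vt = np.linalg.svd(M, full_matrices=False)
    return U[:, :k], s[:k], Vt[:k, :]


k = 2  # assumed number of latent variables
U_k, s_k, Vt_k = truncated_svd(M_dw, k)

# Rank-k approximation of the original matrix: U_k * diag(s_k) * Vt_k
M_k = U_k @ np.diag(s_k) @ Vt_k

# Coordinates of each document in the k-dimensional latent space
doc_topics = U_k * s_k      # shape: (documents, k)

# Loadings of each term on the latent variables
term_topics = Vt_k.T        # shape: (terms, k)
```

Here each row of `doc_topics` places a document in the latent space, while each row of `term_topics` shows how strongly a term loads on each latent variable. For large sparse matrices a dedicated truncated solver is usually preferable to computing the full decomposition and slicing it.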
