O'Reilly logo

Machine Learning with Spark - Second Edition by Nick Pentreath, Manpreet Singh Ghotra, Rajdeep Dua

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

Matrix factorization

Since Spark's recommendation models currently only include an implementation of Matrix factorization, we will focus our attention on this class of models. This focus is with good reason; however, these types of models have consistently been shown to perform extremely well in collaborative filtering and were among the best models in well-known competitions, such as the Netflix prize.

Matrix Factorization assumes that:

  • Each user can be described by n attributes or features. For example, feature one might be a number that says how much each user likes action movies.
  • Each item can be described by a set of n attributes or features. To connect with the preceding example, feature one for the movie might be a number that says ...

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, interactive tutorials, and more.

Start Free Trial

No credit card required