- We will use the movie rating data for the SVD analysis. The movieLens 1M dataset contains around 1 million records which consist of anonymous ratings of around 3,900 movies made by 6,000 movieLens users.
The dataset can be retrieved at: http://files.grouplens.org/datasets/movielens/ml-1m.zip
The dataset contains the following files:
- ratings.dat: Contains the user ID, movie ID, ratings, and timestamp
- movies.dat: Contains the movie ID, titles, and genres
- users.dat: Contains the user ID, genders, ages, occupations, and zip code
- We will use the ratings.dat for our SVD analysis. Sample data for the ratings.dat looks like the following: