This section will focus on explaining the fields in each of the datasets available in the MovieLens 100K dataset. Take a look at these steps:
- The datasets are all available in the zipped file, ml-latest-small.zip, where the ratings.csv dataset will serve as the pseudo-fact table of our data, since it has transactions for each movie that is rated. The dataset, ratings, has the four column names shown in the following screenshot:
- The dataset shows the rating selected by each userId over the course of their time, from the earliest rating to the latest rating. The range of a rating can vary from 0.5 to 5.0 stars, as seen by ...