This section will show you how to prepare the data to be used in recommender models. Follow these steps:
- Select the relevant data.
- Normalize the data.
Selecting the most relevant data
When we explored the data, we noticed that the table contains:
- Movies that have been viewed only a few times. Their ratings might be biased because of lack of data.
- Users who rated only a few movies. Their ratings might be biased.
We need to determine the minimum number of users per movie and vice versa. The correct solution comes from an iteration of the entire process of preparing the data, building a recommendation model, and validating it. Since we are implementing the model for the first time, we can use a rule of thumb. After having built the models, ...