O'Reilly logo

Building Recommendation Engines by Suresh Kumar Gorakala

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

Exploring the dataset

In this section let's explore the data in more detail. To find the dimensions of the data and the type of data, run the following commands:

There are 5000 users and 100 items:

dim(Jester5k) 
 
[1] 5000  100 
 

The data is of R Matrix:

class(Jester5k@data) 
 
[1] "dgCMatrix" 
attr(,"package") 
[1] "Matrix" 

Exploring the rating values

The following code snippet will help us understand the rating values distribution:

Rating distribution is given as:

hist(getRatings(Jester5k), main="Distribution of ratings") 
Exploring the rating values

The preceding image shows the frequency of the ratings available from the Jester5K dataset. We can observe that the negative ratings are more ...

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, interactive tutorials, and more.

Start Free Trial

No credit card required