So, I can quickly drop any rows that happen to be in my original ratings series using the following code:
filteredSims = simCandidates.drop(myRatings.index)
filteredSims.head(10)
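One caveat worth sketching: in recent versions of pandas, drop raises a KeyError if any label passed to it is missing from the series. That would happen here only if a movie I rated never made it into simCandidates. The toy example below uses made-up titles and scores (Movie A through Movie E are placeholders, not real results) and passes errors="ignore" so that missing labels are simply skipped. Treat it as a defensive variant under those assumptions, not as a change to the approach above.

```python
import pandas as pd

# Toy stand-ins for the series in the text; titles and scores are invented.
simCandidates = pd.Series(
    {"Movie A": 9.0, "Movie B": 7.5, "Movie C": 6.0, "Movie D": 4.2}
).sort_values(ascending=False)

# "Movie E" was rated but never appears among the candidates.
myRatings = pd.Series({"Movie A": 5.0, "Movie E": 1.0})

# errors="ignore" skips labels that are absent from simCandidates;
# with the default errors="raise", the missing "Movie E" causes a KeyError.
filteredSims = simCandidates.drop(myRatings.index, errors="ignore")
print(filteredSims.head(10))
```

In this sketch, Movie A is removed because it was already rated, Movie E is ignored because it was never a candidate, and the remaining movies keep their sorted order.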
Running that will let me see the final top 10 results:
And there we have it! Return of the Jedi (1983), Raiders of the Lost Ark (1981), and Indiana Jones and the Last Crusade (1989) are the top results for my fictitious user, and they all make sense. I'm also seeing a few family-friendly films, such as Cinderella (1950), The Wizard of Oz (1939), and Dumbo (1941), creeping in, probably because of the presence of Gone with the Wind in there, ...