May 2019
Beginner
403 pages
9h 18m
English
O nature, nature, why art thou so dishonest, as ever to send men with these false recommendations into the world!
Henry Fielding
Another common data problem is producing recommendations of some sort. Netflix recommends movies you might want to watch. Amazon recommends products you might want to buy. Twitter recommends users you might want to follow. In this chapter, we’ll look at several ways to use data to make recommendations.
In particular, we’ll look at the dataset of users_interests that we’ve used before:
users_interests=[["Hadoop","Big Data","HBase","Java","Spark","Storm","Cassandra"],["NoSQL","MongoDB","Cassandra","HBase","Postgres"],["Python","scikit-learn","scipy","numpy","statsmodels","pandas"],["R","Python","statistics","regression","probability"],["machine learning","regression","decision trees","libsvm"],["Python","R","Java","C++","Haskell","programming languages"],["statistics","probability","mathematics","theory"],["machine learning","scikit-learn","Mahout","neural networks"],["neural networks","deep learning","Big Data","artificial intelligence"],["Hadoop","Java","MapReduce","Big Data"],["statistics","R","statsmodels"],["C++","deep learning","artificial intelligence","probability"],["pandas","R","Python"],["databases","HBase","Postgres","MySQL","MongoDB"],["libsvm","regression","support vector machines"]]
And we’ll think about the problem of recommending new ...