Skip to Content
Java数据分析指南
book

Java数据分析指南

by Posts & Telecom Press, John R. Hubbard
May 2024
Intermediate to advanced
347 pages
5h 38m
Chinese
Packt Publishing
Content preview from Java数据分析指南

第9章 推荐系统

大多数在线购物者都可能熟悉亚马逊的推荐系统,如图9-1所示。

..\tu\B06398_09_01.png

图9-1 Amazon.com的推荐

当消费者浏览商品项目时,网站展示出一些销量很高的类似项目。这些项目来自访问令人惊叹的亚马逊产品、消费者和销售数据库的推荐系统。

现在,许多商品和服务的提供商运行着在线推荐系统。Netflix推荐电影,苹果推荐音乐,Audible推荐图书,Yelp推荐餐馆等。

推荐系统(recommender system)是一种算法,它的分析基于消费者之前的选择与许多其他消费者的比较,预测该消费者的产品偏好。这些算法由亚马逊和Netflix开创,现在已在网络上普及。

聚类算法提供了建立推荐系统的一种机制:推荐同一个聚类中的其他数据点。具体地说,我们可以使用k-均值算法,然后推荐聚类的均值。聚类算法和分类算法都可以类似这样使用,从而实现推荐系统。但是,本章将考察几种专门为推荐系统设计的算法。

大多数推荐系统使用的是用户项目偏好的量化输入。这些偏好通常排列在一个矩阵中,矩阵的一行表示每位用户,一列表示每个项目。这样的矩阵叫作效用矩阵(utility matrix)。例如,Netflix请求它的用户按一星到五星对电影进行评分。所以,这个效用矩阵中的每一项会是一个范围0~5的整数uij,代表了用户i对电影j的评分星数,其中0代表没有评分。

例如,表9-1展示了一个用户在1~5范围内对啤酒进行评分的效用矩阵,5表示最大的认可。空白表示没有用户对该项目评分。啤酒是:BL = ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

AirBnbBlueOriginElectronic ArtsHomeDepotNasdaqRakutenTata Consultancy Services

QuotationMarkO’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.
Julian F.
Head of Cybersecurity
QuotationMarkI wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.
Addison B.
Field Engineer
QuotationMarkI’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.
Amir M.
Data Platform Tech Lead
QuotationMarkI'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.
Mark W.
Embedded Software Engineer

You might also like

Python实用技能学习指南

Python实用技能学习指南

Posts & Telecom Press, Robert Smallshire, Austin Bingham
Python计算机视觉和自然语言处理

Python计算机视觉和自然语言处理

Posts & Telecom Press, Álvaro Morena Alberolaï, Gonzalo Molina Gallegoï, Unai Garay Maestreï
Python和NLTK实现自然语言处理

Python和NLTK实现自然语言处理

Posts & Telecom Press, Nitin Hardeniya
Python设计模式(第2版)

Python设计模式(第2版)

Posts & Telecom Press, Chetan Giridhar

Publisher Resources

ISBN: 9781836201052