利用 Python 进行数据分析:原书第 3 版

by Wes McKinney
November 2023
Intermediate to advanced
512 pages
11h 53m
Chinese
China Machine Press
Content preview from 利用 Python 进行数据分析:原书第 3 版
Data Analysis Examples
Reversing the order of the sorted result and again taking the first 10 rows gives the movies that male viewers preferred:
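The listing that followed this sentence did not survive in this preview, so here is a minimal sketch of the idea rather than the book's own code. It assumes a hypothetical `mean_ratings` DataFrame holding mean ratings per title in columns `F` and `M`, plus a `diff` column computed as `M - F`; the titles and values are made up for illustration.

```python
import pandas as pd

# Hypothetical mean ratings per title, split by gender (toy values).
mean_ratings = pd.DataFrame(
    {"F": [4.5, 3.0, 3.8, 2.5], "M": [3.5, 4.2, 3.9, 3.6]},
    index=pd.Index(["Movie A", "Movie B", "Movie C", "Movie D"], name="title"),
)

# Positive diff: rated higher by men. Negative diff: rated higher by women.
mean_ratings["diff"] = mean_ratings["M"] - mean_ratings["F"]

# Ascending sort puts the titles preferred by women first.
sorted_by_diff = mean_ratings.sort_values("diff")

# Reversing the row order puts the titles preferred by men first;
# head(10) keeps at most the first 10 rows.
preferred_by_men = sorted_by_diff[::-1].head(10)
print(preferred_by_men)
```

Slicing with `[::-1]` reuses the existing sort instead of sorting a second time with `ascending=False`.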
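Suppose instead that you only want the movies that provoked the most disagreement among viewers, regardless of gender. Disagreement can be measured by the variance or standard deviation of the ratings. To do this, first compute the standard deviation of the ratings by movie title, then filter by title: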
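The two steps can be sketched as follows. This is an illustration under assumptions, not the book's listing: `data` is a hypothetical long-format table with one row per rating, and the filter criterion (keeping only titles with at least `min_ratings` ratings) is a guess at what "filter by title" means here.

```python
import pandas as pd

# Hypothetical long-format ratings: one row per (title, rating) pair.
data = pd.DataFrame(
    {
        "title": ["Movie A"] * 4 + ["Movie B"] * 4 + ["Movie C"] * 2,
        "rating": [1, 5, 1, 5, 3, 3, 4, 3, 2, 5],
    }
)

# Step 1: sample standard deviation of the ratings, grouped by title.
rating_std_by_title = data.groupby("title")["rating"].std()

# Step 2 (assumed criterion): keep only titles with enough ratings,
# since a standard deviation over very few ratings is not informative.
min_ratings = 3  # hypothetical threshold
ratings_by_title = data.groupby("title").size()
active_titles = ratings_by_title.index[ratings_by_title >= min_ratings]
rating_std_by_title = rating_std_by_title.loc[active_titles]
print(rating_std_by_title)
```

With this toy data, "Movie C" is dropped because it has only two ratings.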
Next, sort in descending order and take the first 10 rows, which are roughly the 10 most divisive movies:
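A minimal sketch of the sorting step, assuming a hypothetical Series `rating_std_by_title` that maps each title to the standard deviation of its ratings (the values below are invented):

```python
import pandas as pd

# Hypothetical per-title rating standard deviations (toy values).
rating_std_by_title = pd.Series(
    {"Movie A": 0.9, "Movie B": 1.4, "Movie C": 0.5, "Movie D": 1.2},
    name="rating",
)

# Largest standard deviation first; head(10) keeps at most 10 rows.
most_divisive = rating_std_by_title.sort_values(ascending=False).head(10)
print(most_divisive)
```

For a Series, `rating_std_by_title.nlargest(10)` should give the same ranking in a single call.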
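You may have noticed that movie genres are given as a pipe-separated (|) string, since a single movie can belong to several genres. To group the ratings data by genre, we can use the explode method on DataFrame. Let's see how this works. First, we use the str.split method on the Series to split each genre string into a list of genres: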
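As a small illustration of the split step, assuming a hypothetical `movies` table whose pipe-separated strings live in a column named `genres` (that column name and the rows are assumptions; the `genre` column matches the name used in the text):

```python
import pandas as pd

# Hypothetical movies table; "genres" holds pipe-separated strings.
movies = pd.DataFrame(
    {
        "movie_id": [1, 2],
        "title": ["Movie A", "Movie B"],
        "genres": ["Comedy|Romance", "Drama"],
    }
)

# Split each string on the pipe character. A single-character pattern
# is treated as a literal string, not as a regular expression.
movies["genre"] = movies["genres"].str.split("|")
print(movies[["title", "genre"]])
```

Each cell of `genre` now holds a Python list, which is the shape that `explode` expects.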
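Then, calling movies.explode("genre") generates a new DataFrame with one row for each "inner" element of each movie's genre list. For example, if a movie is classified as both a comedy and a romance, the result will contain two rows for it, one with just "Comedy" and the other with just "Romance".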
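A toy example of this behavior, using a hypothetical two-row `movies` table in which the `genre` column already holds lists:

```python
import pandas as pd

# Hypothetical movies table with a list-valued "genre" column.
movies = pd.DataFrame(
    {
        "title": ["Movie A", "Movie B"],
        "genre": [["Comedy", "Romance"], ["Drama"]],
    }
)

# One output row per list element; the other columns are repeated.
movies_exploded = movies.explode("genre")
print(movies_exploded)
```

The original index labels are repeated for rows that come from the same movie, so "Movie A" appears twice under label 0. Calling `reset_index(drop=True)` on the result gives a fresh integer index if that is more convenient.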
Publisher Resources

ISBN: 9787111726722