Skip to Content
利用 Python 进行数据分析:原书第 3 版
book

利用 Python 进行数据分析:原书第 3 版

by Wes McKinney
November 2023
Intermediate to advanced
512 pages
11h 53m
Chinese
China Machine Press
Content preview from 利用 Python 进行数据分析:原书第 3 版
数据分析案例
|
421
个字母的分布发生了显著变化。为了了解具体情况,我首先将全部出生数据按照年度、
性别以及末字母进行聚合:
我选出其中具有代表性的三年,并打印前几行:
接下来,按出生总数对该表进行标准化处理,以便计算出一个新表格,其中包含每个字
母结尾的性别占总出生人数的比例:
422
|
13
有了这个字母比例数据之后,就可以生成一张各年度各性别的柱状图了(如图
13-8
):
13-8:男孩女孩名字中各个末字母的比例
可以看出,从
20
世纪
60
年代开始,以字母“
n
”结尾的男孩名字出现了显著增长。回
到之前创建的完整表格,按年度和性别对其进行标准化处理,并在男孩名字中选取几个
数据分析案例
|
423
字母的子集,最后进行转置以便将各列转换成时间序列:
有了这个时间序列的
DataFrame
之后,就可以通过其plot 方法绘制趋势图了(如
13-9
):
13-9:各年出生的男孩中,名字以 d/n/y 结尾的人数比例
变成女孩名字的男孩名字(以及相反的情况)
另一个有趣的趋势是早年流行于样本中某个性别的名字近年来“变性了” ...
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

AirBnbBlueOriginElectronic ArtsHomeDepotNasdaqRakutenTata Consultancy Services

QuotationMarkO’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.
Julian F.
Head of Cybersecurity
QuotationMarkI wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.
Addison B.
Field Engineer
QuotationMarkI’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.
Amir M.
Data Platform Tech Lead
QuotationMarkI'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.
Mark W.
Embedded Software Engineer

You might also like

Effective Python 第3版 ―Pythonプログラムを改良する125項目

Effective Python 第3版 ―Pythonプログラムを改良する125項目

Brett Slatkin, 鈴木 駿

Publisher Resources

ISBN: 9787111726722