Skip to Content
利用 Python 进行数据分析:原书第 3 版
book

利用 Python 进行数据分析:原书第 3 版

by Wes McKinney
November 2023
Intermediate to advanced
512 pages
11h 53m
Chinese
China Machine Press
Content preview from 利用 Python 进行数据分析:原书第 3 版
数据聚合与分组操作
|
315
10.3.3
示例:用指定分组的值填充缺失值
在清洗缺失数据时,有时你会用 dropna 将其删除,而有时则可能想用固定值或由数据
本身衍生出的值来填充空(
NA
)值。这时就得使用 fillna 这个工具了。在下面这个例
子中,我用平均值填充空值:
假设你需要对不同的分组填充不同的值。一种方法是将数据分组,并使用 apply 和一个
316
|
10
能够对各数据块调用 fillna 的函数。下面是一些有关美国若干州的示例数据,将其分
为东部地区和西部地区:
将其中一些值设置为缺失值:
数据聚合与分组操作
|
317
我们可以用分组平均值来填充
NA
值,如下所示:
存在另外一种情况,你可能已经在代码中预设了针对各组的填充值。由于分组都具有内
置的 name 属性,我们可以这样使用:
10.3.4
示例:随机采样和排列
假设你想要从一个大型数据集中随机抽取(进行替换或不替换)样本,以进行蒙特卡
罗模拟(
Monte Carlo simulation
)或其他工作。“抽取”的方式有很多,这里采用的是 ...
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

AirBnbBlueOriginElectronic ArtsHomeDepotNasdaqRakutenTata Consultancy Services

QuotationMarkO’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.
Julian F.
Head of Cybersecurity
QuotationMarkI wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.
Addison B.
Field Engineer
QuotationMarkI’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.
Amir M.
Data Platform Tech Lead
QuotationMarkI'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.
Mark W.
Embedded Software Engineer

You might also like

Effective Python 第3版 ―Pythonプログラムを改良する125項目

Effective Python 第3版 ―Pythonプログラムを改良する125項目

Brett Slatkin, 鈴木 駿

Publisher Resources

ISBN: 9787111726722