Skip to Content
Python数据处理
book

Python数据处理

by Jacqueline Kazil, Katharine Jarmul
July 2017
Intermediate to advanced
398 pages
11h 54m
Chinese
Posts & Telecom Press
Content preview from Python数据处理
174
9
我们要使用
Python
更加深入地探索数据。
首先我们要安装将用到的
Python
agate
http://agate.readthedocs.org
),它可以帮助我
们发现数据的一些基本特征。
agate
是一个数据分析库,由
Christopher Groskopf
https://
github.com/onyxfish
)编写。
Christopher
是一位拥有高超技术水平的数据记者和
Python
发者,而
agate
库会帮助我们了解数据。使用
pip
安装这个库:
pip install agate
这一章中的代码与
agate
1.2.0
版本兼容。因为
agate
是一个相对较新的
Python
库,所以随着库的成熟,其中的一些功能是有可能发生改变的。为确
保安装的是指定版本的库,你可以使用
pip
设置版本。对本书来说,你可以
使用:
pip install agate==1.2.0
来安装
agate
。我们同样推荐你测试最新的
版本,并随时了解书中用到的库的最新代码变化。
我们想要探索
agate
库的一些特性。为了达到这个目的,我们将会使用从
UNICEF
年报得
到的关于童工雇用的数据。
9.1.1
 导入数据
首先,来看一下我们的第一个数据集——
UNICEF
的童工汇总数据。我们下载的数据是一
Excel
文件,包含全世界的童工雇用率列表。我们可以使用学到的关于
Excel
的知识以
及从第
4
章和第
7
章学到的数据清洗技术,将原始数据转化为
agate
库所接受的格式。
在处理
Excel
表单时,我们推荐你用喜欢的
Excel
查看器打开表单文件 ...
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

AirBnbBlueOriginElectronic ArtsHomeDepotNasdaqRakutenTata Consultancy Services

QuotationMarkO’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.
Julian F.
Head of Cybersecurity
QuotationMarkI wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.
Addison B.
Field Engineer
QuotationMarkI’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.
Amir M.
Data Platform Tech Lead
QuotationMarkI'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.
Mark W.
Embedded Software Engineer

You might also like

数据科学中的实用统计学(第2版)

数据科学中的实用统计学(第2版)

Peter Bruce, Andrew Bruce, Peter Gedeck
Java持续交付

Java持续交付

Daniel Bryant, Abraham Marín-Pérez
解密金融数据

解密金融数据

Justin Pauley

Publisher Resources

ISBN: 9787115459190