Python数据处理 (Data Wrangling with Python)
by Jacqueline Kazil, Katharine Jarmul
July 2017
Intermediate to advanced
398 pages
11h 54m
Chinese
Posts & Telecom Press
Data Acquisition and Storage
6.5 Case Study: Example Data Investigation
We will briefly walk through a few different areas of interest and questions so that you have a sense of where to start.
6.5.1 Ebola Crisis
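Say you are interested in investigating the Ebola crisis in West Africa. How would you begin? You might quickly think of searching Google for "Ebola crisis data". You will find that many international organizations are working to track the spread of the virus, and that they provide many tools for you to use. First, you will find the WHO situation report. The WHO site has information on the latest cases and deaths, interactive maps showing the affected regions, and key performance indicators for response measures, all of which appear to be updated weekly. The data is available in both CSV and JSON formats, making it a reliable, regularly updated source of information.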
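When a source publishes the same data in both CSV and JSON, it can help to load both and confirm they agree before choosing one. The following is a minimal sketch using only the standard library. The column names and values are hypothetical placeholders, not real WHO figures or the actual layout of the WHO files.

```python
import csv
import io
import json

# Hypothetical sample data: the column names and values below are
# placeholders for illustration, not real WHO figures or field names.
CSV_TEXT = """country,report_date,cases,deaths
Placeland,2015-01-07,10,4
Exampleland,2015-01-07,25,9
"""

JSON_TEXT = """[
  {"country": "Placeland", "report_date": "2015-01-07", "cases": 10, "deaths": 4},
  {"country": "Exampleland", "report_date": "2015-01-07", "cases": 25, "deaths": 9}
]"""


def load_csv_records(text):
    """Parse CSV text into a list of dicts, converting the counts to int."""
    records = []
    for row in csv.DictReader(io.StringIO(text)):
        record = dict(row)
        # CSV values are always strings, so numeric columns need converting.
        record["cases"] = int(record["cases"])
        record["deaths"] = int(record["deaths"])
        records.append(record)
    return records


def load_json_records(text):
    """Parse JSON text; numbers keep their types, so no conversion is needed."""
    return json.loads(text)


csv_records = load_csv_records(CSV_TEXT)
json_records = load_json_records(JSON_TEXT)

# Both formats should describe the same records once types are aligned.
print(csv_records == json_records)
```

With a real download, you would replace the embedded strings with the contents of the files and adjust the column names to match what the source actually provides.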
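Rather than stopping at the first result, keep digging for other available resources. Searching further, we find the repository of GitHub user cmrivers (https://github.com/cmrivers/ebola), which contains a collection of raw data from numerous government and media sources. Because we know who the user is and can reach them through their contact information, we can also verify when the data was last updated and ask any questions about the collection methods. We have already learned how to handle these data formats (CSV and PDF files), so working with them should not be a problem.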
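Alongside asking the maintainer, you can get a rough idea of how current a CSV file is by looking for the most recent date it contains. This is a minimal sketch under stated assumptions: the `date` column name, the date format, and the sample rows are all hypothetical and do not reflect the actual files in the repository.

```python
import csv
import io
from datetime import datetime

# Hypothetical sample rows; sources and values are placeholders only.
SAMPLE = """date,source,value
2014-10-01,placeholder_source_a,1
2014-11-15,placeholder_source_b,2
2014-10-20,placeholder_source_a,3
"""


def latest_date(text, column="date", fmt="%Y-%m-%d"):
    """Return the most recent date found in the given column, or None."""
    reader = csv.DictReader(io.StringIO(text))
    dates = [
        datetime.strptime(row[column], fmt).date()
        for row in reader
        if row[column]  # skip rows where the date is blank
    ]
    return max(dates) if dates else None


print(latest_date(SAMPLE))
```

The latest date in the data is only a lower bound on when the file was last touched, so it complements rather than replaces checking with the person who maintains it.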
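Digging deeper, you might focus on a specific question, such as: "What precautions are being taken for safe burial?" You find a report maintained by Sam Libby (https://data.humdata.org/user/libbys) on safe and dignified burials ...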
ISBN: 9787115459190