Skip to Content
Python数据处理
book

Python数据处理

by Jacqueline Kazil, Katharine Jarmul
July 2017
Intermediate to advanced
398 pages
11h 54m
Chinese
Posts & Telecom Press
Content preview from Python数据处理
194
9
腐败感和童工雇用率有什么关系?
对于你的数据集,你会有不同的问题,但是尝试跟随我们的实例,并且找到你想要探索的
趋势。任何统计学上的离群值或者聚合趋势都可以将你引向有趣的问题去研究。
对我们的数据来说,最有趣的问题是,在非洲政府腐败感和童工雇用的关系。政府腐败,
或者政府腐败感,是否会影响社区保护童工不被雇用的能力?
根据所使用的数据集和数据探索结果,你可能会有很多感兴趣、想要探索
的问题。尝试聚焦于一个具体的问题,并用你的分析来回答它。针对多个
具体问题重复这一过程。专注会帮助你找到好的答案,保持你的分析明确
清晰。
回答这个问题需要更多的探索和更多的数据集。我们可能希望阅读更多的文章,看一下在
这个主题上有哪些研究结果。我们可能还希望访问这一领域的专家。最终,我们可能希望
选择非洲的一个特定地区或一系列国家,来更好地评估童工雇用情况。下面这一小节展示
了怎么做这件事。
9.2.1
 分离和聚焦数据
为了之后的分析,我们首先需要分离出非洲国家的数据,更加充分地探索这一子集的数
据。我们已经知道了很多使用
agate
库来过滤数据的方式,所以让我们从这里开始。下面
的代码展示了怎样把非洲的数据同其他数据分离开来:
africa_cpi_cl = cpi_and_cl.where(lambda x: x['continent'] == 'africa')
for
r
in
africa_cpi_cl.order_by('Total (%)', reverse=True).rows:
print ...
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

AirBnbBlueOriginElectronic ArtsHomeDepotNasdaqRakutenTata Consultancy Services

QuotationMarkO’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.
Julian F.
Head of Cybersecurity
QuotationMarkI wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.
Addison B.
Field Engineer
QuotationMarkI’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.
Amir M.
Data Platform Tech Lead
QuotationMarkI'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.
Mark W.
Embedded Software Engineer

You might also like

数据科学中的实用统计学(第2版)

数据科学中的实用统计学(第2版)

Peter Bruce, Andrew Bruce, Peter Gedeck
Java持续交付

Java持续交付

Daniel Bryant, Abraham Marín-Pérez
解密金融数据

解密金融数据

Justin Pauley

Publisher Resources

ISBN: 9787115459190