Skip to Content
Python数据处理
book

Python数据处理

by Jacqueline Kazil, Katharine Jarmul
July 2017
Intermediate to advanced
398 pages
11h 54m
Chinese
Posts & Telecom Press
Content preview from Python数据处理
186
9
成为空值。如果
table_b
中有值没有在
table_a
中匹配,它们会被排除在新表之外。选择
哪一张表在前面和使用哪一种方式的联结是非常重要的。
我们绝不想要空值存在。我们的问题围绕数据是怎样关联的,为了达到这个目的,我们想
要使用内联结。
agate
库的
join
方法允许传递
inner=True
参数,这会使函数仅作内联结,
只保留匹配的行,不会在联结后有空值行。
我们尝试联结童工数据和新规整后的
cpi_table
。当我们查看这两个表时,可以将它们通
过国家
/
领土的名称匹配在一起。在
cpi_table
中,我们有
Country/Territory
列,同时,在
童工数据中,我们有
Counties and areas
列。为了联结这两张表,运行下面的代码
:
cpi_and_cl = cpi_table.join(ranked, 'Country / Territory',
'Countries and areas', inner=True)
将匹配行放到新表
cpi_and_cl
中。我们可以通过打印几个值来查看这张表,同时研究新的
联结后的列,像下面代码这样:
cpi_and_cl.column_names
for
r
in
cpi_and_cl.order_by('CPI 2013 Score').limit(10).rows:
print
'{}: {} - {}%'.format(r['Country / Territory'],
r['CPI 2013 Score'], r['Total ...
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

AirBnbBlueOriginElectronic ArtsHomeDepotNasdaqRakutenTata Consultancy Services

QuotationMarkO’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.
Julian F.
Head of Cybersecurity
QuotationMarkI wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.
Addison B.
Field Engineer
QuotationMarkI’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.
Amir M.
Data Platform Tech Lead
QuotationMarkI'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.
Mark W.
Embedded Software Engineer

You might also like

数据科学中的实用统计学(第2版)

数据科学中的实用统计学(第2版)

Peter Bruce, Andrew Bruce, Peter Gedeck
Java持续交付

Java持续交付

Daniel Bryant, Abraham Marín-Pérez
解密金融数据

解密金融数据

Justin Pauley

Publisher Resources

ISBN: 9787115459190