Skip to Content
Python数据处理
book

Python数据处理

by Jacqueline Kazil, Katharine Jarmul
July 2017
Intermediate to advanced
398 pages
11h 54m
Chinese
Posts & Telecom Press
Content preview from Python数据处理
1
1
Python
简介
无论你是一名记者、分析师,还是初出茅庐的数据科学家,选择这本书可能是因为你想学
习如何用编程来分析数据,得出结论,并将结论清楚地传达给别人。你可能会用报告、图
表或归纳统计的方式来展示你的结论。重要的是,你想讲述一个故事。
传统的故事讲述或新闻报道往往使用单一的故事来描述总体结论或趋势。在这种故事中,
数据成为了相对次要的部分。然而,其他讲故事的人,比如
Christian Rudde
Datacylsm
http://dataclysm.org/
)的作者,
OkCupid
的创始人之一]认为数据本身应该是故事的重点。
首先,你需要确定想要研究的主题。你可能对研究不同人或群体的沟通习惯感兴趣,这时
你可以从一个具体的问题入手,例如在网络上被人们广为分享的信息都有哪些特点。又或
许你可能对棒球的历史统计数据感兴趣,并想弄清楚一个问题:这些数据能否表明棒球运
动随时间发生了变化。
确定了感兴趣的领域之后,你需要寻找数据,以进一步探索这一主题。想研究人类行
为,你可以从
Twitter API
https://dev.twitter.com/overview/api
)中获取数据,研究人们在
Twitter
上分享的内容。如果想深入研究棒球历史,你可以使用
Sean Lahman
的棒球数据库
http://www.seanlahman.com/baseball-archive/statistics/
)。
Twitter
和棒球数据集都属于综合的大型数据集。为了回答你的具体问题,应把这些数据集
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

AirBnbBlueOriginElectronic ArtsHomeDepotNasdaqRakutenTata Consultancy Services

QuotationMarkO’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.
Julian F.
Head of Cybersecurity
QuotationMarkI wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.
Addison B.
Field Engineer
QuotationMarkI’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.
Amir M.
Data Platform Tech Lead
QuotationMarkI'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.
Mark W.
Embedded Software Engineer

You might also like

数据科学中的实用统计学(第2版)

数据科学中的实用统计学(第2版)

Peter Bruce, Andrew Bruce, Peter Gedeck
Java持续交付

Java持续交付

Daniel Bryant, Abraham Marín-Pérez
解密金融数据

解密金融数据

Justin Pauley

Publisher Resources

ISBN: 9787115459190