Skip to Content
Python数据处理
book

Python数据处理

by Jacqueline Kazil, Katharine Jarmul
July 2017
Intermediate to advanced
398 pages
11h 54m
Chinese
Posts & Telecom Press
Content preview from Python数据处理
106
6
下面给出从数据文件中找人的一些技巧。
在文件中搜索联系人信息。
寻找署名——如果没有人名,那就寻找机构名。
在网络上搜索文档的文件名和标题。
右键单击文件,在
Windows
上选择“属性”(在
Mac
上选择“显示简介”),查看文件元数据。
去联系你能找到的每一个人。如果他不是创建文件的人,可以问他知不知道是谁创建的文
件。不要害羞——你对他们的研究课题和工作感兴趣,就是对他们的恭维,他们会很乐意
帮助你的。
与通信官打交道
如果你遇到这种情况——发布文件的机构希望你能和他们的通信代表谈一谈——这意
味着时间可能会拖得很长。还记得一个叫作打电话的游戏吗:第一个人跟另一个人说
了些什么,另一个人将所听到的内容复述给下一个人,如此这般,最后一个人的话已
经与第一个人大相径庭?
要保证有效沟通,你可以做这两件事情。第一,努力建立信任。如果没有利益冲突,
你可以分享你感兴趣的工作,并承诺会将该机构列为数据源。这表示你会间接宣传他
们的工作,该机构也会在分享资料方面受到好评。第二,请求通信代表召开电话会议
或有监督的讨论。通过电话而不是电子邮件沟通,你可以及时准确地得到问题的回答。
找到了要联系的人之后,尝试用电话联系他,或者亲自拜访。电子邮件很容易引起误会,
通常时间也会拖得比较长。下面给出几个问题的例子,可以帮你思考要问什么样的问题。
你是如何获取第
6
页到第
200
页的数据的?
是否有其他格式的数据,比如
JSON
CSV
XML
或数据库?
数据是如何采集的?
能否描述一下数据采集的方法? ...
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

AirBnbBlueOriginElectronic ArtsHomeDepotNasdaqRakutenTata Consultancy Services

QuotationMarkO’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.
Julian F.
Head of Cybersecurity
QuotationMarkI wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.
Addison B.
Field Engineer
QuotationMarkI’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.
Amir M.
Data Platform Tech Lead
QuotationMarkI'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.
Mark W.
Embedded Software Engineer

You might also like

数据科学中的实用统计学(第2版)

数据科学中的实用统计学(第2版)

Peter Bruce, Andrew Bruce, Peter Gedeck
Java持续交付

Java持续交付

Daniel Bryant, Abraham Marín-Pérez
解密金融数据

解密金融数据

Justin Pauley

Publisher Resources

ISBN: 9787115459190