Skip to Content
Visualizing Data
book

Visualizing Data

by Ben Fry
December 2007
Beginner to intermediate
382 pages
10h 29m
English
O'Reilly Media, Inc.
Content preview from Visualizing Data

Chapter 9. Acquiring Data

The first step in visualizing data is to load it into your application. Typical data sources might be a file on a disk, a stream from a network, or a digitized signal (e.g., audio or sensor readings). Unless you own the data and it’s recorded in a definable, digitizable format, things can get messy quickly. How do you process weeks of surveillance video? How does one quantitatively acquire data from an hour-long meeting that involved a verbal discussion, drawings on a whiteboard, and note taking done by individual participants?

Thus, the acquisition stage covers several tasks that sometimes get complicated:

  • Unless you are generating your own data, you have to find a good source for the data you want.

  • If you don’t own the data, you have to make sure you have the right to use it.

  • You may have to go through contortions to extract the data from a web page or other source that wasn’t set up to make it easy for your application.

  • You have to download the data, which may present difficulties if the volume is large, especially if it’s fast-changing.

I’ll show some common solutions to these problems in this chapter. Even if they don’t fit your situation, they’ll still be a starting point for finding a solution.

In some cases, you may not use a Processing program to acquire and parse your initial data set. It’s not uncommon to preprocess the data in another language, such as Perl, Python, or Ruby, and later use the (cleaned) results with Processing. Simple integration can ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

AirBnbBlueOriginElectronic ArtsHomeDepotNasdaqRakutenTata Consultancy Services

QuotationMarkO’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.
Julian F.
Head of Cybersecurity
QuotationMarkI wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.
Addison B.
Field Engineer
QuotationMarkI’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.
Amir M.
Data Platform Tech Lead
QuotationMarkI'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.
Mark W.
Embedded Software Engineer

You might also like

Visualizing Graph Data

Visualizing Graph Data

Corey Lanum
Hands-On Data Visualization

Hands-On Data Visualization

Jack Dougherty, Ilya Ilyankou
Designing Data Visualizations

Designing Data Visualizations

Noah Iliinsky, Julie Steele

Publisher Resources

ISBN: 9780596514556Errata Page