Data Mashups in R

by Xiao-Yi Li, Jeremy Leipzig
June 2009
Beginner to intermediate
29 pages
42m
English
O'Reilly Media, Inc.

Taking Shape

Finding a Usable Map

To display a map of Philadelphia with our foreclosures, we need a polygon of the county as well as a means of plotting our lat/long coordinates onto it. Both requirements are met by the ubiquitous ESRI shapefile format. The term shapefile collectively refers to a .shp file, which contains the polygons, and related files with the same base name that store attributes, indices, and metadata (for example, .dbf for the attribute table and .shx for the index).
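Because a shapefile is really a group of files sharing one base name, it can help to check that the pieces are all present before reading it. The following is a minimal sketch in base R; the base name "tracts" and the empty stand-in files are assumptions made for illustration, not files from the book.

```r
# List the files that make up one shapefile: same base name, different extensions.
# "tracts" is a hypothetical base name; the files created here are empty stand-ins.
shapefile_parts <- function(dir, base) {
  files <- list.files(dir, full.names = FALSE)
  files[tools::file_path_sans_ext(files) == base]
}

demo_dir <- file.path(tempdir(), "shapefile_demo")
dir.create(demo_dir, showWarnings = FALSE)
invisible(file.create(file.path(
  demo_dir, c("tracts.shp", "tracts.shx", "tracts.dbf", "notes.txt")
)))

# Only the three "tracts.*" files are returned; notes.txt is ignored.
parts <- shapefile_parts(demo_dir, "tracts")
print(parts)
```

If the .dbf or .shx companion is missing, most readers will either fail or return polygons without attributes, so a check like this is a cheap first step.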

Googling “philadelphia shapefile” returns several promising results, including this page: http://www.temple.edu/ssdl/Shape_files.htm.

“Philadelphia Tracts” seems useful because it includes US Census tract information, and we can use the tract IDs to link to other census data. Tracts are standardized to contain roughly 1,500 to 8,000 people, so tracts in densely populated areas tend to cover less ground. This particular shapefile is especially appealing because its map “projection” uses the same WGS84 lat/long coordinate system as our address geocoding service, as opposed to a “state plane coordinate system,” which can be difficult to transform (transformations require the rgdal package and the gdal executables).
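Linking by tract ID amounts to a join on a shared key column. The sketch below uses base R's merge(); the column names, tract IDs, counts, and populations are all invented toy values, not data from the shapefile or the Census.

```r
# Toy data only: tract IDs, counts, and populations are invented for illustration.
foreclosures <- data.frame(
  tract_id = c("000100", "000200", "000300"),
  n_foreclosures = c(4L, 0L, 7L),
  stringsAsFactors = FALSE
)
census <- data.frame(
  tract_id = c("000100", "000200", "000400"),
  population = c(2500L, 4100L, 3300L),
  stringsAsFactors = FALSE
)

# Left join: keep every tract from the first table, even without a census match.
linked <- merge(foreclosures, census, by = "tract_id", all.x = TRUE)

# Normalize by population so large and small tracts are comparable.
linked$per_1000 <- 1000 * linked$n_foreclosures / linked$population
print(linked)
```

Keeping the IDs as character strings rather than numbers preserves leading zeros, which matters when the same key has to match across sources.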

Save the “Philadelphia Tracts” archive from the page above and unzip it to your project directory. (For background on WGS84, see http://en.wikipedia.org/wiki/World_Geodetic_System.)
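The download and unzip step can also be done from within R. This is a rough sketch using download.file() and unzip() from base R; the helper name fetch_and_unzip is hypothetical, and the archive URL is deliberately left as an argument because the exact link is not given here.

```r
# Hypothetical helper: download a zip archive and extract it into dest_dir.
# zip_url must be supplied by the caller; no real URL is assumed here.
fetch_and_unzip <- function(zip_url, dest_dir = ".") {
  stopifnot(is.character(zip_url), length(zip_url) == 1)
  zip_path <- file.path(dest_dir, basename(zip_url))
  # mode = "wb" avoids corrupting binary files on Windows.
  download.file(zip_url, destfile = zip_path, mode = "wb")
  # unzip() returns the paths of the extracted files.
  unzip(zip_path, exdir = dest_dir)
}

# Usage, once the archive link has been copied from the download page:
# extracted <- fetch_and_unzip("<archive URL>", dest_dir = ".")
```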

PBSmapping

PBSmapping is a popular R package that offers several means of interacting with spatial data. It relies on some base functions from the maptools package to read ESRI shapefiles, so we need both packages.

  > install.packages(c("maptools","PBSmapping"))

As with other packages ...
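Once the packages are installed, reading and plotting a shapefile with PBSmapping might look roughly like the sketch below. The base name "tracts", the helper make_events, and the two sample coordinates are assumptions for illustration; the PBSmapping calls run only if the package and the file are actually present.

```r
# Build a PBSmapping-style EventData frame (columns EID, X, Y) from
# longitude/latitude vectors. The coordinates used below are made-up examples.
make_events <- function(lon, lat) {
  stopifnot(length(lon) == length(lat))
  data.frame(EID = seq_along(lon), X = lon, Y = lat)
}

events <- make_events(lon = c(-75.16, -75.20), lat = c(39.95, 40.00))

shp_base <- "tracts"  # hypothetical shapefile base name (tracts.shp, tracts.dbf, ...)
if (requireNamespace("PBSmapping", quietly = TRUE) &&
    file.exists(paste0(shp_base, ".shp"))) {
  # Read the polygons and the attribute table; "LL" marks lat/long coordinates.
  polys <- PBSmapping::importShapefile(shp_base, readDBF = TRUE, projection = "LL")
  PBSmapping::plotPolys(polys, xlab = "Longitude", ylab = "Latitude")
  # Overlay the points on the tract polygons.
  PBSmapping::addPoints(PBSmapping::as.EventData(events, projection = "LL"),
                        col = "red", cex = 0.5)
}
```

Note that PBSmapping expects longitude in X and latitude in Y, which is the reverse of the usual "lat/long" speaking order.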


ISBN: 9780596805302