How do you use R to import, manage, visualize, and analyze real-world data? With this short, hands-on tutorial, you learn how to collect online data, massage it into a reasonable form, and work with it using R facilities to interact with web servers, parse HTML and XML, and more. Rather than use canned sample data, you'll plot and analyze current home foreclosure auctions in Philadelphia.
This practical mashup exercise shows you how to access spatial data in several formats locally and over the Web to produce a map of home foreclosures. It's an excellent way to explore how the R environment works with R packages and performs statistical analysis.
- Parse messy data from public foreclosure auction postings
- Plot the data using R's PBSmapping package
- Import US Census data to add context to foreclosure data
- Use R's lattice and latticeExtra packages for data visualization
- Create multidimensional correlation graphs with the pairs() scatterplot matrix package
Table of Contents
1. Mapping Foreclosures
- Messy Address Parsing
- Shaking the XML Tree
- The Many Ways to Philly (Latitude)
- Exceptional Circumstances
- Taking Shape
- Developing the Plot
- Turning Up the Heat
- 2. Statistics of Foreclosure
- A. Getting Started
- About the Authors
- Title: Data Mashups in R
- Release date: March 2011
- Publisher(s): O'Reilly Media, Inc.
- ISBN: 9781449303532