5.3 Reading Data from the Internet

Earlier, we used statistical functions on a week’s worth of data about earthquakes from the U.S. Geological Survey (USGS) website (usgs.gov). One advantage of using such a small set of values was that we could easily demonstrate the concepts of mean, median, mode, and standard deviation. In particular, we used summary data as our data set: the number of earthquakes per day for a seven-day period. The real power of statistics and other methods of analysis, however, can be realized on larger raw (not summarized) collections of data.

Internet sites often provide large sets of data that can be downloaded for analysis. These data sets are usually made available in several formats: text files in CSV (comma-separated ...

Get Python Programming in Context, 4th Edition now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.