Resources and Tools
Though we've provided contextual links and resources throughout the book, there were a few that didn't fit properly in the chapters, but are still important go-to resources and part of our daily workflows. We've compiled them—along with a “best of the best” of links from selected chapters—into an organized and annotated list for quick reference.
- OpenRefine (http://openrefine.org/)—An open source, locally installed, cross-platform toolkit that makes it extremely easy to import, explore, clean, transform, and enrich messy data into something usable for analysis.
- WebPlotDigitizer (http://arohatgi.info/WebPlotDigitizer/app/)—This online tool makes it possible to quickly “reverse engineer” charts and graphs that have no associated open data files.
- Google CRUSH Tools (https://code.google.com/p/crush-tools/)—A command-line processing engine and data transformation tool that makes it possible to work efficiently with large data sets from a shell prompt.
- csvkit (https://github.com/onyxfish/csvkit)—A suite of open source Python utilities that are similar to the CRUSH tools, but usable from both the command line and from within ...