Book description
In this insightful book, you'll learn from the best data practitioners in the field just how wide-ranging -- and beautiful -- working with data can be. Join 39 contributors as they explain how they developed simple and elegant solutions on projects ranging from the Mars lander to a Radiohead video. With Beautiful Data, you will:
Explore the opportunities and challenges involved in working with the vast number of datasets made available by the Web
Learn how to visualize trends in urban crime, using maps and data mashups
Discover the challenges of designing a data processing system that works within the constraints of space travel
Learn how crowdsourcing and transparency have combined to advance the state of drug research
Understand how new data can automatically trigger alerts when it matches or overlaps pre-existing data
Learn about the massive infrastructure required to create, capture, and process DNA data
That's only small sample of what you'll find in Beautiful Data. For anyone who handles data, this is a truly fascinating book. Contributors include:
Nathan Yau
Jonathan Follett and Matt Holm
J.M. Hughes
Raghu Ramakrishnan, Brian Cooper, and Utkarsh Srivastava
Jeff Hammerbacher
Jason Dykes and Jo Wood
Jeff Jonas and Lisa Sokol
Jud Valeski
Alon Halevy and Jayant Madhavan
Aaron Koblin with Valdean Klump
Michal Migurski
Jeff Heer
Coco Krumme
Peter Norvig
Matt Wood and Ben Blackburne
Jean-Claude Bradley, Rajarshi Guha, Andrew Lang, Pierre Lindenbaum, Cameron Neylon, Antony Williams, and Egon Willighagen
Lukas Biewald and Brendan O'Connor
Hadley Wickham, Deborah Swayne, and David Poole
Andrew Gelman, Jonathan P. Kastellec, and Yair Ghitza
Toby Segaran
Table of contents
-
Beautiful Data
- SPECIAL OFFER: Upgrade this ebook with O’Reilly
- A Note Regarding Supplemental Files
- Preface
- 1. Seeing Your Life in Data
- 2. The Beautiful People: Keeping Users in Mind When Designing Data Collection Methods
- 3. Embedded Image Data Processing on Mars
- 4. Cloud Storage Design in a PNUTShell
-
5. Information Platforms and the Rise of the Data Scientist
- Libraries and Brains
- Facebook Becomes Self-Aware
- A Business Intelligence System
- The Death and Rebirth of a Data Warehouse
- Beyond the Data Warehouse
- The Cheetah and the Elephant
- The Unreasonable Effectiveness of Data
- New Tools and Applied Research
- MAD Skills and Cosmos
- Information Platforms As Dataspaces
- The Data Scientist
- Conclusion
- 6. The Geographic Beauty of a Photographic Archive
-
7. Data Finds Data
- Introduction
- The Benefits of Just-in-Time Discovery
- Corruption at the Roulette Wheel
- Enterprise Discoverability
- Federated Search Ain't All That
- Directories: Priceless
- Relevance: What Matters and to Whom?
-
Components and Special Considerations
- The Existence of, and Availability of, Observations
- The Ability to Extract and Classify Features from the Observations
- The Ability to Efficiently Discover Related Historical Context
- The Ability to Make Assertions (Same or Related) About New Observations
- The Ability to Recognize When New Observations Reverse Earlier Assertions
- The Ability to Accumulate and Persist This Asserted Context
- The Ability to Recognize the Formation of Relevance/Insight
- The Ability to Notify the Appropriate Entity of Such Insight
- Privacy Considerations
- Conclusion
- 8. Portable Data in Real Time
- 9. Surfacing the Deep Web
- 10. Building Radiohead's House of Cards
- 11. Visualizing Urban Data
- 12. The Design of Sense.us
-
13. What Data Doesn't Do
-
When Doesn't Data Drive?
- 1. More Data Isn't Always Better
- 2. More Data Isn't Always Easy
- 3. Data Alone Doesn't Explain
- 4. Data Isn't Good for a Single Answer
- 5. Data Doesn't Predict
- 6. Probability Isn't Intuitive
- 7. Probabilities Aren't Intuitive
- 8. The Real World Doesn't Create Random Variables
- 9. Data Doesn't Stand Alone
- 10. Data Isn't Free from the Eye of the Beholder
- Conclusion
- References
-
When Doesn't Data Drive?
- 14. Natural Language Corpus Data
- 15. Life in Data: The Story of DNA
- 16. Beautifying Data in the Real World
- 17. Superficial Data Analysis: Exploring Millions of Social Stereotypes
- 18. Bay Area Blues: The Effect of the Housing Crisis
- 19. Beautiful Political Data
- 20. Connecting Data
- A. Contributors
- Index
- About the Authors
- COLOPHON
- SPECIAL OFFER: Upgrade this ebook with O’Reilly
Product information
- Title: Beautiful Data
- Author(s):
- Release date: July 2009
- Publisher(s): O'Reilly Media, Inc.
- ISBN: 9780596157111
You might also like
book
Beautiful Visualization
Visualization is the graphic presentation of data -- portrayals meant to reveal complex information at a …
book
Hands-On Data Visualization
Tell your story and show it with data, using free and easy-to-learn tools on the web. …
book
Semantic Modeling for Data
What value does semantic data modeling offer? As an information architect or data science professional, let’s …
book
Natural Language Processing in Action
Natural Language Processing in Action is your guide to creating machines that understand human language using …