Chapter 3

Working with Real Data

IN THIS CHAPTER

Bullet Manipulating data streams

Bullet Working with flat and unstructured files

Bullet Interacting with relational databases

Bullet Using NoSQL as a data source

Bullet Interacting with web-based data

Data is the new oil.

— CLIVE HUMBY

Data science applications require data by definition. It would be nice if you could simply go to a data store somewhere, purchase the data you need in an easy-open package, and then write an application to access that data. However, data is messy. It appears in all sorts of places, in many different forms, and you can interpret it in many different ways. Every organization has a different method of viewing data and stores it in a different manner as well. Even when the data management system used by one company is the same as the data management system used by another company, the chances are slim that the data will appear in the same format or even use the same data types. In short, before you can do any data science work, you must discover ...

Get Coding All-in-One For Dummies, 2nd Edition now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.