© David Paper 2018
David Paper, Data Science Fundamentals for Python and MongoDB, https://doi.org/10.1007/978-1-4842-3597-3_5

5. Working with Data

David Paper
Apt 3, Logan, Utah, USA

Working with data covers the earliest stages of data science problem solving. The first step is to identify the problem, which determines everything else that needs to be done. The second step is to gather data. The third step is to wrangle (munge) the data, which is critical: wrangling means getting data into a form that is useful for machine learning and other data science tasks, and wrangled data will usually need to be cleaned as well. The fourth step is to visualize the data, which helps you get to know it and, ideally, identify patterns.
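As a rough illustration of the gather, wrangle, and visualize steps, here is a minimal sketch using only the Python standard library. The sample readings in `raw` and the helper names `wrangle` and `text_histogram` are hypothetical, invented for this example; they are not taken from the chapter, and a real project would likely use richer tools for cleaning and plotting.

```python
from collections import Counter

# Gather: hypothetical raw readings, as they might arrive from a file or API.
raw = ["3", " 4", "n/a", "3", "5 ", "", "4", "3"]


def wrangle(values):
    """Wrangle and clean: convert raw strings to integers, skipping bad entries."""
    cleaned = []
    for value in values:
        try:
            cleaned.append(int(value.strip()))
        except ValueError:
            # Missing or malformed entries (such as "" or "n/a") are dropped.
            continue
    return cleaned


def text_histogram(values):
    """Visualize: build one line of asterisks per distinct value, in sorted order."""
    counts = Counter(values)
    return [f"{value}: {'*' * counts[value]}" for value in sorted(counts)]


data = wrangle(raw)
lines = text_histogram(data)
print("\n".join(lines))
```

Dropping unparseable entries is only one possible cleaning policy; depending on the problem identified in the first step, it may be more appropriate to flag or impute them instead.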

One-Dimensional Data ...
