© David Paper 2018
David PaperData Science Fundamentals for Python and MongoDBhttps://doi.org/10.1007/978-1-4842-3597-3_6

6. Exploring Data

David Paper1 
(1)
Apt 3, Logan, Utah, USA
 

Exploring probes deeper into the realm of data. An important topic in data science is dimensionality reduction. This chapter borrows munged data from Chapter 5 to demonstrate how this works. Another topic is speed simulation. When working with large datasets, speed is of great importance. Big data is explored with a popular dataset used by academics and industry. Finally, Twitter and Web scraping are two important data sources for exploration.

Heat Maps

Heat maps were introduced in Chapter 5, but one wasn’t created for the munged dataset. So, we start by creating a Heat ...

Get Data Science Fundamentals for Python and MongoDB now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.