November 2017
Beginner to intermediate
204 pages
5h 23m
English
To demonstrate working with large datasets, I've created an artificial dataset, fake_weather_data.csv, containing fake weather data since 1980. The dataset is indeed quite large (4 GB), and importing the data into a database will take up even more memory. If you are limited by disk space or do not want to wait for the data to be processed, you may alternatively use a smaller version of the dataset called fake_weather_data_small.csv along with the demonstrations. If you use the smaller dataset, be sure to change the filename in the demonstrations accordingly.
The data for this chapter is available in the data folder of the external resources.
Read now
Unlock full access