2

Organizing Data with Datasets

In his story The Adventure of the Copper Beeches, Arthur Conan Doyle has Sherlock Holmes shout “Data! Data! Data! I cannot make bricks without clay.” This mindset, which served the most famous detective in literature so well, should be adopted by every data scientist. For that reason, we begin the more technical part of this book with a chapter dedicated to data: specifically, in the Kaggle context, leveraging the power of the Kaggle Datasets functionality for our purposes.

In this chapter, we will cover the following topics:

  • Setting up a dataset
  • Gathering the data
  • Working with datasets
  • Using Kaggle Datasets in Google Colab
  • Legal caveats

Setting up a dataset

In principle, any data you can use you can upload ...

Get The Kaggle Book now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.