While the specifics of the different data collection techniques are outside of the scope of this chapter, we can still briefly consider some of them:
- Downloading from an external source: This is the case for our example dataset since I downloaded it from Kaggle. When using a dataset downloaded from the internet, we should always make sure to check its copyright license. Most of the time, if it is in the public domain, we can freely use and distribute it without any worry. The example dataset we are using is an instance of this. On the other hand, if the dataset is copyrighted, you might still be able to use it by asking for permission from the author/owner of the dataset. I have personally found that, after reaching out to ...