Acquiring data from the Web – web scraping tasks
Given the advances in the Internet of Things (IoT) and the progress of cloud computing, we can quietly affirm that in future, a huge part of our data will be available through the Internet, which on the other hand doesn't mean it will be public.
It is, therefore, crucial to know how to take that data from the Web and load it into your analytical environment.
You can find data on the Web either in the form of data statically stored on websites (that is, tables on Wikipedia or similar websites) or in the form of data stored on the cloud, which is accessible via APIs.
For API recipes, we will go through all the steps you need to get data statically exposed on websites in the form of tabular and nontabular ...