In this section, we will use Hive to analyze the import prices of oil in countries across the world from 1980-2016. The data is available from the site of the OECD (Organisation for Economic Co-operation and Development) at the URL shown in the following screenshot:
The actual CSV file is available at https://stats.oecd.org/sdmx-json/data/DP_LIVE/.OILIMPPRICE.../OECD?contentType=csv&detail=code&separator=comma&csv-lang=en.
Since we'll be loading the data in Hive, it makes sense to download the file into our home directory via the terminal in our Cloudera Quickstart CDH environment. The steps ...