Databricks contains a simple user interface which allows you to load a file to the Databricks HDFS filesystem. Alternatively, you can load the file directly to Amazon Web Services (AWS) and read the file directly from the Databricks API.
- Switch to the Databricks application, select Tables, and then Data Import. Note that in some of the versions of Databricks this is embedded under the Data menu: Select "Tables", and then click the +.
- You may be prompted to create a new cluster. If so, first follow the instructions for creating a new cluster as we did in Chapter 9.
- Drag the file named 2015_sqf_csv from the PracticalPredictiveAnalytics directory to the data import box, as shown next:
- Click on Preview ...