O'Reilly logo

Practical Predictive Analytics by Ralph Winters

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

Importing the CSV file to databricks

Databricks contains a simple user interface which allows you to load a file to the Databricks HDFS filesystem. Alternatively, you can load the file directly to Amazon Web Services (AWS) and read the file directly from the Databricks API.

  1. Switch to the Databricks application, select Tables, and then Data Import. Note that in some of the versions of Databricks this is embedded under the Data menu: Select "Tables", and then click the +.
  2. You may be prompted to create a new cluster. If so, first follow the instructions for creating a new cluster as we did in Chapter 9.
  3. Drag the file named 2015_sqf_csv from the PracticalPredictiveAnalytics directory to the data import box, as shown next:
  1. Click on Preview ...

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, interactive tutorials, and more.

Start Free Trial

No credit card required