July 2018
Intermediate to advanced
334 pages
8h 20m
English
The first step in loading the iris CSV file is to invoke the read method on spark. The read method returns a DataFrameReader, which can be used to read our dataset:
val dfReader1 = spark.read
dfReader1: org.apache.spark.sql.DataFrameReader = org.apache.spark.sql.DataFrameReader@6980d3b3
dfReader1 is of type org.apache.spark.sql.DataFrameReader. Calling its format method with the CSV format-specifier string com.databricks.spark.csv returns a DataFrameReader again:
val dfReader2 = dfReader1.format("com.databricks.spark.csv")
dfReader2: org.apache.spark.sql.DataFrameReader = org.apache.spark.sql.DataFrameReader@6980d3b3
After all, iris.csv is a CSV file.
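The two shell steps above can be put together in a standalone program. What follows is a minimal sketch, not the book's own listing: it assumes Spark 2.x or later on the classpath, and the object name ReaderSketch, the helper csvReader, the application name, and the local master setting are all hypothetical choices made for illustration. In spark-shell the spark value already exists; here it has to be built explicitly.

```scala
import org.apache.spark.sql.{DataFrameReader, SparkSession}

object ReaderSketch {
  // Hypothetical helper: mirrors the two shell steps and returns the configured reader.
  def csvReader(spark: SparkSession): DataFrameReader = {
    val dfReader1: DataFrameReader = spark.read
    // format only records the source name on the reader; no file is read at this point.
    val dfReader2: DataFrameReader = dfReader1.format("com.databricks.spark.csv")
    dfReader2
  }

  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .appName("reader-sketch") // hypothetical application name
      .master("local[*]")       // assumption: run locally on all available cores
      .getOrCreate()

    val reader = csvReader(spark)
    // Print the runtime class of the value returned by format.
    println(reader.getClass.getName)

    spark.stop()
  }
}
```

Because format returns a DataFrameReader, further configuration calls can be chained onto it before the dataset is actually loaded.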
Needless to say, dfReader1 and ...