July 2018
Intermediate to advanced
334 pages
8h 20m
English
Run the invoke on the dataframe describe method and pass to it column names:
val dFrame2 = dFrame1.describe("Id","SepalLengthCm","SepalWidthCm","PetalLengthCm","PetalWidthCm","Species")dFrame2: org.apache.spark.sql.DataFrame = [summary: string, Id: string ... 5 more fields]
The invoke on the describe method of DataFrame dfReader results in a transformed DataFrame that we call dFrame2. On dFrame2, we invoke the show method to return a table of statistical results. This completes the first phase of a basic yet important EDA:
val dFrame2Display= = dfReader2.show
The results of the statistical analysis are shown in the following screenshot:
We did all that extra work simply to demonstrate ...
Read now
Unlock full access