July 2018
Intermediate to advanced
334 pages
8h 20m
English
Invoking the describe method on this DataFrame should cause Spark to perform a basic statistical analysis on each column of DataFrame:
dFrame1.describe("Id","SepalLengthCm","SepalWidthCm","PetalLengthCm","PetalWidthCm","Species")WARN Utils: Truncated the string representation of a plan since it was too large. This behavior can be adjusted by setting 'spark.debug.maxToStringFields' in SparkEnv.conf.res16: org.apache.spark.sql.DataFrame = [summary: string, Id: string ... 5 more fields]
Lets fix the WARN.Utils issue described in the preceding code block. The fix is to locate the file spark-defaults-template.sh under SPARK_HOME/conf and save it as spark-defaults.sh.
At the bottom of this file, add an entry for spark.debug.maxToStringFields ...
Read now
Unlock full access