January 2018
Intermediate to advanced
470 pages
11h 9m
English
Once we have rowRDD and the header, the next task is to construct our schema DataFrame from the variants, using the header as the schema and rowRDD as the rows:
val sqlContext = spark.sqlContext
val schemaDF = sqlContext.createDataFrame(rowRDD, header)
schemaDF.printSchema()
schemaDF.show(10)
>>>

In the preceding DataFrame, only a few columns, including the label, are shown so that it fits on the page.
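As a rough, self-contained sketch of the same step, the following builds a small DataFrame from an RDD of rows and a StructType header. The column names (Region, rs1, rs2), the sample values, the local master setting, and the SchemaDFSketch object are illustrative assumptions, not taken from the variant data used in this chapter; only the createDataFrame(rowRDD, header) call mirrors the code above.

import org.apache.spark.sql.{DataFrame, Row, SparkSession}
import org.apache.spark.sql.types.{StringType, StructField, StructType}

object SchemaDFSketch {
  // Builds a tiny DataFrame from an RDD[Row] and a StructType header.
  def build(spark: SparkSession): DataFrame = {
    // Hypothetical column names; the real header is derived from the variants.
    val columns = Seq("Region", "rs1", "rs2")
    val header = StructType(
      columns.map(name => StructField(name, StringType, nullable = true)))

    // Hypothetical rows; each Row must have one value per field in the header.
    val rowRDD = spark.sparkContext.parallelize(Seq(
      Row("EUR", "0", "1"),
      Row("AFR", "1", "2")
    ))

    // Same call as in the text, here made on the SparkSession directly.
    spark.createDataFrame(rowRDD, header)
  }

  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .appName("SchemaDFSketch")
      .master("local[*]") // assumption: local run for illustration
      .getOrCreate()

    val schemaDF = build(spark)
    schemaDF.printSchema()
    schemaDF.show(10)

    spark.stop()
  }
}

Calling createDataFrame on the SparkSession is equivalent to going through spark.sqlContext, as the code above does. The schema is applied to the rows as given, so the number and types of the values in each Row should match the fields in the header.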