February 2017
Intermediate to advanced
274 pages
5h 58m
English
Now that you have created the swimmersJSON DataFrame, we will be able to run the DataFrame API, as well as SQL queries against it. Let's start with a simple query showing all the rows within the DataFrame.
To do this using the DataFrame API, you can use the show(<n>) method, which prints the first n rows to the console:
Running the.show() method will default to present the first 10 rows.
# DataFrame API swimmersJSON.show()
This gives the following output:

If you prefer writing SQL statements, you can write the following query:
spark.sql("select * from swimmersJSON").collect()This will give the following ...
Read now
Unlock full access