January 2019
Beginner to intermediate
154 pages
4h 31m
English
Another alternative to GraphX is GraphFrames. Because GraphFrames is built on DataFrames, it inherits all the benefits of the structured APIs. GraphFrames is an external package that can be used with both Scala and Python, so you need to load it when you start your Spark application. You can download the package from https://spark-packages.org/package/graphframes/graphframes. The following example shows how to include it when starting a pyspark shell:
> $SPARK_HOME/bin/pyspark --packages graphframes:graphframes:0.6.0-spark2.3-s_2.11
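The argument to --packages follows the usual group:artifact:version pattern, where the version part also names the Spark and Scala builds the package was compiled for. As a minimal sketch, the same coordinate can be requested from a standalone Python script through the spark.jars.packages setting. The helper names graphframes_coordinate and build_session, and the application name, are hypothetical and chosen only for illustration; the version numbers are the ones used in the command above and should match your own Spark installation.

```python
def graphframes_coordinate(version="0.6.0", spark="2.3", scala="2.11"):
    """Build the coordinate string that --packages expects.

    The defaults mirror the command shown above; adjust them to the
    Spark and Scala versions you actually run.
    """
    return f"graphframes:graphframes:{version}-spark{spark}-s_{scala}"


def build_session(app_name="graphframes-demo"):
    """Start a SparkSession that asks Spark to fetch GraphFrames.

    pyspark is imported lazily so that graphframes_coordinate() can be
    used on a machine without Spark installed. The setting only takes
    effect if no Spark session is already running in this process.
    """
    from pyspark.sql import SparkSession

    return (
        SparkSession.builder
        .appName(app_name)
        .config("spark.jars.packages", graphframes_coordinate())
        .getOrCreate()
    )
```

Inside a pyspark shell the session already exists, so the command-line flag remains the simpler route there; the configuration approach is mainly useful for scripts launched with plain python.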
We shall use the same data that we used in the GraphX section; the only difference is that we'll use the Python API this time. The procedure to create a graph in GraphFrames is similar ...
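As a rough sketch of what graph creation looks like in the Python API: a GraphFrame is built from two DataFrames, a vertex DataFrame with an id column and an edge DataFrame with src and dst columns. The sample rows, the name and relationship columns, and the build_graph helper below are hypothetical placeholders, not the dataset from the GraphX section.

```python
# Hypothetical sample data; the book's GraphX dataset is not reproduced here.
VERTICES = [("1", "Alice"), ("2", "Bob"), ("3", "Carol")]
EDGES = [("1", "2", "friend"), ("2", "3", "follow"), ("3", "1", "follow")]

# GraphFrames requires an "id" column for vertices and
# "src" / "dst" columns for edges; other columns are free-form attributes.
VERTEX_COLUMNS = ["id", "name"]
EDGE_COLUMNS = ["src", "dst", "relationship"]


def build_graph(spark):
    """Create a GraphFrame from the sample rows above.

    `spark` is an active SparkSession started with the graphframes
    package loaded. The import is lazy so this module can be read
    and checked without Spark installed.
    """
    from graphframes import GraphFrame

    vertices = spark.createDataFrame(VERTICES, VERTEX_COLUMNS)
    edges = spark.createDataFrame(EDGES, EDGE_COLUMNS)
    return GraphFrame(vertices, edges)
```

In a pyspark shell started with the --packages flag, calling g = build_graph(spark) and then g.vertices.show() or g.inDegrees.show() should display the vertex table and the per-vertex in-degree counts.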