Comparison between GraphFrames and GraphX

A quick comparison between GraphX and GraphFrames is worthwhile because it shows where GraphFrames is headed. Joseph Bradley, a software engineer at Databricks, gave a brilliant talk on GraphFrames and the differences between the two APIs. The talk is available at http://bit.ly/2hBrDwH. Here is a summary of the comparison:

|                         | GraphFrames                        | GraphX               |
|-------------------------|------------------------------------|----------------------|
| Core APIs               | Scala, Java, Python                | Scala only           |
| Programming abstraction | DataFrames                         | RDDs                 |
| Use cases               | Algorithms, queries, motif finding | Algorithms           |
| Vertex IDs              | Any type (in Catalyst)             | Long                 |
| Vertex/edge attributes  | Any number of DataFrame columns    | Any type (VD, ED)    |
| Return types            | GraphFrames/DataFrames             | Graph[VD, ED] or ... |
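The vertex ID difference has a practical consequence: data keyed by strings (or any other non-Long type) has to be re-indexed before a GraphX-style API can use it, whereas GraphFrames can keep the original keys. The following is a minimal plain-Python sketch of that re-indexing step. It does not use Spark, GraphX, or GraphFrames; the sample edges and the helper name `index_vertices` are hypothetical and exist only for illustration.

```python
# Plain-Python illustration only: no Spark, GraphX, or GraphFrames calls are used.
# The helper name and the sample data are hypothetical.

def index_vertices(edges):
    """Assign an integer ID to each distinct vertex name, in first-seen order."""
    ids = {}
    for src, dst in edges:
        for name in (src, dst):
            if name not in ids:
                ids[name] = len(ids)
    return ids

# Edges keyed by string IDs: the form an "any type" vertex ID allows directly.
edges = [("alice", "bob"), ("bob", "carol"), ("carol", "alice")]

# Build the name -> integer mapping that a Long-only vertex ID would require.
ids = index_vertices(edges)

# Rewrite each edge using the integer IDs instead of the original names.
long_edges = [(ids[src], ids[dst]) for src, dst in edges]

print(ids)
print(long_edges)
```

With Long-only IDs, a mapping like `ids` also has to be kept around so that results can be translated back to the original names, which is extra bookkeeping that arbitrary-typed vertex IDs avoid.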
