January 2019
Beginner to intermediate
154 pages
4h 31m
English
Spark provides a low-level graph API, GraphX, which is built on top of RDDs. Although you could write your own package for graph processing, GraphX already comes with a number of useful optimizations. One of the key challenges in graph analysis is data movement. GraphX tackles this challenge by providing features such as the following:
In GraphX, you store vertex and edge information in RDDs, which makes a graph easy to define. All you need are two RDDs: one representing the vertices and one representing the edges. The following example shows ...
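As a minimal sketch of building a graph from two RDDs, the snippet below assumes a local Spark session with the `spark-graphx` module on the classpath. The object name `GraphXSketch`, the vertex names, and the `"follows"` edge label are hypothetical illustration data, not taken from the book.

```scala
import org.apache.spark.graphx.{Edge, Graph, VertexId}
import org.apache.spark.rdd.RDD
import org.apache.spark.sql.SparkSession

object GraphXSketch {
  def main(args: Array[String]): Unit = {
    // Local session for experimentation only (assumed setup).
    val spark = SparkSession.builder()
      .appName("graphx-sketch")
      .master("local[*]")
      .getOrCreate()
    val sc = spark.sparkContext

    // Vertex RDD: pairs of (vertex id, vertex attribute).
    val vertices: RDD[(VertexId, String)] = sc.parallelize(Seq(
      (1L, "alice"),
      (2L, "bob"),
      (3L, "carol")
    ))

    // Edge RDD: Edge(source id, destination id, edge attribute).
    val edges: RDD[Edge[String]] = sc.parallelize(Seq(
      Edge(1L, 2L, "follows"),
      Edge(2L, 3L, "follows"),
      Edge(3L, 1L, "follows")
    ))

    // The graph is defined by combining the two RDDs.
    val graph: Graph[String, String] = Graph(vertices, edges)

    println(s"vertices: ${graph.vertices.count()}, edges: ${graph.edges.count()}")

    // Each triplet joins an edge with the attributes of both endpoints.
    graph.triplets.collect().foreach { t =>
      println(s"${t.srcAttr} ${t.attr} ${t.dstAttr}")
    }

    spark.stop()
  }
}
```

The type parameters of `Graph[String, String]` are the vertex attribute type and the edge attribute type, in that order. Any types can be used in their place as long as the two RDDs agree with them.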