O'Reilly logo

Fast Data Processing with Spark 2 - Third Edition by Krishna Sankar

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

The GraphX API landscape

The next task in our to-do list is to take a quick look at the GraphX APIs. For the most part, the organization is the same, except for a couple of twists. The following figure shows the organization and categories of the APIs:

The GraphX API landscape

Objects such as Edge, EdgeRDD, and others are under org.apache.spark.graphx. The graph object has APIs such as triplets, persist, subgraph and so on. But the graph algorithms are separated under ops, which is a GraphOps object, to separate the algorithms from the graph implementation. Another quirk is lib, which has analytic functions such as SVD++, ShortestPath, and others. So navigate around the ...

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, interactive tutorials, and more.

Start Free Trial

No credit card required