Chapter 2. GraphX quick start

This chapter covers

  • Finding graph data to play with
  • First steps with GraphX using the Spark Shell
  • Invoking the PageRank algorithm

The Spark Shell is the easiest way to quickly start using Spark and is a great way to explore graph datasets. No compilation is necessary, which means you can focus on running commands and seeing their output. Even though Spark Shell uses Scala as its programming language, there’s no need to worry if you haven’t used Scala before. This chapter will guide you every step of the way.

The chapter is intended to walk you through the steps of working with GraphX without delving into the details. You’ll download some sample graph data consisting of bibliographic citations. Using the Spark ...

Get Spark GraphX in Action now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.