Advanced IO

This chapter covers

  • Using Hive tables for IO
  • Accessing data through Gora

In this chapter, you continue with the more advanced features of Giraph, focusing on ways to read input data and write output data. Recall that as your input graph is usually stored in some special format on a storage system, Giraph must be instructed on how to read data from your storage system and how to convert it to its own internal representation; that is, vertices and edges using the VertexInputFormat and EdgeInputFormat implementations. For example, in Chapter 7, you described the basics of reading and writing data from the Hadoop Distributed ...

Get Practical Graph Analytics with Apache Giraph now with the O’Reilly learning platform.

O’Reilly members experience live online training, plus books, videos, and digital content from nearly 200 publishers.