Appendix I. Reference for transformations and actions

This appendix lists the transformations and actions supported by Spark in a Java context. The online documentation covers them from a Scala viewpoint.

Lab: Examples from this appendix are available on GitHub at https://github.com/jgperrin/net.jgp.books.spark.ch12 . Chapter 12 focuses on transformations (records, documents, and joins).

I.1 Transformations

Transformations are lazy: they describe how to modify the data, but Spark applies those modifications only when an action is called. Table I.1 lists the available transformations. This list is based on the Spark documentation and adapted to Java.
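The idea that a transformation does nothing until an action runs can be sketched without a Spark cluster. The example below is only an analogy: it uses plain Java streams, not Spark, where an intermediate operation such as map() plays the role of a transformation and a terminal operation such as collect() plays the role of an action. The class name LazyEvaluationSketch and the log messages are hypothetical, chosen for this illustration.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.stream.Collectors;
import java.util.stream.Stream;

// Analogy only: Java streams, not Spark. Names here are hypothetical.
class LazyEvaluationSketch {

  /** Runs a lazy pipeline and returns the events in the order they occurred. */
  static List<String> run() {
    List<String> log = new ArrayList<>();

    // "Transformation": map() only records what to do; the lambda is not run yet.
    Stream<Integer> doubled = Stream.of(1, 2, 3).map(n -> {
      log.add("map " + n);
      return n * 2;
    });
    log.add("pipeline defined");

    // "Action": collect() triggers the computation described above.
    List<Integer> result = doubled.collect(Collectors.toList());
    log.add("result " + result);
    return log;
  }

  public static void main(String[] args) {
    run().forEach(System.out::println);
  }
}
```

Running main prints "pipeline defined" before any "map" line, then "map 1", "map 2", "map 3", and finally "result [2, 4, 6]". The mapping function executes only once the terminal operation asks for the result.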

This is not an exhaustive list of transformations, but rather the primitives being used by higher-level functions: any higher-level function using one or more transformations is a ...
