November 2017
Beginner to intermediate
290 pages
7h 34m
English
Both ParDo and per key aggregation are standard patterns for parallelism that go back decades. When implementing these in a massive-scale distributed data processing engine, we can highlight a few characteristics that are particularly important.
Characteristics of ParDo:
Characteristics for per key aggregation:
Stateful ParDo is a computational pattern that combines aspects of each of these:
Read now
Unlock full access