Learning Apache Apex
by Ananth Gundabattula, Thomas Weise, Munagala V. Ramanath, David Yan, Kenneth Knowles
Join
Join here refers to a transformation of multiple unbounded datasets as input. The enrichment operator covered earlier also performs a type of join, between a stream and a static bounded data set from a lookup source. The support for stream joins as of release 3.8 of the Apex library is still evolving.
Join of streams can be accomplished using the previously introduced WindowedOperator with the merge accumulation. Such accumulation can accept two incoming streams and is most efficient when it also performs incremental aggregation (because it reduces the state that the operator needs to hold until the window is complete). For a join without simultaneous aggregation, individual input records have to be maintained in the state, making this ...
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Read now
Unlock full access