Learning Apache Apex
by Ananth Gundabattula, Thomas Weise, Munagala V. Ramanath, David Yan, Kenneth Knowles
Windowed transformations
As discussed in Chapter 1, Introduction to Apex, the streaming window in Apex roughly represents the ingression time of the data tuples within it. Accumulating state between the beginWindow() and endWindow() callbacks is sufficient if the application intends to process the data based on ingression time or processing time. That is usually not the case: applications often need to process the incoming tuples based on the timestamps of the individual events, not on when the tuples enter the system.
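To make the difference concrete, the following is a minimal stand-alone sketch in plain Java. It does not use the Apex library: the ArrivalWindowSum class only mimics the beginWindow() and endWindow() callback names, and Event, sumByArrival, sumByEventTime, and the 60-second bucket size are hypothetical names and values chosen for illustration. It sums the same four events twice, once grouped by the streaming window in which they arrived and once grouped by their own timestamps.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

public class WindowingSketch {

    /** Hypothetical event: a timestamp assigned at the source, plus a value. */
    static final class Event {
        final long eventTimeMillis;
        final long value;

        Event(long eventTimeMillis, long value) {
            this.eventTimeMillis = eventTimeMillis;
            this.value = value;
        }
    }

    /** Mimics an operator that accumulates state per streaming window (not the Apex API). */
    static final class ArrivalWindowSum {
        private long sum;
        final List<Long> emitted = new ArrayList<>();

        void beginWindow(long windowId) { sum = 0; }          // reset state for the new window
        void process(Event e)           { sum += e.value; }   // accumulate regardless of event time
        void endWindow()                { emitted.add(sum); } // emit one result per window
    }

    /** Groups by arrival: each inner list holds the events that arrived in one streaming window. */
    static List<Long> sumByArrival(List<List<Event>> windows) {
        ArrivalWindowSum op = new ArrivalWindowSum();
        long windowId = 0;
        for (List<Event> window : windows) {
            op.beginWindow(windowId++);
            for (Event e : window) {
                op.process(e);
            }
            op.endWindow();
        }
        return op.emitted;
    }

    /** Groups by event time: each event goes to the bucket that contains its own timestamp. */
    static Map<Long, Long> sumByEventTime(List<Event> events, long bucketMillis) {
        Map<Long, Long> sums = new TreeMap<>();
        for (Event e : events) {
            long bucketStart = (e.eventTimeMillis / bucketMillis) * bucketMillis;
            sums.merge(bucketStart, e.value, Long::sum);
        }
        return sums;
    }

    public static void main(String[] args) {
        // Two streaming windows; events arrive out of order with respect to event time.
        List<Event> w0 = Arrays.asList(new Event(1_000, 1), new Event(61_000, 2));
        List<Event> w1 = Arrays.asList(new Event(2_000, 4), new Event(62_000, 8));

        List<Event> all = new ArrayList<>(w0);
        all.addAll(w1);

        System.out.println(sumByArrival(Arrays.asList(w0, w1))); // [3, 12]
        System.out.println(sumByEventTime(all, 60_000));         // {0=5, 60000=10}
    }
}
```

The two groupings disagree because the events from the first and second minute are interleaved on arrival. The per-window accumulator has no way to notice this, which is why event-time processing needs state keyed by the event timestamp rather than by the streaming window.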
If the incoming data tuples arrive in order, processing the data based on event time would be straightforward. In such cases, since we know the next incoming event always has the newest timestamp, we would not ...