Balancing in Apache Beam

Apache Beam provides a way to keep balance between completeness, latency, and cost. Completeness refers here to how all events should process, latency is the time taken to execute an event and cost is the computing power required to finish the job. The following are the right questions that should be asked to build a Pipeline in Apache Beam which maintains balance between the above three parameters:

  • What results are calculated? By using the transformations available in Pipeline, the system is calculating results.
  • Where in the event time results are calculated? This is achieved by using event-time windowing. Event time windowing is further categorized into fixed, sliding, and session window.
  • When in processing time ...

Get Practical Real-time Data Processing and Analytics now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.