Introduction to streaming execution model

Flink is an open source framework for distributed stream processing that:

  • Provides results that are accurate, even in the case of out-of-order or late-arriving data
  • Is stateful and fault tolerant, and can seamlessly recover from failures while maintaining an exactly-once application state
  • Performs on a large scale, running on thousands of nodes with very good throughput and latency characteristics

The following diagram is a generalized view of stream processing:

Many of Flink's features - state management, handling out-of-order data, flexible windowing – are essential for computing accurate results ...

Get Big Data Analytics with Hadoop 3 now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.