Spark Streaming allows for windowed processing, which enables you to apply transformations over a sliding window of events. This sliding window is created over a specified interval.
Every time a window slides over a DStream, the source RDDs that fall into the window specification are combined to create a windowed DStream as shown in the following diagram. The window must have two specified parameters:
- Window length – specifies the interval length considered
- Sliding interval – the interval at which a window is created
The following ...