Spark Streaming provides windowed processing, which allows you to apply transformations over a sliding window of events. The sliding window is created over an interval specified. Every time the window slides over a source DStream, the source RDDs, which fall within the window specification, are combined and operated upon to generate the windowed DStream. There are two parameters that need to be specified for the window:
- Window length: This specifies the length in interval considered as the window
- Sliding interval: This is the interval at which the window is created
Shown in the following is an illustration shows a DStream with a sliding ...