
Chapter 2
Introduction to Data Streams
Nowadays, we are in the presence of sources of data produced continuously
at high speed. Examples include TCP/IP traffic, GPS data, mobile calls,
emails, sensor networks, customer click streams, etc. These data sources con-
tinuously generate huge amounts of data from nonstationary distributions.
Storage, maintenance, and querying data streams brought new challenges in
the database and data mining communities. The database community has de-
veloped Data Stream Management Systems (DSMS) for continuous querying,
compact data structures (sketches and summaries), and sub-linear algorithms
for massive dataset analysis. ...