Chapter 10

A Survey of Stream Clustering Algorithms

Charu C. Aggarwal

IBM T. J. Watson Research CenterYorktown Heights, NYcharu@us.ibm.com

10.1 Introduction

In recent years, advances in hardware technology have allowed us to automatically record transactions and other pieces of information of everyday life at a rapid rate. Such processes generate huge amounts of online data which grow at an unlimited rate. These kinds of online data are referred to as data streams. The issues on management and analysis of data streams have been researched extensively in recent years because of their emerging, imminent, and broad applications [1].

Many important problems such as clustering and classification have been widely studied in the data mining community. ...

Get Data Clustering now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.