Chapter 5. Algorithms for data analysis

This chapter covers

  • Querying a stream
  • Thinking about time
  • Understanding four powerful summarization techniques

Chapter 4 covered how the data flows through many stream-processing frameworks, the delivery semantics, and fault tolerance. In this chapter we’re going to depart from the architectural views and discuss the algorithmic side of stream processing, often called streaming analytics or stream mining. We will focus on the what and why of streaming analysis algorithms and occasionally dip our toes into the detailed how. Don’t worry if you’re looking for the detailed math or code behind the algorithms—ample resources will be provided so that you can continue your learning.

Before we begin, I’ll ...

Get Streaming Data now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.