Visualizing Streaming Data

by Anthony Aragues
June 2018
Beginner to intermediate
200 pages
4h 56m
English
O'Reilly Media, Inc.

Chapter 5. Processing Streaming Data for Visualization

Processing data is the most common operation mentioned in this book. There are specific considerations to bear in mind when processing streaming data to be visualized.

Batch Processing

Batch processing is the most common approach for handling high volumes of data. Batching means that data is cached somewhere and then processed at intervals. The processing interval is chosen according to the data’s significance and how quickly anyone can act on it. Processing daily batches overnight is by far the most common choice, but it falls short for significant events: by the time a person reviews the report, the event may have occurred almost 24 hours earlier. An indicator that your brand has been mimicked publicly for malicious purposes is a case where every minute counts. To deal with this, hourly batch processing is often used. Most applications do not process batches more often than hourly because of perceived limits on how quickly anyone could act on the data. Another reason not to process batches too often is that batch processing is complex, and a run may not finish before the next batch is due to begin, which causes a backlog.
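The backlog risk described above can be illustrated with a small simulation. This is only a sketch under stated assumptions: batches run one at a time, each is scheduled at a fixed interval, and a batch cannot start until the previous one has finished. The names `BatchRun` and `simulate_batches` and the example durations are hypothetical and do not come from the book.

```python
from dataclasses import dataclass


@dataclass
class BatchRun:
    """Timing of one batch, in seconds since the first scheduled batch."""
    scheduled_start: float
    actual_start: float
    finish: float

    @property
    def delay(self) -> float:
        # How long the batch waited behind earlier, still-running batches.
        return self.actual_start - self.scheduled_start


def simulate_batches(interval: float, durations: list[float]) -> list[BatchRun]:
    """Simulate sequential batches scheduled every `interval` seconds.

    Assumption (hypothetical): a batch starts at its scheduled time or when
    the previous batch finishes, whichever is later.
    """
    runs = []
    previous_finish = 0.0
    for i, duration in enumerate(durations):
        scheduled = i * interval
        start = max(scheduled, previous_finish)
        finish = start + duration
        runs.append(BatchRun(scheduled, start, finish))
        previous_finish = finish
    return runs


if __name__ == "__main__":
    hourly = 3600.0
    # Two batches in a row take 75 minutes, longer than the hourly interval.
    for n, run in enumerate(simulate_batches(hourly, [1800.0, 4500.0, 4500.0, 1200.0])):
        print(f"batch {n}: delayed {run.delay:.0f}s")
```

In this sketch the delay keeps growing for as long as batches take longer than the interval, and it only shrinks again once runs finish with time to spare.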

The process that runs at the chosen interval will query the data from where it’s stored in order to create the aggregate ...
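As a rough illustration of an interval-driven query that builds an aggregate, the sketch below counts stored events per type within one time window. It assumes, purely for demonstration, that the cached data lives in a SQLite table named `events` with `ts` and `event_type` columns; the table, the column names, and the helper functions are hypothetical and are not taken from the book.

```python
import sqlite3


def aggregate_window(conn: sqlite3.Connection, start_ts: int, end_ts: int) -> dict:
    """Count events per type where start_ts <= ts < end_ts."""
    rows = conn.execute(
        "SELECT event_type, COUNT(*) FROM events "
        "WHERE ts >= ? AND ts < ? "
        "GROUP BY event_type ORDER BY event_type",
        (start_ts, end_ts),
    ).fetchall()
    return dict(rows)


def make_demo_store() -> sqlite3.Connection:
    """Build an in-memory store with a few sample events (hypothetical data)."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE events (ts INTEGER, event_type TEXT)")
    conn.executemany(
        "INSERT INTO events VALUES (?, ?)",
        [(10, "login"), (1200, "error"), (3599, "login"),
         (3600, "login"), (5000, "error")],
    )
    return conn


if __name__ == "__main__":
    store = make_demo_store()
    # Each hourly run aggregates only the events inside its own window.
    print(aggregate_window(store, 0, 3600))
    print(aggregate_window(store, 3600, 7200))
```

Using a half-open window (start inclusive, end exclusive) means an event that falls exactly on an interval boundary is counted in one batch only.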

Publisher Resources

ISBN: 9781492031840