Grokking Streaming Systems

by Ning Wang, Josh Fischer
March 2022
Content level: Intermediate to advanced
312 pages
6h 29m
English
Manning Publications

3 Parallelization and data grouping

In this chapter

  • parallelization
  • data parallelism and task parallelism
  • event grouping

Nine people can’t make a baby in a month.

—Frederick P. Brooks

In the previous chapter, AJ and Miranda tackled keeping a real-time count of traffic driving over the bridge using a streaming job. The system she built is fairly limited when it comes to processing heavy traffic. Can you imagine crossing a bridge whose tollbooth has only one lane during rush hour? Yikes! In this chapter, we are going to learn a basic technique for solving a fundamental challenge in most distributed systems: scaling a streaming system to increase the throughput of a job or, in other words, to process more data.

The sensor is emitting ...

ISBN: 9781617297304