Skip to Content
Learn AWS Serverless Computing
book

Learn AWS Serverless Computing

by Scott Patterson
December 2019
Intermediate to advanced
382 pages
9h 43m
English
Packt Publishing
Content preview from Learn AWS Serverless Computing

Micro-batch

The use cases for micro-batch data processing can be similar to traditional batch processing. The main differences are that the data will be processed more often and in smaller volumes. When designing a micro-batch process, there are two ways to approach the scheduling model. It can be based on the following:

  • Time interval-based: This is when you process data on a predefined schedule, for example, every 5 minutes.
  • Event-based: This is when you choose to process data when the amount of data that's available or queued for processing reaches a defined threshold, for example, every 2 MB or every 1,000 records.

Apache Spark is one of the most popular open source tools for micro-batch workloads—AWS Glue is also based on Apache Spark. ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Start your free trial

You might also like

Practical Amazon EC2, SQS, Kinesis, and S3: A Hands-On Approach to AWS

Practical Amazon EC2, SQS, Kinesis, and S3: A Hands-On Approach to AWS

Sunil Gulabani

Publisher Resources

ISBN: 9781789958355