book

Learn AWS Serverless Computing

by Scott Patterson

December 2019

Intermediate to advanced

382 pages

9h 43m

English

Packt Publishing

Read now

Unlock full access

Content preview from Learn AWS Serverless Computing

Micro-batch

The use cases for micro-batch data processing can be similar to traditional batch processing. The main differences are that the data will be processed more often and in smaller volumes. When designing a micro-batch process, there are two ways to approach the scheduling model. It can be based on the following:

Time interval-based: This is when you process data on a predefined schedule, for example, every 5 minutes.
Event-based: This is when you choose to process data when the amount of data that's available or queued for processing reaches a defined threshold, for example, every 2 MB or every 1,000 records.

Apache Spark is one of the most popular open source tools for micro-batch workloads—AWS Glue is also based on Apache Spark. ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Start your free trial

Practical Amazon EC2, SQS, Kinesis, and S3: A Hands-On Approach to AWS

Sunil Gulabani

Hands-On Serverless Computing

Kuldeep Chowhan

Mastering AWS CloudFormation

Karen Tovmasyan

Getting Started with Cloud Pipelines with AWS Lambda and AWS Step Functions

Alfredo Deza, Noah Gift

Publisher Resources

ISBN: 9781789958355