Table of Contents
Preface
Section 1: Overview, Architecture, Big Data Applications, and Common Use Cases of Amazon EMR
Chapter 1: An Overview of Amazon EMR
What is Amazon EMR?
What is big data?
Hadoop – processing framework to handle big data
Overview of Amazon EMR – managed and scalable Hadoop cluster in AWS
A brief history of the major big data releases
Benefits of Amazon EMR
Decoupling compute and storage
Persistent versus transient clusters
Integration with other AWS services
Amazon S3 with EMR File System (EMRFS)
Amazon Kinesis Data Streams (KDS)
Amazon Managed Streaming for Kafka (MSK)
AWS Glue Data Catalog
Amazon Relational Database Service (RDS)
Amazon DynamoDB
Amazon Redshift
AWS Lake Formation
AWS Identity and Access Management (IAM) ...
Get Simplify Big Data Analytics with Amazon EMR now with the O’Reilly learning platform.
O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.