Index
A
- access control lists (ACLs), Data Security in Elastic MapReduce
- activities, adding in data pipeline, Adding Activities
- add-instance-group option, Scheduling with the CLI
- Amazon Architecture Center, Amazon EMR Distributions
- Amazon CloudWatch, Amazon EMR and the Hadoop Ecosystem
- Amazon Data Pipeline
  - adding activities, Adding Activities
  - adding data nodes, Adding Data Nodes
  - basics of, Amazon Web Services Used in This Book, Data Filtering Design Patterns and Scheduling Work
  - costs of, AWS Pipeline Costs
  - geographic availability of, Scheduling with AWS Data Pipeline
  - Job Flow scheduling with, Scheduling with AWS Data Pipeline
  - online resources for, Amazon AWS Cost Estimation Tools
  - pipeline creation, Creating a Pipeline
  - reviewing pipeline status, Reviewing Pipeline Status
  - scheduling pipelines, Scheduling Pipelines
- Amazon Elastic Compute Cloud (EC2)
  - Bash script on, Simulating Syslog Data
  - basics of, Amazon Web Services Used in This Book
  - custom instance creation, Amazon EMR Distributions
  - key pairs in, Utilizing Pig in Amazon EMR
  - management console choices, Simulating Syslog Data
  - online resources for, Amazon AWS Online Resources
  - performance improvement with, Performance
  - pre-configured instances, AWS Best Practices and Architecture
- Amazon Elastic MapReduce (EMR)
  - basics of, Preface, Amazon Web Services Used in This Book, Data Collection and Data Analysis with AWS
  - cluster interaction, Scheduling with the CLI
  - cluster overview, Amazon Elastic MapReduce, EMR and EC2 usage billed by the hour
  - cluster types, Amazon Job Flow ...