One of the most widely used use cases for AWS Data Pipeline is its ability to synchronize and schedule backup jobs. You can use Data Pipeline to take backups of data stored within EC2 instances, EBS volumes, databases and even S3 buckets. In this section, we will walk through a simple, parameterized pipeline definition using which you can effectively schedule and perform backups of files stored within an Amazon S3 bucket.
First up, let's have a look at the pipeline definition file itself:
You can find the complete copy of code at https://github.com/yoyoclouds/Administering-AWS-Volume2.
To start with, we once again provide a list of objects that describe the pipeline components starting with a pipeline ...