Skip to Content
Learn Python by Building Data Science Applications
book

Learn Python by Building Data Science Applications

by Philipp Kats, David Katz
August 2019
Beginner
482 pages
12h 56m
English
Packt Publishing
Content preview from Learn Python by Building Data Science Applications

Writing to an S3 bucket

S3 buckets and similar blob storage services have proven to be a great tool. Given the small price and ease of use, they are arguably the best solution for shared data exchange in the cloud. Here, we won't go in depth on S3 and Amazon Web Services (AWS) in general. Instead, we'll show how to modify your existing pipelines to redirect them to S3. Aside from data being in the cloud, this has the benefit of a shared state—if another computer or user tries to run the pipeline, they will find that the data is there already.

Let's assume you have registered as an AWS customer and have an S3 bucket. To work with buckets, luigi uses the boto3 package, an official library for AWS-related operations, built by Amazon. This means ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Start your free trial

You might also like

Python for Data Science

Python for Data Science

Yuli Vasiliev
Introduction to Machine Learning with Python

Introduction to Machine Learning with Python

Andreas C. Müller, Sarah Guido

Publisher Resources

ISBN: 9781789535365Supplemental Content