Skip to Content
Data Science with Python and Dask
book

Data Science with Python and Dask

by Jesse Daniel
July 2019
Intermediate to advanced content levelIntermediate to advanced
296 pages
9h 1m
English
Manning Publications
Content preview from Data Science with Python and Dask

11 Scaling and deploying Dask

This chapter covers

  • Creating a Dask Distributed cluster on Amazon AWS using Docker and Elastic Container Service
  • Using a Jupyter Notebook server and Elastic File System to store and access data science notebooks and shared datasets in Amazon AWS
  • Using the Distributed client object to submit jobs to a Dask cluster
  • Monitoring execution of jobs on the cluster using the Distributed monitoring dashboard

Up to this point, we’ve been working with Dask in local mode. This means that everything we’ve asked Dask to do has all been executed on a single computer. Running Dask in local mode is very useful for prototyping, development, and ad-hoc exploration, but we can still quickly reach the performance limits of a single ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Start your free trial

You might also like

Practical Data Science with Python

Practical Data Science with Python

Nathan George
Python: End-to-end Data Analysis

Python: End-to-end Data Analysis

Phuong Vothihong, Martin Czygan, Ivan Idris, Magnus Vilhelm Persson, Luiz Felipe Martins

Publisher Resources

ISBN: 9781617295607OtherSupplemental ContentPublisher SupportPublisher WebsiteSupplemental ContentPurchase Link