Skip to Content
Data Science with Python and Dask
book

Data Science with Python and Dask

by Jesse Daniel
July 2019
Intermediate to advanced content levelIntermediate to advanced
296 pages
9h 1m
English
Manning Publications
Content preview from Data Science with Python and Dask

Part 3 Extending and deploying Dask

In part 3, we round out our exploration of Dask by covering some advanced topics: unstructured data, machine learning, and deploying Dask to the cloud. These are good topics to end on, because you should be fairly comfortable with the Dask paradigm by now. Once again, all the chapters are anchored on real-world datasets and common tasks you may encounter in any data science project.

Chapter 9 discusses how to use Dask Bags—a parallelized implementation of standard Python Lists—and Dask Arrays—a parallelized implementation of NumPy Arrays—to work with more complicated, unstructured datasets. We’ll cover some advanced collections topics such as mapping, folding, and reducing by parsing text data stored in ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Start your free trial

You might also like

Practical Data Science with Python

Practical Data Science with Python

Nathan George
Python: End-to-end Data Analysis

Python: End-to-end Data Analysis

Phuong Vothihong, Martin Czygan, Ivan Idris, Magnus Vilhelm Persson, Luiz Felipe Martins

Publisher Resources

ISBN: 9781617295607OtherSupplemental ContentPublisher SupportPublisher WebsiteSupplemental ContentPurchase Link