Chapter 5: Parallel Training with TPOT and Dask

In this chapter, you'll dive into a bit of a more advanced topic; that is, automated machine learning. You'll learn how to handle machine learning tasks in a parallel manner by distributing the work on a Dask cluster. This chapter will be more theoretical than the previous two, but you will still learn many useful things.

We'll cover essential topics and ideas behind parallelism in Python, and you'll learn how to achieve parallelism in a couple of different ways. Then, we'll dive deep into the Dask library, explore its basic functionality, and see how you can tie it with TPOT.

This chapter will cover the following topics:

  • Introduction to parallelism in Python
  • Introduction to the Dask library

Get Machine Learning Automation with TPOT now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.