Chapter 8: Creating an End-to-End Machine Learning Workflow

In previous chapters, we covered Pachyderm basics and how to install Pachyderm locally and on a cloud platform. We deployed our first pipeline, learned how to update a pipeline, and performed some fundamental Pachyderm operations, such as splitting. We hope that by now you are convinced that Pachyderm is a versatile tool that gives you considerable flexibility and power in managing your machine learning pipelines. To demonstrate this further, we will deploy a much more complex example than the ones we have deployed so far. We hope this chapter will be especially fun to work through and will deepen your understanding of data infrastructure quirks.

In this ...