Skip to Content
Reproducible Data Science with Pachyderm
book

Reproducible Data Science with Pachyderm

by Svetlana Karslioglu
March 2022
Intermediate to advanced content levelIntermediate to advanced
364 pages
6h 44m
English
Packt Publishing
Content preview from Reproducible Data Science with Pachyderm

Preface

Pachyderm is a distributed version control platform for building end-to-end data science workflows. Since its creation in 2016, Pachyderm has become a go-to solution for large and small organizations. The core functionality of Pachyderm is open source and has a vivid community of engineers around it. This book walks you through basic and advanced examples of Pachyderm usage. This book will help you get started quickly and integrate a reliable data science solution into your infrastructure.

Reproducible Data Science with Pachyderm provides a clear overview of Pachyderm, as well as instructions on how to install and run Pachyderm in the cloud, and how to use the Pachyderm Software-as-a-Service (SaaS) version – Pachyderm Hub. This book ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Start your free trial

You might also like

TensorFlow 2 Pocket Reference

TensorFlow 2 Pocket Reference

KC Tung
Understanding Economic Equilibrium

Understanding Economic Equilibrium

Thomas J. Cunningham, Mike Shaw

Publisher Resources

ISBN: 9781801074483Supplemental Content