Exploring Pachyderm
Our focus for this book is on developing deep learning systems in Go. So, naturally, now that we are talking about how to manage the data that we feed to our networks, let's take a look at a tool for doing so that is itself written in Go.
Pachyderm is a mature and scalable tool that offers containerized data pipelines. In these, everything you could possibly need, from data to tools, is held together in a single place where deployments can be maintained and managed, and where the data itself is versioned. The Pachyderm team sell their tool as Git for data, which is a useful analogy. Ideally, we want to version the entire pipeline so that we know which data was used to train, and which, in turn, gave us the specific prediction of ...