Video description
Training deep neural network models requires a highly tuned system with the right combination of software, drivers, compute, memory, network, and storage resources. Deep learning frameworks like TensorFlow, PyTorch, Caffe, Torch, Theano, and MXNet have contributed to the popularity of deep learning by reducing the effort and skill needed to design, train, and use deep learning models. Fabric for Deep Learning (FfDL, pronounced “fiddle”) provides a consistent way to run these deep learning frameworks as a service on Kubernetes. FfDL uses a microservices architecture to reduce coupling between components, keep each component simple and as stateless as possible, isolate component failures, and allow each component to be developed, tested, deployed, scaled, and upgraded independently. Animesh Singh, Atin Sood, and Tommy Li share lessons learned while building and using FfDL and demonstrate how to leverage it to execute distributed deep learning training for models written using multiple frameworks, using GPUs and object storage constructs. They then explain how to take models from IBM’s Model Asset Exchange, train them using FfDL, and deploy them on Kubernetes for serving and inferencing. This session is sponsored by IBM.
Table of contents
Product information
- Title: Fabric for deep learning at IBM
- Author(s):
- Release date: August 2019
- Publisher(s): O'Reilly Media, Inc.
- ISBN: 0636920452133
You might also like
video
Spot and overcome machine learning bottlenecks: Lessons from Baidu
A few months ago, Baidu deployed Alluxio to accelerate its big data analytics workload. Bin Fan …
video
ODSC Europe 2018 (Open Data Science Conference)
ODSC Europe 2018 Royalties for this video set help fund the ODSC Grant Award for open …
video
ODSC East 2018 (Open Data Science Conference)
ODSC The Open Data Science Conference has established itself as the leading conference in the field …
video
Machine Learning for analyzing major datasets to create automated educational tools for young poets by Power Poetry
Power Poetry is the largest online platform for young poets, with over 350K users. Ann Nguyen …