Video description
Training deep neural network models requires a highly tuned system with the right combination of software, drivers, compute, memory, network, and storage resources. Deep learning frameworks like TensorFlow, PyTorch, Caffe, Torch, Theano, and MXNet have contributed to the popularity of deep learning by reducing the effort and skill needed to design, train, and use deep learning models. Fabric for Deep Learning (FfDL, pronounced “fiddle”) provides a consistent way to run these deep learning frameworks as a service on Kubernetes. FfDL uses a microservices architecture to reduce coupling between components, keep each component simple and as stateless as possible, isolate component failures, and allow each component to be developed, tested, deployed, scaled, and upgraded independently. Animesh Singh, Atin Sood, and Tommy Li share lessons learned while building and using FfDL and demonstrate how to leverage it to execute distributed deep learning training for models written using multiple frameworks, using GPUs and object storage constructs. They then explain how to take models from IBM’s Model Asset Exchange, train them using FfDL, and deploy them on Kubernetes for serving and inferencing. This session is sponsored by IBM.
Table of contents
Product information
- Title: Fabric for deep learning at IBM
- Author(s):
- Release date: August 2019
- Publisher(s): O'Reilly Media, Inc.
- ISBN: 0636920452133
You might also like
video
Spot and overcome machine learning bottlenecks: Lessons from Baidu
A few months ago, Baidu deployed Alluxio to accelerate its big data analytics workload. Bin Fan …
video
ODSC Europe 2018 (Open Data Science Conference)
ODSC Europe 2018 Royalties for this video set help fund the ODSC Grant Award for open …
video
How Pirelli built a data science team from scratch
Pirelli is one of the world's largest tire manufacturers and the exclusive tire supplier for F1 …
video
Business Forecasting with AI at Fluidly: Building an intelligent cashflow engine
Cashflow is responsible for 80–90% of UK SME failure. Fluidly uses the wealth of financial data …