Chapter 14Reliability Pillar

Although a lot of our discussion of the ML lifecycle thus far has focused on the iterative and experimental nature, once an ML model is deployed into production, or once your ML environments are set up, you will want to ensure that they are resilient and fault tolerant against failures, that they can scale up and down elastically to meet your demands dynamically, and that you can quickly mitigate any ...

Get AWS Certified Machine Learning Study Guide now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.