Book description
Swiftly build and deploy machine learning models without managing infrastructure and boost productivity using the latest Amazon SageMaker capabilities such as Studio, Autopilot, Data Wrangler, Pipelines, and Feature Store
Key Features
- Build, train, and deploy machine learning models quickly using Amazon SageMaker
- Optimize the accuracy, cost, and fairness of your models
- Create and automate end-to-end machine learning workflows on Amazon Web Services (AWS)
Book Description
Amazon SageMaker enables you to quickly build, train, and deploy machine learning models at scale without managing any infrastructure. It helps you focus on the machine learning problem at hand and deploy high-quality models by eliminating the heavy lifting typically involved in each step of the ML process. This second edition will help data scientists and ML developers to explore new features such as SageMaker Data Wrangler, Pipelines, Clarify, Feature Store, and much more.
You'll start by learning how to use various capabilities of SageMaker as a single toolset to solve ML challenges and progress to cover features such as AutoML, built-in algorithms and frameworks, and writing your own code and algorithms to build ML models. The book will then show you how to integrate Amazon SageMaker with popular deep learning libraries, such as TensorFlow and PyTorch, to extend the capabilities of existing models. You'll also see how automating your workflows can help you get to production faster with minimum effort and at a lower cost. Finally, you'll explore SageMaker Debugger and SageMaker Model Monitor to detect quality issues in training and production.
By the end of this Amazon book, you'll be able to use Amazon SageMaker on the full spectrum of ML workflows, from experimentation, training, and monitoring to scaling, deployment, and automation.
What you will learn
- Become well-versed with data annotation and preparation techniques
- Use AutoML features to build and train machine learning models with AutoPilot
- Create models using built-in algorithms and frameworks and your own code
- Train computer vision and natural language processing (NLP) models using real-world examples
- Cover training techniques for scaling, model optimization, model debugging, and cost optimization
- Automate deployment tasks in a variety of configurations using SDK and several automation tools
Who this book is for
This book is for software engineers, machine learning developers, data scientists, and AWS users who are new to using Amazon SageMaker and want to build high-quality machine learning models without worrying about infrastructure. Knowledge of AWS basics is required to grasp the concepts covered in this book more effectively. A solid understanding of machine learning concepts and the Python programming language will also be beneficial.
Table of contents
- Learn Amazon SageMaker Second Edition
- Contributors
- About the author
- About the reviewers
- Preface
- Section 1: Introduction to Amazon SageMaker
- Chapter 1: Introducing Amazon SageMaker
- Chapter 2: Handling Data Preparation Techniques
- Section 2: Building and Training Models
- Chapter 3: AutoML with Amazon SageMaker Autopilot
- Chapter 4: Training Machine Learning Models
- Chapter 5: Training CV Models
- Chapter 6: Training Natural Language Processing Models
- Chapter 7: Extending Machine Learning Services Using Built-In Frameworks
-
Chapter 8: Using Your Algorithms and Code
- Technical requirements
- Understanding how SageMaker invokes your code
- Customizing an existing framework container
- Using the SageMaker Training Toolkit with scikit-learn
- Building a fully custom container for scikit-learn
- Building a fully custom container for R
- Training and deploying with your own code on MLflow
- Building a fully custom container for SageMaker Processing
- Summary
- Section 3: Diving Deeper into Training
-
Chapter 9: Scaling Your Training Jobs
- Technical requirements
- Understanding when and how to scale
- Monitoring and profiling training jobs with Amazon SageMaker Debugger
- Streaming datasets with pipe mode
- Distributing training jobs
- Scaling an image classification model on ImageNet
- Training with the SageMaker data and model parallel libraries
- Using other storage services
- Summary
-
Chapter 10: Advanced Training Techniques
- Technical requirements
- Optimizing training costs with managed spot training
- Optimizing hyperparameters with automatic model tuning
- Exploring models with SageMaker Debugger
- Managing features and building datasets with SageMaker Feature Store
- Detecting bias in datasets and explaining predictions with SageMaker Clarify
- Summary
- Section 4: Managing Models in Production
-
Chapter 11: Deploying Machine Learning Models
- Technical requirements
- Examining model artifacts and exporting models
- Deploying models on real-time endpoints
- Deploying models on batch transformers
- Deploying models on inference pipelines
- Monitoring prediction quality with Amazon SageMaker Model Monitor
- Deploying models to container services
- Summary
-
Chapter 12: Automating Machine Learning Workflows
- Technical requirements
- Automating with AWS CloudFormation
- Automating with AWS CDK
- Building end-to-end workflows with AWS Step Functions
-
Building end-to-end workflows with Amazon SageMaker Pipelines
- Defining workflow parameters
- Processing the dataset with SageMaker Processing
- Ingesting the dataset in SageMaker Feature Store with SageMaker Processing
- Building a dataset with Amazon Athena and SageMaker Processing
- Training a model
- Creating and registering a model in SageMaker Pipelines
- Creating a pipeline
- Running a pipeline
- Deploying a model from the model registry
- Summary
- Chapter 13: Optimizing Prediction Cost and Performance
- Other Books You May Enjoy
Product information
- Title: Learn Amazon SageMaker - Second Edition
- Author(s):
- Release date: November 2021
- Publisher(s): Packt Publishing
- ISBN: 9781801817950
You might also like
book
Learn Amazon SageMaker
Quickly build and deploy machine learning models without managing infrastructure, and improve productivity using Amazon SageMaker’s …
book
Learning Go, 2nd Edition
Go has rapidly become the preferred language for building web services. Plenty of tutorials are available …
book
Learning Spark, 2nd Edition
Data is bigger, arrives faster, and comes in a variety of formatsâ??and it all needs to …
book
Software Engineering at Google
Today, software engineers need to know not only how to program effectively but also how to …