Book description
With the increasing use of AI in high-stakes domains such as medicine, law, and defense, organizations spend a lot of time and money to make ML models trustworthy. Many books on the subject offer deep dives into theories and concepts. This guide provides a practical starting point to help development teams produce models that are secure, more robust, less biased, and more explainable.
Authors Yada Pruksachatkun, Matthew McAteer, and Subhabrata Majumdar translate best practices in the academic literature for curating datasets and building models into a blueprint for building industry-grade trusted ML systems. With this book, engineers and data scientists will gain a much-needed foundation for releasing trustworthy ML applications into a noisy, messy, and often hostile world.
You'll learn:
- Methods to explain ML models and their outputs to stakeholders
- How to recognize and fix fairness concerns and privacy leaks in an ML pipeline
- How to develop ML systems that are robust and secure against malicious attacks
- Important systemic considerations, like how to manage trust debt and which ML obstacles require human intervention
Publisher resources
Table of contents
-
Preface
- Implementing Machine Learning in Production
- The Transformer Convergence
- An Explosion of Large and Highly Capable ML Models
- Why We Wrote This Book
- Who This Book Is For
- AI Safety and Alignment
- Use of HuggingFace PyTorch for AI Models
- Foundations
- Conventions Used in This Book
- Using Code Examples
- O’Reilly Online Learning
- How to Contact Us
- Acknowledgments
- 1. Privacy
-
2. Fairness and Bias
- Case 1: Social Media
- Case 2: Triaging Patients in Healthcare Systems
- Case 3: Legal Systems
- Key Concepts in Fairness and Fairness-Related Harms
- Scenario 1: Language Generation
- Scenario 2: Image Captioning
- Fairness Harm Mitigation
- Fairness Tool Kits
- How Can You Prioritize Fairness in Your Organization?
- Conclusion
- Further Reading
-
3. Model Explainability and Interpretability
- Explainability Versus Interpretability
- The Need for Interpretable and Explainable Models
- A Possible Trade-off Between Explainability and Privacy
- Evaluating the Usefulness of Interpretation or Explanation Methods
- Definitions and Categories
- Methods for Explaining Models and Interpreting Outputs
- Overcome the Limitations of Interpretability with a Security Mindset
- Limitations and Pitfalls of Explainable and Interpretable Methods
- Risks of Deceptive Interpretability
- Conclusion
- 4. Robustness
-
5. Secure and Trustworthy Data Generation
- Case 1: Unsecured AWS Buckets
- Case 2: Clearview AI Scraping Photos from Social Media
- Case 3: Improperly Stored Medical Data
- Issues in Procuring Real-World Data
-
Synthetically Generated Data
- DALL·E, GPT-3, and Synthetic Data
- Improving Pattern Recognition with Synthetic Data
- Deep Dive: Pre-Training a Model with a Process-Driven Synthetic Dataset
- Facial Recognition, Pose Detection, and Human-Centric Tasks
- Object Recognition and Related Tasks
- Environment Navigation
- Unity and Unreal Environments
- Limitations of Synthetic Data in Healthcare
- Limitations of Synthetic Data in NLP
- Self-Supervised Learned Models Versus Giant Natural Datasets
- Repurposing Quality Control Metrics for Security Purposes
- Conclusion
- 6. More State-of-the-Art Research Questions
- 7. From Theory to Practice
- 8. An Ecosystem of Trust
- A. Synthetic Data Generation Tools
- B. Other Interpretability and Explainability Tool Kits
- Index
- About the Authors
Product information
- Title: Practicing Trustworthy Machine Learning
- Author(s):
- Release date: January 2023
- Publisher(s): O'Reilly Media, Inc.
- ISBN: 9781098120276
You might also like
book
Graph-Powered Machine Learning
Upgrade your machine learning models with graph-based algorithms, the perfect structure for complex and interlinked data. …
book
Graph Machine Learning
Build machine learning algorithms using graph data and efficiently exploit topological information within your models Key …
book
Machine Learning Pocket Reference
With detailed notes, tables, and examples, this handy reference will help you navigate the basics of …
book
Training Data for Machine Learning
Your training data has as much to do with the success of your data project as …