Book description
With the increasing use of AI in high-stakes domains such as medicine, law, and defense, organizations spend a lot of time and money to make ML models trustworthy. Many books on the subject offer deep dives into theories and concepts. This guide provides a practical starting point to help development teams produce models that are secure, more robust, less biased, and more explainable.
Authors Yada Pruksachatkun, Matthew McAteer, and Subhabrata Majumdar translate best practices in the academic literature for curating datasets and building models into a blueprint for building industry-grade trusted ML systems. With this book, engineers and data scientists will gain a much-needed foundation for releasing trustworthy ML applications into a noisy, messy, and often hostile world.
You'll learn:
- Methods to explain ML models and their outputs to stakeholders
- How to recognize and fix fairness concerns and privacy leaks in an ML pipeline
- How to develop ML systems that are robust and secure against malicious attacks
- Important systemic considerations, like how to manage trust debt and which ML obstacles require human intervention
Table of contents
-
Preface
- Implementing Machine Learning in Production
- The Transformer Convergence
- An Explosion of Large and Highly Capable ML Models
- Why We Wrote This Book
- Who This Book Is For
- AI Safety and Alignment
- Use of HuggingFace PyTorch for AI Models
- Foundations
- Conventions Used in This Book
- Using Code Examples
- O’Reilly Online Learning
- How to Contact Us
- Acknowledgments
- 1. Privacy
-
2. Fairness and Bias
- Case 1: Social Media
- Case 2: Triaging Patients in Healthcare Systems
- Case 3: Legal Systems
- Key Concepts in Fairness and Fairness-Related Harms
- Scenario 1: Language Generation
- Scenario 2: Image Captioning
- Fairness Harm Mitigation
- Fairness Tool Kits
- How Can You Prioritize Fairness in Your Organization?
- Conclusion
- Further Reading
-
3. Model Explainability and Interpretability
- Explainability Versus Interpretability
- The Need for Interpretable and Explainable Models
- A Possible Trade-off Between Explainability and Privacy
- Evaluating the Usefulness of Interpretation or Explanation Methods
- Definitions and Categories
- Methods for Explaining Models and Interpreting Outputs
- Overcome the Limitations of Interpretability with a Security Mindset
- Limitations and Pitfalls of Explainable and Interpretable Methods
- Risks of Deceptive Interpretability
- Conclusion
- 4. Robustness
-
5. Secure and Trustworthy Data Generation
- Case 1: Unsecured AWS Buckets
- Case 2: Clearview AI Scraping Photos from Social Media
- Case 3: Improperly Stored Medical Data
- Issues in Procuring Real-World Data
-
Synthetically Generated Data
- DALL·E, GPT-3, and Synthetic Data
- Improving Pattern Recognition with Synthetic Data
- Deep Dive: Pre-Training a Model with a Process-Driven Synthetic Dataset
- Facial Recognition, Pose Detection, and Human-Centric Tasks
- Object Recognition and Related Tasks
- Environment Navigation
- Unity and Unreal Environments
- Limitations of Synthetic Data in Healthcare
- Limitations of Synthetic Data in NLP
- Self-Supervised Learned Models Versus Giant Natural Datasets
- Repurposing Quality Control Metrics for Security Purposes
- Conclusion
- 6. More State-of-the-Art Research Questions
- 7. From Theory to Practice
- 8. An Ecosystem of Trust
- A. Synthetic Data Generation Tools
- B. Other Interpretability and Explainability Tool Kits
- Index
- About the Authors
Product information
- Title: Practicing Trustworthy Machine Learning
- Author(s):
- Release date: January 2023
- Publisher(s): O'Reilly Media, Inc.
- ISBN: 9781098120276
You might also like
book
Causal Inference in Python
How many buyers will an additional dollar of online marketing bring in? Which customers will only …
book
Analytical Skills for AI and Data Science
While several market-leading companies have successfully transformed their business models by following data- and AI-driven paths, …
book
Mastering Financial Pattern Recognition
Candlesticks have become a key component of platforms and charting programs for financial trading. With these …
book
SQL for Data Analysis
With the explosion of data, computing power, and cloud data warehouses, SQL has become an even …