book

Interpretable Machine Learning with Python - Second Edition

Name: Interpretable Machine Learning with Python - Second Edition
Author: Serg Masís
ISBN: 9781803235424

by Serg Masís

October 2023

Intermediate to advanced

606 pages

16h 37m

English

Packt Publishing

Read now

Unlock full access

Preface
Who this book is forWhat this book coversTo get the most out of this bookGet in touch
Interpretation, Interpretability, and Explainability; and Why Does It All Matter?
Technical requirementsWhat is machine learning interpretation?Understanding a simple weight prediction modelUnderstanding the difference between interpretability and explainabilityWhat is interpretability?Beware of complexityWhen does interpretability matter?What are black-box models?What are white-box models?What is explainability?Why and when does explainability matter?A business case for interpretabilityBetter decisionsMore trusted brandsMore ethicalMore profitableSummaryImage sourcesDataset sourcesFurther reading
Key Concepts of Interpretability
Technical requirementsThe missionDetails about CVDThe approachPreparationsLoading the librariesUnderstanding and preparing the dataThe data dictionaryData preparationInterpretation method types and scopesModel interpretability method typesModel interpretability scopesInterpreting individual predictions with logistic regressionAppreciating what hinders machine learning interpretabilityNon-linearityInteractivityNon-monotonicityMission accomplishedSummaryFurther reading
Interpretation Challenges
Technical requirementsThe missionThe approachThe preparationsLoading the librariesUnderstanding and preparing the dataThe data dictionaryData preparationReviewing traditional model interpretation methodsPredicting minutes delayed with various regression methodsClassifying flights as delayed or not delayed with various classification methodsTraining and evaluating the classification modelsUnderstanding limitations of traditional model interpretation methodsStudying intrinsically interpretable (white-box) modelsGeneralized Linear Models (GLMs)Linear regressionRidge regressionPolynomial regressionLogistic regressionDecision treesCART decision treesRuleFitInterpretation and feature importanceNearest neighborsk-Nearest NeighborsNaïve BayesGaussian Naïve BayesRecognizing the trade-off between performance and interpretabilitySpecial model propertiesThe key property: explainabilityThe remedial property: regularizationAssessing performanceDiscovering newer interpretable (glass-box) modelsExplainable Boosting Machine (EBM)Global interpretationLocal interpretationPerformanceGAMI-NetGlobal interpretationLocal interpretationPerformanceMission accomplishedSummaryDataset sourcesFurther reading
Global Model-Agnostic Interpretation Methods
Technical requirementsThe missionThe approachThe preparationsLoading the librariesData preparationModel training and evaluationWhat is feature importance?Assessing feature importance with model-agnostic methodsPermutation feature importanceSHAP valuesComprehensive explanations with KernelExplainerFaster explanations with TreeExplainerVisualize global explanationsSHAP bar plotSHAP beeswarm plotFeature summary explanationsPartial dependence plots SHAP scatter plotALE plotsFeature interactionsSHAP bar plot with clustering2D ALE plotsPDP interactions plotsMission accomplishedSummaryFurther reading
Local Model-Agnostic Interpretation Methods
Technical requirementsThe missionThe approachThe preparationsLoading the librariesUnderstanding and preparing the dataThe data dictionaryData preparationLeveraging SHAP’s KernelExplainer for local interpretations with SHAP valuesTraining a C-SVC modelComputing SHAP values using KernelExplainerLocal interpretation for a group of predictions using decision plotsLocal interpretation for a single prediction at a time using a force plotEmploying LIMEWhat is LIME?Local interpretation for a single prediction at a time using LimeTabularExplainerUsing LIME for NLPTraining a LightGBM modelLocal interpretation for a single prediction at a time using LimeTextExplainerTrying SHAP for NLPComparing SHAP with LIMEMission accomplishedSummaryDataset sourcesFurther reading
Anchors and Counterfactual Explanations
Technical requirementsThe missionUnfair bias in recidivism risk assessmentsThe approachThe preparationsLoading the librariesUnderstanding and preparing the dataThe data dictionaryExamining predictive bias with confusion matricesData preparationModelingGetting acquainted with our “instance of interest”Understanding anchor explanationsPreparations for anchor and counterfactual explanations with alibiLocal interpretations for anchor explanationsExploring counterfactual explanationsCounterfactual explanations guided by prototypesCounterfactual instances and much more with WITConfiguring WITDatapoint editorPerformance & FairnessMission accomplishedSummaryDataset sourcesFurther reading
Visualizing Convolutional Neural Networks
Technical requirementsThe missionThe approachPreparationsLoading the librariesUnderstanding and preparing the dataData preparationInspect dataThe CNN modelsLoad the CNN modelAssessing the CNN classifier with traditional interpretation methodsDetermining what misclassifications to focus onVisualizing the learning process with activation-based methodsIntermediate activationsEvaluating misclassifications with gradient-based attribution methodsSaliency mapsGuided Grad-CAMIntegrated gradientsBonus method: DeepLIFTTying it all togetherUnderstanding classifications with perturbation-based attribution methodsFeature ablationOcclusion sensitivityShapley value samplingKernelSHAPTying it all togetherMission accomplishedSummaryFurther reading
Interpreting NLP Transformers
Technical requirementsThe missionThe approachThe preparationsLoading the librariesUnderstanding and preparing the dataThe data dictionaryLoading the modelVisualizing attention with BertVizPlotting all attention with the model viewDiving into layer attention with the head viewInterpreting token attributions with integrated gradientsLIME, counterfactuals, and other possibilities with the LITMission accomplishedSummaryFurther reading
Interpretation Methods for Multivariate Forecasting and Sensitivity Analysis
Technical requirementsThe missionThe approachThe preparationLoading the librariesUnderstanding and preparing the dataThe data dictionaryUnderstanding the dataData preparationLoading the LSTM modelAssessing time series models with traditional interpretation methodsUsing standard regression metricsPredictive error aggregationsEvaluating the model like a classification problemGenerating LSTM attributions with integrated gradientsComputing global and local attributions with SHAP’s KernelExplainerWhy use KernelExplainer?Defining a strategy to get it to work with a multivariate time series modelLaying the groundwork for the permutation approximation strategyComputing the SHAP valuesIdentifying influential features with factor prioritizationComputing Morris sensitivity indicesAnalyzing the elementary effectsQuantifying uncertainty and cost sensitivity with factor fixingGenerating and predicting on Saltelli samplesPerforming Sobol sensitivity analysisIncorporating a realistic cost functionMission accomplishedSummaryDataset and image sourcesFurther reading

Feature Selection and Engineering for Interpretability
Technical requirementsThe missionThe approachThe preparationsLoading the librariesUnderstanding and preparing the dataUnderstanding the effect of irrelevant featuresCreating a base modelEvaluating the modelTraining the base model at different max depthsReviewing filter-based feature selection methodsBasic filter-based methodsConstant features with a variance thresholdQuasi-constant features with value_countsDuplicating featuresRemoving unnecessary featuresCorrelation filter-based methodsRanking filter-based methodsComparing filter-based methodsExploring embedded feature selection methodsDiscovering wrapper, hybrid, and advanced feature selection methodsWrapper methodsSequential forward selection (SFS)Hybrid methodsRecursive Feature Elimination (RFE)Advanced methodsModel-agnostic feature importanceGenetic algorithmsEvaluating all feature-selected modelsConsidering feature engineeringMission accomplishedSummaryDataset sourcesFurther reading
Bias Mitigation and Causal Inference Methods
Technical requirementsThe missionThe approachThe preparationsLoading the librariesUnderstanding and preparing the dataThe data dictionaryData preparationDetecting biasVisualizing dataset biasQuantifying dataset biasQuantifying model biasMitigating biasPreprocessing bias mitigation methodsThe Reweighing methodThe disparate impact remover methodIn-processing bias mitigation methodsThe exponentiated gradient reduction methodThe gerry fair classifier methodPost-processing bias mitigation methodsThe equalized odds post-processing methodThe calibrated equalized odds postprocessing methodTying it all together!Creating a causal modelUnderstanding the results of the experimentUnderstanding causal modelsInitializing the linear doubly robust learnerFitting the causal modelUnderstanding heterogeneous treatment effectsChoosing policiesTesting estimate robustnessAdding a random common causeReplacing the treatment variable with a random variableMission accomplishedSummaryDataset sourcesFurther reading
Monotonic Constraints and Model Tuning for Interpretability
Technical requirementsThe missionThe approachThe preparationsLoading the librariesUnderstanding and preparing the dataVerifying the sampling balancePlacing guardrails with feature engineeringOrdinalizationDiscretizationInteraction terms and non-linear transformationsCategorical encodingOther preparationsTuning models for interpretabilityTuning a Keras neural networkDefining the model and parameters to tuneRunning the hyperparameter tuningExamining the resultsEvaluating the best modelTuning other popular model classesA quick introduction to relevant model parametersBatch hyperparameter tuning modelsEvaluating models by precisionAssessing fairness for the highest-performing modelOptimizing for fairness with Bayesian hyperparameter tuning and custom metricsDesigning a custom metricRunning Bayesian hyperparameter tuningFitting and evaluating a model with the best parametersExamining racial bias through feature importanceImplementing model constraintsConstraints for XGBoostSetting regularization and constraint parametersTraining and evaluating the constrained modelExamining constraintsConstraints for TensorFlow LatticeInitializing the model and Lattice inputsBuilding a Keras model with TensorFlow Lattice layersTraining and evaluating the modelMission accomplishedSummaryDataset sourcesFurther reading
Adversarial Robustness
Technical requirementsThe missionThe approachThe preparationsLoading the librariesUnderstanding and preparing the dataLoading the CNN base modelAssessing the CNN base classifierLearning about evasion attacksFast gradient sign method attackCarlini and Wagner infinity norm attackTargeted adversarial patch attackDefending against targeted attacks with preprocessingShielding against any evasion attack by adversarial training of a robust classifierEvaluating adversarial robustnessComparing model robustness with attack strengthMission accomplishedSummaryDataset sourcesFurther reading
What’s Next for Machine Learning Interpretability?
Understanding the current landscape of ML interpretabilityTying everything together!Current trendsSpeculating on the future of ML interpretabilityA new vision for MLA multidisciplinary approachAdequate standardizationEnforcing regulationSeamless machine learning automation with built-in interpretationTighter integration with MLOps engineersSummaryFurther reading
Other Books You May Enjoy
Index

Content preview from Interpretable Machine Learning with Python - Second Edition

13 Adversarial Robustness

Machine learning interpretation has many concerns, ranging from knowledge discovery to high-stakes ones with tangible ethical implications, like the fairness issues examined in the last two chapters. In this chapter, we will direct our attention to concerns involving reliability, safety, and security.

As we realized using the contrastive explanation method in Chapter 7, Visualizing Convolutional Neural Networks, we can easily trick an image classifier into making embarrassingly false predictions. This ability can have serious ramifications. For instance, a perpetrator can place a black sticker on a yield sign, and while most drivers would still recognize this as a yield sign, a self-driving car may no longer recognize ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

O’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.

Julian F.

Head of Cybersecurity

I wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.

Addison B.

Field Engineer

I’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.

Amir M.

Data Platform Tech Lead

I'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.

Mark W.

Embedded Software Engineer

Machine Learning for Time-Series with Python

Publisher Resources

ISBN: 9781803235424

Cloud Computing

Data Engineering

Data Science

AI & ML

Programming Languages

Software Architecture

IT/Ops

Security

Design

Business

Soft Skills

Interpretable Machine Learning with Python - Second Edition

by Serg Masís

13

Adversarial Robustness

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.