book

AI Fairness

by Trisha Mahoney, Kush R. Varshney, Michael Hind

April 2020

Intermediate to advanced

34 pages

43m

English

O'Reilly Media, Inc.

Read now

Unlock full access

Are Human Decisions Less Biased Than Automated Ones?AI Fairness Is Becoming Increasingly CriticalDefining FairnessWhere Does Bias Come From?Bias and Machine LearningCan’t I Just Remove Protected Attributes?Conclusion
Tools and TerminologyTerminologyWhich Metrics Should You Use?Individual Versus Group Fairness MetricsWorldviews and MetricsDataset ClassTransparency in Bias MetricsExplainer ClassAI FactSheets
Most Bias Starts with Your DataPre-Processing AlgorithmsIn-Processing AlgorithmsPost-Processing AlgorithmsContinuous Pipeline Measurement
Step 1: Import StatementsStep 2: Load Dataset, Specify Protected Attribute, and Split Dataset into Train and TestStep 3: Compute Fairness Metric on Original Training DatasetStep 4: Mitigate Bias by Transforming the Original Dataset Step 5: Compute Fairness Metric on Transformed Dataset
The Future of Fairness in AI

Content preview from AI Fairness

Chapter 2. Algorithms for Bias Mitigation

We can measure data and model fairness at different points in the machine learning pipeline. In this chapter, we look at the pre-processing, in-processing, and post-processing categories of bias mitigation algorithms.

Most Bias Starts with Your Data

AIF360’s bias mitigation algorithms are categorized based on where in the machine learning pipeline they are deployed, as illustrated in Figure 2-1. As a general guideline, you can use its pre-processing algorithms if you can modify the training data. You can use in-processing algorithms if you can change the learning procedure for a machine learning model. If you need to treat the learned model as a black box and cannot modify the training data or learning algorithm, you will need to use the post-processing algorithms.

Pre-Processing Algorithms

Pre-processing is the optimal time to mitigate bias given that most bias is intrinsic to the data. With pre-processing algorithms, you attempt to reduce bias by manipulating the training data before training the algorithm. Although this is conceptually simple, there are two key issues to consider. First, data can be biased in complex ways, so it is difficult for an algorithm to translate one dataset to a new dataset which is both accurate and unbiased. Second, there can be legal issues involved: ...