3 Multi-armed bandits: Maximizing business metrics while experimenting

This chapter covers

Defining the multi-armed bandit (MAB) problem
Modifying A/B testing’s randomization procedure
Extending epsilon-greedy to simultaneously evaluate multiple system changes
Evaluating system changes even more quickly with Thompson sampling

In the previous chapter, we learned how to use A/B testing to evaluate changes to the system your engineering team is building. Once the tooling is in place to run A/B tests, the team should see a steady increase in the quality of the system as new changes follow the engineering workflow: implement a change candidate, evaluate it offline, and evaluate it online with an A/B test.

As the use of A/B testing increases, you’ll ...

Get Experimentation for Engineers now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.

Start your free trial

Experimentation for Engineers by David Sweet

3 Multi-armed bandits: Maximizing business metrics while experimenting

Don’t leave empty-handed

It’s yours, free.

Check it out now on O’Reilly