3 Multi-armed bandits: Maximizing business metrics while experimenting

This chapter covers

  • Defining the multi-armed bandit (MAB) problem
  • Modifying A/B testing’s randomization procedure
  • Extending epsilon-greedy to simultaneously evaluate multiple system changes
  • Evaluating system changes even more quickly with Thompson sampling

In the previous chapter, we learned how to use A/B testing to evaluate changes to the system your engineering team is building. Once the tooling is in place to run A/B tests, the team should see a steady increase in the quality of the system as new changes follow the engineering workflow: implement a change candidate, evaluate it offline, and evaluate it online with an A/B test.

As the use of A/B testing increases, you’ll ...

Get Experimentation for Engineers now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.