February 2023
Intermediate to advanced
248 pages
7h 23m
English

Feedback loop caused by refitting on logged samples. If the logged samples are biased, the fit will produce a biased model. If the model is biased, the policy will make biased decisions. The logs record samples of those biased decisions, and the bias is perpetuated.

A two-dimensional surrogate function, an interpolation of markout_profit_2D over settings of threshold and order_size. The filled dots mark the parameter settings where markout profit was measured in the experiment. In this figure, there are many more estimates ...