October 2019
Intermediate to advanced
366 pages
12h 4m
English
UCB1 belongs to the UCB family, and its contribution is in the selection of the upper confidence bound, $U_t(a)$.
In UCB1, the UCB is computed by keeping track of the number of times an action, $a$, has been selected, denoted by $N_t(a)$, along with the total number of actions selected so far, $T$, as represented in the following formula:
$$U_t(a) = \sqrt{\frac{2 \log T}{N_t(a)}} \qquad (12.2)$$
The uncertainty of an action is thus related ...
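To make the bookkeeping concrete, here is a minimal Python sketch of UCB1 action selection. It assumes the standard UCB1 exploration bonus, sqrt(2 ln T / N(a)), where N(a) is the number of times action a has been selected and T is the total number of selections so far. The names `ucb1_bonus`, `select_action`, `q_values`, and `counts` are hypothetical and chosen only for illustration; they are not taken from the book's code.

```python
import math


def ucb1_bonus(total_steps, action_count):
    """Exploration bonus for one action, assuming the standard UCB1 form."""
    return math.sqrt(2.0 * math.log(total_steps) / action_count)


def select_action(q_values, counts):
    """Pick the action with the highest estimated value plus UCB1 bonus.

    q_values: estimated mean reward per action (hypothetical input).
    counts:   number of times each action has been selected so far.
    """
    # An action that was never tried has an undefined bonus (division by
    # zero), so each action is played once before the formula is applied.
    for action, n in enumerate(counts):
        if n == 0:
            return action

    total_steps = sum(counts)
    scores = [q + ucb1_bonus(total_steps, n) for q, n in zip(q_values, counts)]
    return max(range(len(scores)), key=scores.__getitem__)
```

With equal value estimates, the rarely tried action receives the larger bonus and is selected: `select_action([0.5, 0.5], [100, 1])` returns index 1. As an action's count grows relative to the total, its bonus shrinks and the choice is driven increasingly by the value estimate.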