Proposition 9.3.5.7. The convergence time to be close to the risk-sensitive payoff with error tolerance η is at most
where (Gj)-1 is the inverse of the mapping and , error = maxj errorj. In particular for gj = 1 (almost active case), the convergence time is of order of .
Proof. We verify that the solution of the ODE is . From the assumptions, the primitive function Gj is a bijection and . The last assertion is obtained for gj = 1. This completes the proof.
As promised in our introduction of this Chapter, ...
Get Distributed Strategic Learning for Wireless Engineers now with the O’Reilly learning platform.
O’Reilly members experience live online training, plus books, videos, and digital content from nearly 200 publishers.