Proposition 9.3.5.7. The convergence time to be close to the risk-sensitive payoff with error tolerance η is at most
where (Gj)-1 is the inverse of the mapping and , error = maxj errorj. In particular for gj = 1 (almost active case), the convergence time is of order of .
Proof. We verify that the solution of the ODE is . From the assumptions, the primitive function Gj is a bijection and . The last assertion is obtained for gj = 1. This completes the proof.
As promised in our introduction of this Chapter, ...
Get Distributed Strategic Learning for Wireless Engineers now with the O’Reilly learning platform.
O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.