April 2018
Intermediate to advanced
334 pages
10h 18m
English
The sequence of rewards plays an important role in finding the optimal policy for an MDP problem, and certain assumptions about that sequence determine how it captures the concept of delayed rewards.
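To make the idea of delayed rewards concrete, here is a minimal sketch that scores a whole reward sequence rather than a single step. It assumes the common discounted-sum convention. The discount factor `gamma`, the function name `discounted_return`, and the two example sequences are illustrative assumptions, not something stated in the text above.

```python
def discounted_return(rewards, gamma=0.9):
    """Sum a reward sequence, weighting the reward at step t by gamma**t.

    `gamma` is an assumed discount factor in [0, 1]; it is not given in the text.
    """
    total = 0.0
    # Work backwards so each step adds its reward to the discounted future sum.
    for reward in reversed(rewards):
        total = reward + gamma * total
    return total


# Two hypothetical reward sequences: a small reward now vs. a large reward later.
immediate = [1.0, 0.0, 0.0, 0.0]
delayed = [0.0, 0.0, 0.0, 10.0]

for gamma in (0.9, 0.1):
    print(gamma, discounted_return(immediate, gamma), discounted_return(delayed, gamma))
```

With a discount factor close to 1, the delayed sequence scores higher, so a policy that waits for the later reward is preferred. With a small discount factor, the immediate reward wins. Under this assumption, how the sequence is valued decides whether a delayed reward can influence the optimal policy at all.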