October 2019
Intermediate to advanced
366 pages
12h 4m
English
A Markov decision process (MDP) expresses the problem of sequential decision-making, where each action influences both the next state and the resulting outcome. MDPs are general and flexible enough to formalize the problem of learning to achieve a goal through interaction, which is the same problem that reinforcement learning (RL) addresses. We can therefore express and reason about RL problems in terms of MDPs.
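To make the idea of actions influencing the next state and the outcome concrete, the following is a minimal sketch of an interaction loop. Everything in it is an assumption made for illustration and does not come from the text: the two states (`low`, `high`), the two actions (`wait`, `work`), the transition probabilities, the reward values, and the helper names `step` and `run_episode`.

```python
import random

# Hypothetical toy problem: every name and number here is illustrative only.
STATES = ["low", "high"]
ACTIONS = ["wait", "work"]

# TRANSITIONS[state][action] -> list of (next_state, probability) pairs
TRANSITIONS = {
    "low": {"wait": [("low", 1.0)], "work": [("high", 0.8), ("low", 0.2)]},
    "high": {"wait": [("high", 0.5), ("low", 0.5)], "work": [("high", 1.0)]},
}

# REWARDS[state][action] -> immediate reward for taking the action in that state
REWARDS = {
    "low": {"wait": 0.0, "work": -1.0},
    "high": {"wait": 2.0, "work": 1.0},
}


def step(state, action, rng):
    """Sample the next state and return it together with the immediate reward."""
    next_states, probs = zip(*TRANSITIONS[state][action])
    next_state = rng.choices(next_states, weights=probs, k=1)[0]
    return next_state, REWARDS[state][action]


def run_episode(policy, start_state, num_steps, rng):
    """Follow the policy for num_steps and collect the trajectory and total reward."""
    state, total_reward = start_state, 0.0
    trajectory = []
    for _ in range(num_steps):
        action = policy(state)  # the decision depends only on the current state
        next_state, reward = step(state, action, rng)
        trajectory.append((state, action, reward, next_state))
        total_reward += reward
        state = next_state
    return trajectory, total_reward


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed so repeated runs sample the same path
    trajectory, total_reward = run_episode(lambda s: "work", "low", 5, rng)
    for transition in trajectory:
        print(transition)
    print("total reward:", total_reward)
```

Each loop iteration is one decision: the chosen action determines the immediate reward and the distribution from which the next state is drawn, so early choices shape everything that follows. The tables are plain dictionaries only to keep the sketch short; a larger problem would likely use arrays or a function.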
An MDP is a four-tuple (S, A, P, R):