January 2019
Intermediate to advanced
386 pages
11h 13m
English
We'll start with policy evaluation, or how to compute the state-value function,
, given a specific policy, π. This task is also known as prediction. As a reminder, we'll assume that the state-value function is a table. We'll implement policy evaluation using the state-value Bellman equation we defined in the Bellman equations section. Let's start: