January 2020
Intermediate to advanced
432 pages
10h 18m
English
If you recall from our very first chapter, Chapter 1, Understanding Rewards-Based Learning, we explored the primary elements of RL. We learned that RL comprises of a policy, a value function, a reward function, and, optionally, a model. We use the word model in this context to refer to a detailed plan of the environment. Going back to the last chapter again, where we used the FrozenLake environment, we had a perfect model of that environment:
Model of the FrozenLake environmentOf course, looking at problems with a fully described model in a finite MDP is all well and good for learning. However, ...
Read now
Unlock full access