December 2018
Solving complex decision problems usually requires a simplified model that isolates the key aspects of the problem. The following diagram highlights the salient features of a reinforcement learning (RL) problem:

These typically include the following:
In addition, the environment emits a reward signal that reflects the new state resulting from the agent's action. At its core, the agent usually learns a value function that shapes its judgment over actions. The agent has an objective function ...
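The interaction described above can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration rather than something taken from the text: the `ChainEnv` toy environment, the tabular action-value table `q`, the epsilon-greedy exploration, and the choice of Q-learning as the update rule (one common way to learn a value function, with a discount factor `gamma` chosen arbitrarily here).

```python
import random


class ChainEnv:
    """Hypothetical toy environment: states 0..4 on a line, goal at state 4."""

    N_STATES = 5
    ACTIONS = (0, 1)  # 0 = move left, 1 = move right

    def reset(self):
        self.state = 0
        return self.state

    def step(self, action):
        # The action moves the agent; the environment returns the new state,
        # a reward that reflects that state, and whether the episode ended.
        move = 1 if action == 1 else -1
        self.state = min(max(self.state + move, 0), self.N_STATES - 1)
        done = self.state == self.N_STATES - 1
        reward = 1.0 if done else 0.0
        return self.state, reward, done


def greedy_action(q, state, rng):
    # Pick the highest-valued action, breaking ties at random.
    best = max(q[state])
    return rng.choice([a for a in ChainEnv.ACTIONS if q[state][a] == best])


def train(episodes=500, alpha=0.5, gamma=0.9, epsilon=0.2, max_steps=200, seed=0):
    """Learn a tabular action-value function with Q-learning (assumed here)."""
    rng = random.Random(seed)
    env = ChainEnv()
    q = [[0.0, 0.0] for _ in range(ChainEnv.N_STATES)]
    for _ in range(episodes):
        state = env.reset()
        for _ in range(max_steps):
            # Explore with probability epsilon, otherwise act on current values.
            if rng.random() < epsilon:
                action = rng.choice(ChainEnv.ACTIONS)
            else:
                action = greedy_action(q, state, rng)
            next_state, reward, done = env.step(action)
            # Move the estimate toward reward plus discounted future value.
            target = reward if done else reward + gamma * max(q[next_state])
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
            if done:
                break
    return q
```

Calling `q = train()` returns the learned table; comparing `q[s][0]` with `q[s][1]` for a state `s` shows which action the agent currently judges to be better, which is the sense in which the value function shapes its choices.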