Before learning different algorithms, let's get familiar with RL terminology. We will use two running examples: an agent finding a route through a maze and an agent steering the wheel of a Self-Driving Car (SDC). Both are shown in the following diagram:
The common RL terms are as follows:
- States s: A state is a token (or representation) that describes the situation the environment is currently in; the set of states covers every situation the environment can be in. States can be continuous or discrete. For example, in the case of an agent finding a path through ...
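
To make the discrete/continuous distinction concrete, here is a minimal Python sketch. It rests on assumptions not stated in the text: the maze is modelled as a small grid whose state is the agent's cell, and the SDC state is reduced to a steering angle and a speed. The names `MazeState`, `all_maze_states`, and `CarState` are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from typing import List, Tuple

# Assumed discrete state: the agent's (row, column) cell in a grid maze.
MazeState = Tuple[int, int]


def all_maze_states(n_rows: int, n_cols: int) -> List[MazeState]:
    """Enumerate every cell; a discrete state space is finite and can be listed."""
    return [(r, c) for r in range(n_rows) for c in range(n_cols)]


@dataclass(frozen=True)
class CarState:
    """Assumed continuous state: real-valued fields, so states cannot be enumerated."""
    steering_angle_rad: float
    speed_m_per_s: float


maze_states = all_maze_states(3, 4)   # 3 x 4 grid, chosen arbitrarily
car_state = CarState(steering_angle_rad=0.05, speed_m_per_s=12.5)

print(len(maze_states))  # 12 cells in a 3 x 4 grid
print(car_state)
```

A discrete state space like the grid can be used directly as a lookup-table index, whereas a continuous one typically has to be discretized or fed to a function approximator.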