Quality iteration methods led to a finite-state method called Q-learning, or quality learning. Q-learning applies quality iteration to a given finite-state problem to determine the best course of action for an agent. The equation we saw in the previous section can now be expressed in terms of the following quantities:
- current state
- current action
- next action ...
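
To make the roles of these terms concrete, here is a minimal sketch of one tabular Q-learning update in Python. It assumes the standard textbook form of the update, which may differ in notation from the equation in the previous section. The names `q_update`, `alpha` (learning rate), and `gamma` (discount factor), the default values, and the toy states and actions are all illustrative assumptions rather than anything defined in this text.

```python
from collections import defaultdict


def q_update(q, state, action, reward, next_state, actions, alpha=0.1, gamma=0.9):
    """Apply one tabular Q-learning update in place and return the new value.

    Assumed standard form (hypothetical notation):
    Q(s, a) <- Q(s, a) + alpha * (r + gamma * max_a' Q(s', a') - Q(s, a))
    """
    # Best value reachable from the next state, taken over every candidate next action.
    best_next = max(q[(next_state, a)] for a in actions)
    # Target the current estimate is nudged toward.
    td_target = reward + gamma * best_next
    # Move the current (state, action) estimate a fraction alpha toward the target.
    q[(state, action)] += alpha * (td_target - q[(state, action)])
    return q[(state, action)]


# Toy example: an empty Q-table, two actions, and a single observed transition.
q = defaultdict(float)            # unseen (state, action) pairs default to 0.0
actions = ["left", "right"]       # hypothetical action set
new_value = q_update(q, state=0, action="right", reward=1.0,
                     next_state=1, actions=actions)
```

The current state and current action select which table entry is updated, while the next action only appears inside the `max`, which is what makes the method learn from the best available follow-up rather than the one the agent actually takes.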