Q-learning

Quality iteration methods led to a finite-state method called Q-learning, or quality learning. Q-learning applies quality iteration to a given finite-state problem to determine the best course of action for an agent. The equation we saw in the previous section can now be represented as follows:

Consider the following equation:

  • current state
  • current action
  • next action ...
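
To make these terms concrete, here is a minimal sketch of tabular Q-learning in Python. It assumes the standard textbook form of the update, Q(s, a) ← Q(s, a) + α [r + γ max over a' of Q(s', a') − Q(s, a)], which may differ in notation from the equation referred to above. The names alpha (learning rate), gamma (discount factor), epsilon (exploration rate), and the small chain environment are hypothetical and chosen only for illustration.

```python
import random


def q_update(q, state, action, reward, next_state, alpha=0.1, gamma=0.9):
    """Apply one tabular Q-learning update in place and return the new value."""
    # Value of the best action available in the next state.
    best_next = max(q[next_state])
    # Target: immediate reward plus discounted best future value.
    td_target = reward + gamma * best_next
    # Move the current estimate a fraction alpha toward the target.
    q[state][action] += alpha * (td_target - q[state][action])
    return q[state][action]


def train_chain(n_states=5, episodes=500, alpha=0.1, gamma=0.9,
                epsilon=0.1, seed=0):
    """Train on a hypothetical 1-D chain (an assumption for illustration).

    Action 0 moves left, action 1 moves right. Reaching the last state
    gives a reward of 1 and ends the episode.
    """
    rng = random.Random(seed)
    # One row per state, one column per action, all estimates start at 0.
    q = [[0.0, 0.0] for _ in range(n_states)]
    for _ in range(episodes):
        state = 0
        while state != n_states - 1:
            if rng.random() < epsilon:
                # Explore: pick a random action.
                action = rng.randrange(2)
            else:
                # Exploit: pick a best-valued action, breaking ties randomly.
                best = max(q[state])
                action = rng.choice([a for a in range(2) if q[state][a] == best])
            next_state = max(0, state - 1) if action == 0 else state + 1
            reward = 1.0 if next_state == n_states - 1 else 0.0
            q_update(q, state, action, reward, next_state, alpha, gamma)
            state = next_state
    return q


if __name__ == "__main__":
    for s, row in enumerate(train_chain()):
        print(s, [round(v, 3) for v in row])
```

In this sketch, the current state and current action index the table entry being updated, while the next action enters only through the maximum taken over the next state's row. That is why the agent can learn the greedy policy while still exploring.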
