May 2020
Beginner to intermediate
430 pages
10h 39m
English
Reinforcement learning is a type of machine learning where the agent learns to act in the current environment by predicting a reward (or outcome) based on feedback from cumulative past reward signals. Q-learning, introduced by Christopher Watkins in the paper titled Learning from Delayed Rewards, is one of the most popular algorithms in reinforcement learning. The Q means quality—this is the value of a given action in generating a reward: