October 2019
Intermediate to advanced
366 pages
12h 4m
English
In the absence of a model, model-free (MF) algorithms run trajectories within a given policy to gain experience and to improve the agent. MF algorithms are made up of three main steps that are repeated until a good policy is created:
These three components are at the heart of this type of algorithm, but based on how each step is performed, they generate different algorithms. Value-based algorithms and policy gradient algorithms ...
Read now
Unlock full access