October 2019
Intermediate to advanced
366 pages
12h 4m
English
REINFORCE and Actor-Critic are very intuitive methods that work well on small to medium-sized RL tasks. However, they present some problems that need to be addressed so that we can adapt policy gradient algorithms so that they work on much larger and complex tasks. The main problems are as follows:
Read now
Unlock full access