October 2019
Intermediate to advanced
366 pages
12h 4m
English
The key innovations brought by DQN involve a replay buffer to get over the data correlation drawback, and a separate target network to get over the non-stationarity problem.
Read now
Unlock full access