DPG algorithm

The following pseudo code shows the DDPG algorithm:

Initialize replay memory  to capacity ;Initialize the critic network  and actor network  with random weights  and ;Initialize the target networks  and  with weights  and ;Repeat for ...

Get Python Reinforcement Learning Projects now with O’Reilly online learning.

O’Reilly members experience live online training, plus books, videos, and digital content from 200+ publishers.