The following pseudo code shows the DDPG algorithm:
Initialize replay memoryto capacity
;Initialize the critic network
and actor network
with random weights
and ;Initialize the target networks and with weights and ;Repeat for ...
The following pseudo code shows the DDPG algorithm:
Initialize replay memoryto capacity
;Initialize the critic network
and actor network
with random weights
and ;Initialize the target networks and with weights and ;Repeat for ...
Get Python Reinforcement Learning Projects now with O’Reilly online learning.
O’Reilly members experience live online training, plus books, videos, and digital content from 200+ publishers.