October 2018
Intermediate to advanced
252 pages
6h 49m
English
Sequential memory is used by the DQN agent to store various states, actions, and rewards. It has the following data structures:
In the code, these data structures are defined as shown in the following snippet:
self.actions = RingBuffer(limit)self.rewards = RingBuffer(limit)self.terminals = RingBuffer(limit)self.observations = RingBuffer(limit)