Based on the equation that expresses the Q-value for a state-action pair (s_t, a_t) in terms of the current reward r_t and the discounted maximum Q-value over the next state-action pair (s_{t+1}, a_{t+1}), the logical strategy would be to train the network on each transition (s, a, r, s') as soon as it is observed, using the current reward plus the discounted maximum predicted Q-value for the next state s' as the target. It turns out that this tends to drive the network into a local minimum. The reason is that consecutive training samples are very similar to one another.
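Written out, the relation referred to above is the standard Q-learning target, with \gamma denoting the discount factor:

    Q(s_t, a_t) = r_t + \gamma \max_{a_{t+1}} Q(s_{t+1}, a_{t+1})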
To counter this, during game play we collect all of the previous moves (s, a, r, s') into a large fixed-size queue called the replay memory. The replay memory represents the experience of the network. When training the network, we generate random batches of these stored transitions as our training data, which breaks the correlation between consecutive samples.
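As a minimal sketch of this idea in Python (an illustration, not the book's exact implementation; class and method names here are made up for the example), a replay memory can be built from a fixed-size deque, with random sampling used to produce training batches:

    import random
    from collections import deque

    class ReplayMemory:
        """Fixed-size buffer of (state, action, reward, next_state, done) transitions."""

        def __init__(self, max_size=50000):
            # Oldest transitions are discarded automatically once the queue is full.
            self.buffer = deque(maxlen=max_size)

        def remember(self, state, action, reward, next_state, done):
            # Store one transition observed during game play.
            self.buffer.append((state, action, reward, next_state, done))

        def sample(self, batch_size=32):
            # Draw a random mini-batch; random sampling breaks the strong
            # correlation between consecutive game frames.
            return random.sample(self.buffer, min(batch_size, len(self.buffer)))

Each training step then builds its Q-value targets from such a randomly sampled batch rather than from the most recent transition alone.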