book

Keras Deep Learning Cookbook

by Rajdeep Dua, Sujit Pal, Manpreet Singh Ghotra

October 2018

Intermediate to advanced

252 pages

6h 49m

English

Packt Publishing

Read now

Unlock full access

Content preview from Keras Deep Learning Cookbook

Dueling DQN to play Cartpole

In this section, we will look at a modification of the original DQN network, called the Dueling DQN network, the network architecture. It explicitly separates the representation of state values and (state-dependent) action advantages. The dueling architecture consists of two streams that represent the value and advantage functions while sharing a common convolutional feature learning module.

The two streams are combined via an aggregating layer to produce an estimate of the state-action value function Q, as shown in the following diagram: