In practice, we use an additional temperature parameter (τ), which is annealed over time. This parameter controls the spread of the softmax distribution: at the start of training, when τ is high, all actions are considered almost equally, and by the end of training, when τ is low, the probability mass concentrates on the actions with the highest Q-values, so the policy becomes nearly greedy.
In mathematical terms, the policy can be written as shown in the following formula:

π(a|s) = exp(Q(s, a) / τ) / Σ_a' exp(Q(s, a') / τ)

Here, Q(s, a) is the estimated action value and τ is the temperature parameter.
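To see the effect of annealing τ, consider a simple numerical example (the Q-values below are our own illustrative numbers, not taken from the project):

import numpy as np

q = np.array([1.0, 2.0, 3.0])

# High temperature at the start of training: the distribution is almost uniform.
p_start = np.exp(q / 100.) / np.sum(np.exp(q / 100.))   # roughly [0.33, 0.33, 0.34]

# Low temperature at the end of training: almost all probability mass
# sits on the highest-valued action, so the policy is nearly greedy.
p_end = np.exp(q / 0.1) / np.sum(np.exp(q / 0.1))        # roughly [0.0, 0.0, 1.0]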
The following code shows how this policy is initialized:
class BoltzmannQPolicy(Policy):
    """Implement the Boltzmann Q Policy"""

    def __init__(self, tau=1., clip=(-500., 500.)):
        super(BoltzmannQPolicy, self).__init__()
        self.tau = tau    # temperature of the softmax distribution
        self.clip = clip  # clipping range applied to the scaled Q-values to avoid overflow
    ...
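The tau and clip values stored here are used when the policy converts a vector of Q-values into an action. The following sketch is our own illustration of that step, consistent with the formula above (the helper name boltzmann_select_action is ours and is not part of the library):

import numpy as np

def boltzmann_select_action(q_values, tau=1., clip=(-500., 500.)):
    """Sample an action from the Boltzmann (softmax) distribution over Q-values."""
    q_values = np.asarray(q_values, dtype='float64')
    # Clip the scaled Q-values so that np.exp does not overflow.
    exp_values = np.exp(np.clip(q_values / tau, clip[0], clip[1]))
    probs = exp_values / np.sum(exp_values)
    # Higher-valued actions are more likely, but every action keeps some probability.
    return np.random.choice(len(q_values), p=probs)

action = boltzmann_select_action([1.0, 2.0, 3.0], tau=1.)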