Skip to Main Content
Advanced Deep Learning with Keras
book

Advanced Deep Learning with Keras

by Rowel Atienza, Neeraj Verma, Valerio Maggio
October 2018
Intermediate to advanced content levelIntermediate to advanced
368 pages
9h 20m
English
Packt Publishing
Content preview from Advanced Deep Learning with Keras

Monte Carlo policy gradient (REINFORCE) method

The simplest policy gradient method is called REINFORCE [5], this is a Monte Carlo policy gradient method:

Monte Carlo policy gradient (REINFORCE) method (Equation 10.2.1)

where Rt is the return as defined in Equation 9.1.2. Rt is an unbiased sample of Monte Carlo policy gradient (REINFORCE) method in the policy gradient theorem.

Algorithm 10.2.1 summarizes the REINFORCE algorithm [2]. REINFORCE is a Monte Carlo algorithm. It does not require knowledge of the dynamics of the environment (that is, model-free). Only experience samples, , are needed to optimally tune the parameters of the policy ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Start your free trial

You might also like

Hands-On Neural Networks with Keras

Hands-On Neural Networks with Keras

Niloy Purkait
Deep Learning with Keras

Deep Learning with Keras

Antonio Gulli, Sujit Pal
Keras Deep Learning Cookbook

Keras Deep Learning Cookbook

Rajdeep Dua, Sujit Pal, Manpreet Singh Ghotra

Publisher Resources

ISBN: 9781788629416Supplemental Content