Chapter 2: Implementing Value-Based, Policy-Based, and Actor-Critic Deep RL Algorithms

This chapter takes a practical approach to building value-based, policy-based, and actor-critic reinforcement learning (RL) agents. It includes recipes for implementing value iteration-based learning agents and breaks the implementation details of several foundational RL algorithms down into simple steps. The policy gradient-based agent and the actor-critic agent use TensorFlow 2.x to define their neural network policies.

The following recipes will be covered in this chapter:

Building stochastic environments for training RL agents
Building value-based reinforcement learning (RL) agent algorithms
Implementing temporal difference ...

TensorFlow 2 Reinforcement Learning Cookbook by Praveen Palanisamy
