Chapter 2: Implementing Value-Based, Policy-Based, and Actor-Critic Deep RL Algorithms
This chapter provides a practical approach to building value-based, policy-based, and actor-critic algorithm-based reinforcement learning (RL) agents. It includes recipes for implementing value iteration-based learning agents and breaks down the implementation details of several foundational algorithms in RL into simple steps. The policy gradient-based agent and the actor-critic agent make use of the latest major version of TensorFlow 2.x to define the neural network policies.
The following recipes will be covered in this chapter:
- Building stochastic environments for training RL agents
- Building value-based (RL) agent algorithms
- Implementing temporal difference ...
Get TensorFlow 2 Reinforcement Learning Cookbook now with the O’Reilly learning platform.
O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.