N. SanghiDeep Reinforcement Learning with Pythonhttps://doi.org/10.1007/978-1-4842-6809-4_9

9. Integrated Planning and Learning

Nimish Sanghi¹

(1)

Bangalore, India

Studying topics separately followed by learning about them together has been a recurring theme in this book. We first looked at model-based algorithms in Chapter 3. Using this setup, we knew the model dynamics of the world in which the agent was operating. The agent used the knowledge of model dynamics along with Bellman equations to first carry out the evaluation/prediction task to learn the state or state-action values. It then followed this up by improving the policy to get the optimal behavior, which was called policy improvement/policy iteration . Once we know ...

Get Deep Reinforcement Learning with Python: With PyTorch, TensorFlow and OpenAI Gym now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.

Start your free trial

Deep Reinforcement Learning with Python: With PyTorch, TensorFlow and OpenAI Gym by Nimish Sanghi

9. Integrated Planning and Learning

Don’t leave empty-handed

It’s yours, free.

Check it out now on O’Reilly