© The Author(s), under exclusive license to APress Media, LLC, part of Springer Nature 2023
M. Hu, The Art of Reinforcement Learning, https://doi.org/10.1007/978-1-4842-9606-6_2

2. Markov Decision Processes

Michael Hu
Shanghai, China

Markov decision processes (MDPs) offer a powerful framework for tackling sequential decision-making problems in reinforcement learning. Their applications span various domains, including robotics, finance, and optimal control.

In this chapter, we provide an overview of the key components of MDPs and demonstrate how to formulate a basic reinforcement learning problem within the MDP framework. We delve into the concepts of policy and value functions, examine the Bellman equations, ...
