11 Designing Policies

Now that we have learned how to model a sequential decision problem and simulate an exogenous process $W_1, \ldots, W_t, \ldots$, we return to the challenge of finding a policy that maximizes our objective function from chapter 9:

$$\max_{\pi \in \Pi} \mathbb{E} \left\{ \sum_{t=0}^{T} C_t\big(S_t, X_t^{\pi}(S_t)\big) \,\Big|\, S_0 \right\}. \tag{11.1}$$

This objective function has been the basis of our “model first, then solve” approach. But now it is time to solve. This leaves us with the question: How in the world do we search over some arbitrary class of policies?
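To make the question concrete, the sketch below estimates the objective in (11.1) by simulation for one simple, parameterized class of policies, and then searches over the parameter. Everything specific here is an assumption made for illustration and does not come from the text: the inventory setting, the order-up-to rule, the uniform demand, the prices, and the names `order_up_to`, `estimate_objective`, and `search_policies`. Note also that the contribution in this toy problem uses the realized demand $W_{t+1}$, which is a variation on the $C_t(S_t, x_t)$ written above.

```python
import random

PRICE = 5.0   # hypothetical revenue per unit sold
COST = 2.0    # hypothetical cost per unit ordered


def order_up_to(theta):
    """Return a policy X^pi(S_t) that orders enough to reach level theta."""
    def policy(inventory):
        return max(0.0, theta - inventory)
    return policy


def estimate_objective(policy, T=20, n_samples=200, seed=0):
    """Sample-average estimate of E{ sum_{t=0}^T C_t(S_t, X^pi(S_t)) | S_0 }.

    A fixed seed gives every policy the same demand sample paths,
    so comparisons between policies are less noisy.
    """
    rng = random.Random(seed)
    total = 0.0
    for _ in range(n_samples):
        inventory = 0.0                          # S_0 (assumed empty)
        for _t in range(T + 1):
            order = policy(inventory)            # decision x_t = X^pi(S_t)
            demand = rng.uniform(0.0, 10.0)      # exogenous information W_{t+1}
            sales = min(inventory + order, demand)
            total += PRICE * sales - COST * order    # contribution for period t
            inventory = inventory + order - sales    # transition to S_{t+1}
    return total / n_samples


def search_policies(thetas):
    """Search within one policy class by enumerating its parameter theta."""
    return max(thetas, key=lambda theta: estimate_objective(order_up_to(theta)))


candidates = [0.0, 2.0, 4.0, 6.0, 8.0, 10.0]
best_theta = search_policies(candidates)
print(best_theta, estimate_objective(order_up_to(best_theta)))
```

This only searches within a single, hand-picked class of policies by brute-force enumeration. It says nothing about how to choose the class itself, which is the harder part of the question.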

This difficulty is precisely why this form of the objective function is popular with mathematicians who do not care about computation, or in communities where it is already clear what type of policy is being used. However, equation (11.1) is not widely used, and we believe the reason is that there has not been a natural path to computation. In fact, entire fields have emerged that focus on particular classes of policies.

In this chapter, we address the problem of searching over policies in a general way. Our approach is quite practical in that we organize our search using classes of policies that are widely used either in practice or in the research literature. Instead of focusing on a particular hammer looking for a nail, we cover all four classes of policies, with the knowledge that when you settle on an approach, it will come from one of the four classes, or possibly a hybrid of two (or more).

We start by clarifying one area ...
