8 State-dependent Problems
In chapters 5 and 7, we introduced sequential decision problems in which the state variable consisted only of the state of the algorithm (chapter 5) or the state of our belief about an unknown function (chapter 7). These problems cover a very important class of applications that involve maximizing or minimizing functions that can represent anything from complex analytical functions and black-box simulators to laboratory and field experiments.
The distinguishing feature of state-dependent problems is that the problem being optimized now depends on our state variable, where the “problem” might be the function , the expectation (e.g. the distribution of ), or the feasible region . The state variable may be changing purely exogenously (where decisions do not impact the state of the system), purely endogenously (the state variable only changes as a result of decisions), or both (which is more typical).
There is a genuinely vast range of problems where the performance metric (costs or contributions), the distributions of random variables , and/or the constraints, depend on information that is changing over time, either exogenously or as a result of decisions (or both). When information changes over time, it is captured in the state variable (or if we are counting events with ).
Examples of state variables that affect the problem itself include:
- Physical state variables, which might include inventories, the location of a vehicle ...
Get Reinforcement Learning and Stochastic Optimization now with the O’Reilly learning platform.
O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.