Dynamic consistency for stochastic optimal control problems
From MaRDI portal
Abstract: For a sequence of dynamic optimization problems, we aim at discussing a notion of consistency over time. This notion can be informally introduced as follows. At the very first time step , the decision maker formulates an optimization problem that yields optimal decision rules for all the forthcoming time step ; at the next time step , he is able to formulate a new optimization problem starting at time that yields a new sequence of optimal decision rules. This process can be continued until final time is reached. A family of optimization problems formulated in this way is said to be time consistent if the optimal strategies obtained when solving the original problem remain optimal for all subsequent problems. The notion of time consistency, well-known in the field of Economics, has been recently introduced in the context of risk measures, notably by Artzner et al. (2007) and studied in the Stochastic Programming framework by Shapiro (2009) and for Markov Decision Processes (MDP) by Ruszczynski (2009). We here link this notion with the concept of "state variable" in MDP, and show that a significant class of dynamic optimization problems are dynamically consistent, provided that an adequate state variable is chosen.
Recommendations
- Building up time-consistency for risk measures and dynamic optimization
- On a time consistency concept in risk averse multistage stochastic programming
- Time-inconsistent optimal control problems
- Time-inconsistent optimal control problems and related issues
- Dynamic approaches for some time-inconsistent optimization problems
Cites work
- scientific article; zbMATH DE number 3889341 (Why is no real title available?)
- scientific article; zbMATH DE number 3126094 (Why is no real title available?)
- scientific article; zbMATH DE number 772850 (Why is no real title available?)
- A standard form for sequential stochastic control
- Changing Tastes and Coherent Dynamic Choice
- Coherent multiperiod risk adjusted values and Bellman's principle
- Conditional and dynamic convex risk measures
- Convexity of chance constraints with independent random variables
- Dynamic coherent risk measures
- Dynamic monetary risk measures for bounded discrete-time processes
- On Information Structures, Feedback and Causality
- On a time consistency concept in risk averse multistage stochastic programming
- On the Existence of a Consistent Course of Action when Tastes are Changing
- On the connectedness of probabilistic constraint sets
- Richard Bellman on the Birth of Dynamic Programming
- Risk-averse dynamic programming for Markov decision processes
- Temporal Resolution of Uncertainty and Dynamic Choice Theory
- Variational Analysis
Cited in
(20)- A combined SDDP/Benders decomposition approach with a risk-averse surface concept for reservoir operation in long term power generation planning
- Effective scenarios in multistage distributionally robust optimization with a focus on total variation distance
- Risk management for forestry planning under uncertainty in demand and prices
- Stable Optimal Control and Semicontractive Dynamic Programming
- An application of control theory for imperfect production problem with carbon emission investment policy in interval environment
- scientific article; zbMATH DE number 7733443 (Why is no real title available?)
- Decomposability and time consistency of risk averse multistage programs
- Time (in)consistency of multistage distributionally robust inventory models with moment constraints
- Time Consistency for Multistage Stochastic Optimization Problems under Constraints in Expectation
- Structure of risk-averse multistage stochastic programs
- Dynamic risked equilibrium
- Building up time-consistency for risk measures and dynamic optimization
- Risk aversion in multistage stochastic programming: a modeling and algorithmic perspective
- Time-inconsistent multistage stochastic programs: martingale bounds
- The nested Sinkhorn divergence to learn the nested distance
- A survey of time consistency of dynamic risk measures and dynamic performance measures in discrete time: LM-measure perspective
- Time-consistent decisions and temporal decomposition of coherent risk functionals
- Minimax and risk averse multistage stochastic programming
- Time consistency of dynamic risk measures
- Controlled Markov decision processes with AVaR criteria for unbounded costs
This page was built for publication: Dynamic consistency for stochastic optimal control problems
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q1931661)