Universally Measurable Policies in Dynamic Programming

From MaRDI portal
Publication:4199855

DOI10.1287/moor.4.1.15zbMath0412.90071OpenAlexW2018546408MaRDI QIDQ4199855

Dimitri P. Bertsekas, Steven E. Shreve

Publication date: 1979

Published in: Mathematics of Operations Research (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1287/moor.4.1.15




Related Items (16)

A Mixed Value and Policy Iteration Method for Stochastic Control with Universally Measurable PoliciesContinuous time shock markov decision processes with discounted criterionConditions for characterizing the structure of optimal strategies in infinite-horizon dynamic programsDiscounted semi-markov decision process in a semi-markov environmentAverage Cost Optimality Inequality for Markov Decision Processes with Borel Spaces and Universally Measurable PoliciesHow to stay in a set or Koenig's lemma for random pathsQuantitative model-checking of controlled discrete-time Markov processesA limited order capacity stochastic inventory model with a fixed cost for order: The discounted caseDiscrete type shock semi-markov decision processes with borel state spaceContinuous time markov decision processes with nonuniformly bounded transition rate: expected total rewardsOn measurable minimax selectorsStackelberg equilibrium in a dynamic stimulation model with complete informationOn the Minimum Pair Approach for Average Cost Markov Decision Processes with Countable Discrete Action Spaces and Strictly Unbounded CostsMixed Markov decision processes in a semi-Markov environment with discounted criterionOn structural properties of optimal average cost functions in Markov decision processes with Borel spaces and universally measurable policiesOn Convergence of Value Iteration for a Class of Total Cost Markov Decision Processes




This page was built for publication: Universally Measurable Policies in Dynamic Programming