Universally Measurable Policies in Dynamic Programming
DOI10.1287/moor.4.1.15zbMath0412.90071OpenAlexW2018546408MaRDI QIDQ4199855
Dimitri P. Bertsekas, Steven E. Shreve
Publication date: 1979
Published in: Mathematics of Operations Research (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1287/moor.4.1.15
convergence analysisdynamic programmingMarkov decision processesanalytic setsdiscrete timeBorel spacesprogramming in abstract spacesuniversally measurable policiesexistence of epsilon-optimal policies
Minimax problems in mathematical programming (90C47) Classes of sets (Borel fields, (sigma)-rings, etc.), measurable sets, Suslin sets, analytic sets (28A05) Discrete-time control/observation systems (93C55) Dynamic programming (90C39)
Related Items (16)
This page was built for publication: Universally Measurable Policies in Dynamic Programming