COMPUTING AVERAGE OPTIMAL CONSTRAINED POLICIES IN STOCHASTIC DYNAMIC PROGRAMMING

From MaRDI portal

Publication:2713008

DOI: 10.1017/S0269964801151089, zbMath: 1087.90523, MaRDI QID: Q2713008

Author: Linn I. Sennott

Publication date: 2001

Published in: Probability in the Engineering and Informational Sciences

Full work available at URL: https://doi.org/10.1017/s0269964801151089

Mathematics Subject Classification ID

90C15: Stochastic programming

90C40: Markov and semi-Markov decision processes

Related Items

A reinforcement learning approach to call admission and call dropping control in links with variable capacity

The vanishing discount approach to constrained continuous-time controlled Markov chains
