Adaptive policies for time-varying stochastic systems under discounted criterion
From MaRDI portal
Publication:1397033
DOI: 10.1007/s001860100170; zbMath: 1042.93065; OpenAlex: W1972124016; MaRDI QID: Q1397033
Nadine Hilgert, J. Adolfo Minjárez-Sosa
Publication date: 16 July 2003
Published in: Mathematical Methods of Operations Research
Full work available at URL: https://doi.org/10.1007/s001860100170
Keywords: discounted cost criterion; discrete-time stochastic systems; optimal adaptive policy; non-homogeneous Markov control processes
Mathematics Subject Classification: Discrete-time Markov processes on general state spaces (60J05); Optimal stochastic control (93E20); Stochastic learning and adaptive control (93E35)
Related Items
Unnamed Item ⋮ Markov control models with unknown random state-action-dependent discount factors ⋮ Estimation of the Optimality Deviation in Discounted Semi-Markov Control Models ⋮ Adaptive control of stochastic systems with unknown disturbance distribution: discounted criteria ⋮ Two person zero-sum semi-Markov games with unknown holding times distribution on one side: A discounted payoff criterion ⋮ Time-varying Markov decision processes with state-action-dependent discount factors and unbounded costs ⋮ Unnamed Item ⋮ Partially observable Markov decision processes with partially observable random discount factors