Publication:3783097
From MaRDI portal
zbMath0641.90087MaRDI QIDQ3783097
Publication date: 1985
stochastic dynamic programming; asymptotic discount optimality; expected total discounted reward; approximation of dynamic programs; nonstationary value- iteration; adaptive control policies; polish state and control spaces
90C15: Stochastic programming
90C39: Dynamic programming
60K99: Special processes
90C40: Markov and semi-Markov decision processes
Related Items
Adaptive policies for discrete-time stochastic control systems with unknown disturbance distribution, Continuous dependence of stochastic control models on the noise distribution, Nonparametric adaptive control of discrete-time partially observable stochastic systems, Discretization procedures for adaptive Markov control processes, Unnamed Item