Publication:3745652
From MaRDI portal
zbMath0606.90130MaRDI QIDQ3745652
Onésimo Hernández-Lerma, Roberto S. Acosta Abreu
Publication date: 1985
successive approximation; countable state space; naive feedback controller; average reward adaptive Markov decision processes; compact feasible action sets; nonstationary value- iteration; strong scrambling condition
62M05: Markov processes: estimation; hidden Markov models
90C40: Markov and semi-Markov decision processes
Related Items
Adaptive control of Markov processes with incomplete state information and unknown parameters, A unified approach to adaptive control of average reward Markov decision processes, Recursive adaptive control of Markov decision processes with the average reward criterion, Density estimation and adaptive control of Markov processes: Average and discounted criteria, Unnamed Item, Unnamed Item