Optimal adaptive controllers for unknown Markov chains
From MaRDI portal
Publication:3950406
DOI10.1109/TAC.1982.1103017zbMath0488.93036OpenAlexW2115447855MaRDI QIDQ3950406
Publication date: 1982
Published in: IEEE Transactions on Automatic Control (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1109/tac.1982.1103017
Adaptive control/observation systems (93C40) Optimal stochastic control (93E20) Stochastic systems in control theory (general) (93E03) Continuous-time Markov processes on discrete state spaces (60J27)
Related Items
An incremental off-policy search in a model-free Markov decision process using a single sample path ⋮ An optimal stopping time problem with time average cost in a bounded interval ⋮ Self-tuning control of diffusions without the identifiability condition ⋮ Adaptive control of Markov chains with local updates ⋮ Ergodic and adaptive control of nearest-neighbor motions ⋮ Stochastic $\varepsilon$-Optimal Linear Quadratic Adaptation: An Alternating Controls Policy ⋮ Ergodic control of multidimensional diffusions. II: Adaptive control ⋮ The Kumar-Becker-Lin scheme revisited ⋮ On the Milito-Cruz adaptive control scheme for Markov chains