Estimation and control in Markov chains
From MaRDI portal
Publication:4766345
DOI10.2307/1426206zbMath0281.60070WikidataQ100640305 ScholiaQ100640305MaRDI QIDQ4766345
Publication date: 1974
Published in: Advances in Applied Probability (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.2307/1426206
60J10: Markov chains (discrete-time Markov processes on discrete state spaces)
60J99: Markov processes
Related Items
Strong 0-discount optimal policies in a Markov decision process with a Borel state space, Nonparametric estimation and adaptive control in a class of finite Markov decision chains, Ergodic and adaptive control of nearest-neighbor motions, Estimation and control in multichain processes, Computationally efficient algorithms for on-line optimization of Markov decision processes, On the Milito-Cruz adaptive control scheme for Markov chains, Statistical inference for a finite optimal stopping problem with unknown transition probabilities, Sample complexity for Markov chain self-tuner, Notes on average Markov decision processes with a minimum-variance criterion, Central limit theorem for the estimator of the value of an optimal stopping problem, Unnamed Item, Unnamed Item, Adaptive policy-iteration and policy-value-iteration for discounted Markov decision processes, Unnamed Item