Asymptotically Efficient Adaptive Choice of Control Laws inControlled Markov Chains

From MaRDI portal

Publication:4337732

Jump to:navigation, search

DOI10.1137/S0363012994275440MaRDI QIDQ4337732zbMATH OpenOpenAlexFDO

Authors Tze Leung Lai, Todd L. Graves

Publication date 26 May 1997

Published in SIAM Journal on Control and Optimization (Search for Journal in Brave)

Full work available at URL https://doi.org/10.1137/s0363012994275440

zbMATH Keywords

certainty equivalence sequential testing adaptive control of Markov chains multiarmed bandit problem uncertainty adjustments

Mathematics Subject Classification ID

Applications of Markov chains and discrete-time Markov processes on general state spaces (social mobility, learning theory, industrial processes, etc.) (60J20) Sequential statistical analysis (62L10) Adaptive control/observation systems (93C40) Optimal stochastic control (93E20) Stochastic learning and adaptive control (93E35)

Recommendations

Cited in

(19)

This page was built for publication: Asymptotically Efficient Adaptive Choice of Control Laws inControlled Markov Chains

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q4337732)

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:4337732&oldid=18302556"