On Optimality of Myopic Policy for Restless Multi-Armed Bandit Problem: An Axiomatic Approach
DOI: 10.1109/TSP.2011.2170684
zbMATH Open: 1391.62009
arXiv: 1205.5375
OpenAlex: W1989239951
MaRDI QID: Q4573448
FDO: Q4573448
Authors: Kehao Wang, Lin Chen
Publication date: 18 July 2018
Published in: IEEE Transactions on Signal Processing
Full work available at URL: https://arxiv.org/abs/1205.5375
Applications of Markov chains and discrete-time Markov processes on general state spaces (social mobility, learning theory, industrial processes, etc.) (60J20)
Bayesian problems; characterization of Bayes procedures (62C10)