A lemma on the multiarmed bandit problem
Publication:3722290
DOI: 10.1109/TAC.1986.1104332, zbMath: 0592.90090, OpenAlex: W2120323365, Wikidata: Q125049428, Scholia: Q125049428, MaRDI QID: Q3722290
Publication date: 1986
Published in: IEEE Transactions on Automatic Control
Full work available at URL: https://doi.org/10.1109/tac.1986.1104332
Related Items (3)
Optimal control of single-server queueing networks ⋮ Multi-armed bandit processes with optimal selection of the operating times ⋮ The achievable region method in the optimal control of queueing systems; formulations, bounds and policies