Multi-armed bandits with switching penalties
DOI10.1109/9.486316zbMath0847.90137OpenAlexW2168709337MaRDI QIDQ4884105
Manjari Asawa, Demosthenis Teneketzis
Publication date: 8 July 1996
Published in: IEEE Transactions on Automatic Control (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1109/9.486316
Gittins indexmulti-armed bandit problemoptimal scheduling policiesoptimal allocation policyscheduling of parallel queuesswitching penalties
Deterministic scheduling theory in operations research (90B35) Queueing theory (aspects of probability theory) (60K25) Queues and service in operations research (90B22) Optimal stochastic control (93E20) Markov and semi-Markov decision processes (90C40) Applications of queueing theory (congestion, allocation, storage, traffic, etc.) (60K30)
Related Items
This page was built for publication: Multi-armed bandits with switching penalties