Integrated online learning and adaptive control in queueing systems with uncertain payoffs
From MaRDI portal
Publication:5080672
Cites work
- scientific article; zbMATH DE number 4087408
- Adaptive matching for expert systems with uncertain task types
- Asymptotically efficient adaptive allocation rules
- Decentralized Learning for Multiplayer Multiarmed Bandits
- Finite-time analysis of the multiarmed bandit problem
- Multi-armed bandit allocation indices. With a foreword by Peter Whittle.
- On the Connection-Level Stability of Congestion-Controlled Communication Networks
- Online Advertisement, Optimization and Stochastic Networks
- Open bandit processes and optimal scheduling of queueing networks
- Stability of queueing networks and scheduling policies
- Stability properties of constrained queueing systems and scheduling policies for maximum throughput in multihop radio networks
- The Nonstochastic Multiarmed Bandit Problem
Cited in (3)