Adaptive KL-UCB based bandit algorithms for Markovian and I.I.D. settings

From MaRDI portal
Publication:6575985