Mechanisms with learning for stochastic multi-armed bandit problems

Publication:2520139