On nearly self-optimizing strategies for multiarmed bandit problems with controlled arms

From MaRDI portal
Publication:4879862