Asymptotically efficient adaptive allocation rules for the multiarmed bandit problem with switching cost

From MaRDI portal
Publication:3835405