Multi-armed bandits with discount factor near one: The Bernoulli case

Publication:1161450