An index-based deterministic convergent optimal algorithm for constrained multi-armed bandit problems

From MaRDI portal
Publication:2665165