Optimal Exploration–Exploitation in a Multi-armed Bandit Problem with Non-stationary Rewards

From MaRDI portal
Publication:5113912

DOI10.1287/stsy.2019.0033zbMath1447.93371arXiv1405.3316OpenAlexW2962821829WikidataQ126855665 ScholiaQ126855665MaRDI QIDQ5113912

Yonatan Gur, Assaf J. Zeevi, Omar Besbes

Publication date: 18 June 2020

Published in: Stochastic Systems (Search for Journal in Brave)

Full work available at URL: https://arxiv.org/abs/1405.3316




Related Items (7)


Uses Software


Cites Work


This page was built for publication: Optimal Exploration–Exploitation in a Multi-armed Bandit Problem with Non-stationary Rewards