A learning algorithm for the finite-time two-armed bandit problem
Publication:3342234
DOI: 10.1109/TSMC.1984.6313253
zbMath: 0549.90092
MaRDI QID: Q3342234
Hiroshi Takeda, Mitsuo Sato, Ken-Ichi Abe
Publication date: 1984
Published in: IEEE Transactions on Systems, Man, and Cybernetics