The Continuum-Armed Bandit Problem

From MaRDI portal

Publication:4862444

Jump to:navigation, search

DOI10.1137/S0363012992237273MaRDI QIDQ4862444zbMATH OpenFDO

Authors Rajeev Agrawal

Publication date 8 February 1996

Published in SIAM Journal on Control and Optimization (Search for Journal in Brave)

zbMATH Keywords

adaptive control multiarmed bandit problem learning loss

Mathematics Subject Classification ID

Asymptotic properties of nonparametric inference (62G20) Sequential statistical design (62L05) Stochastic learning and adaptive control (93E35)

Recommendations

Cited in

(36)

This page was built for publication: The Continuum-Armed Bandit Problem

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q4862444)

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:4862444&oldid=19214644"