On the Optimal Reward Function of the Continuous Time Multiarmed Bandit Problem


Publication:3200906


DOI 10.1137/0328005, MaRDI QID Q3200906

Authors José-Luis Menaldi, M. Robin

Publication date 1990

Published in SIAM Journal on Control and Optimization

Full work available at URL https://digitalcommons.wayne.edu/mathfrp/35

zbMATH Keywords

multi-armed bandit problem; switching problems; optimal reward function; stopping problems; Markov-Feller processes

Mathematics Subject Classification ID

Dynamic programming (90C39); Continuous-time Markov processes on general state spaces (60J25); Markov and semi-Markov decision processes (90C40); Optimal stochastic control (93E20)


Cited in (13)
