On the two-armed bandit problem with continuous time parameter and discounted rewards

From MaRDI portal

Publication:3786305

Jump to:navigation, search

DOI10.1080/17442508808833495zbMath0643.90096MaRDI QIDQ3786305

Alexander A. Yushkevich

Publication date: 1988

Published in: Stochastics (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1080/17442508808833495

zbMATH Keywords

continuous-time two-armed bandit; expected discounted reward; stationary optimal policy; Explicit formulae

Mathematics Subject Classification ID

90C40: Markov and semi-Markov decision processes

Related Items

On the two-armed bandit problem with non-observed Poissonian switching of arms., Average optimality in a Poissonian bandit with switching arms, Good signals gone bad: dynamic signalling with switched effort levels, Learning to disagree in a game of experimentation

Cites Work

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:3786305&oldid=17346062"