Optimal strategy for Bayesian two-armed bandit problem with an arched reward function

From MaRDI portal

Publication:6127344

Jump to:navigation, search

DOI10.3934/MCRF.2022057OpenAlexW4313031531MaRDI QIDQ6127344

Zeng-Jing Chen, Zhao-Ang Zhang

Publication date: 12 April 2024

Published in: Mathematical Control and Related Fields (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.3934/mcrf.2022057

zbMATH Keywords

optimal strategy dynamic programming law of large numbers sequential design myopic strategy Bayesian two-armed bandit

Mathematics Subject Classification ID

Central limit and other weak theorems (60F05) Bayesian problems; characterization of Bayes procedures (62C10) Dynamic programming (90C39) Sequential statistical design (62L05)

This page was built for publication: Optimal strategy for Bayesian two-armed bandit problem with an arched reward function

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:6127344&oldid=35581970"