Adaptive policies for sequential sampling under incomplete information and a cost constraint

DOI 10.1007/978-1-4614-4109-0_8; MaRDI QID Q5261007

Authors Odysseas Kanavetas, A. N. Burnetas

Publication date 1 July 2015

Published in Applications of Mathematics and Informatics in Military Science

Full work available at URL https://arxiv.org/abs/1201.4002

zbMATH Keywords

sequential design; stochastic learning and adaptive control; sampling cost constraint

Mathematics Subject Classification ID

Sequential statistical design (62L05); Stochastic learning and adaptive control (93E35)

Abstract: We consider the problem of sequential sampling from a finite number of independent statistical populations to maximize the expected infinite horizon average outcome per period, under a constraint that the expected average sampling cost does not exceed an upper bound. The outcome distributions are not known. We construct a class of consistent adaptive policies, under which the average outcome converges with probability 1 to the true value under complete information for all distributions with finite means. We also compare the rate of convergence for various policies in this class using simulation.
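
The abstract does not spell out the complete-information benchmark that the adaptive policies converge to. As a rough illustration only, the sketch below assumes it is the best stationary randomized sampling rule when the population means are known: choose sampling frequencies that maximize the average outcome while keeping the average cost within the bound. The function name `best_randomized_rule` and the parameters `means`, `costs`, and `budget` are hypothetical and do not come from the paper.

```python
from itertools import combinations


def best_randomized_rule(means, costs, budget):
    """Sketch of an assumed complete-information benchmark (not taken from the paper).

    Maximizes sum(w[i] * means[i]) subject to sum(w[i] * costs[i]) <= budget,
    sum(w) == 1 and w >= 0. With one equality and one inequality constraint,
    an optimum exists that puts weight on at most two populations, so it is
    enough to enumerate single populations and cost-binding pairs.
    Returns (value, weights), or (None, None) if no population is affordable.
    """
    n = len(means)
    best_value, best_weights = None, None

    def consider(weights):
        # Keep the candidate if it improves the average outcome.
        nonlocal best_value, best_weights
        value = sum(w * m for w, m in zip(weights, means))
        if best_value is None or value > best_value:
            best_value, best_weights = value, weights

    # Candidates that always sample one population affordable on its own.
    for i in range(n):
        if costs[i] <= budget:
            weights = [0.0] * n
            weights[i] = 1.0
            consider(weights)

    # Candidates that mix a cheap and an expensive population so that the
    # average cost equals the budget exactly.
    for i, j in combinations(range(n), 2):
        cheap, dear = (i, j) if costs[i] <= costs[j] else (j, i)
        if costs[cheap] < budget < costs[dear]:
            p = (costs[dear] - budget) / (costs[dear] - costs[cheap])
            weights = [0.0] * n
            weights[cheap] = p
            weights[dear] = 1.0 - p
            consider(weights)

    return best_value, best_weights
```

For example, with `means=[1.0, 3.0]`, `costs=[1.0, 4.0]`, and `budget=2.0`, the better population is too expensive to sample every period, so the sketch mixes the two with weights of about 2/3 and 1/3, giving an average outcome of roughly 1.67 at an average cost of exactly 2. An adaptive policy in the sense of the abstract would have to approach such a value without knowing `means` in advance.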

Cited in (3)
