scientific article; zbMATH DE number 3867090
From MaRDI portal
Publication:3335552
zbMATH Open0544.90100MaRDI QIDQ3335552FDOQ3335552
Publication date: 1984
Title of this publication is not available (Why is that?)
infinite horizonoptimal policiessequential decision proceduremultinomial distributed rewardssearch process
Stopping times; optimal stopping problems; gambling theory (60G40) Markov and semi-Markov decision processes (90C40)
Cited In (1)
Recommendations
- Real-reward testing for probabilistic processes π π
- Ratewise-optimal non-sequential search strategies under constraints on the tests π π
- Title not available (Why is that?) π π
- Satisficing in Multi-Armed Bandit Problems π π
- Comparisons of search designs using search probabilities π π
- An experimental test of a search model under ambiguity π π
- Title not available (Why is that?) π π
- Pure exploration in multi-armed bandits problems π π
- Multi-armed bandit with sub-exponential rewards π π
This page was built for publication:
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q3335552)