scientific article; zbMATH DE number 3125136
From MaRDI portal
Publication:3240573
zbMATH Open0075.14801MaRDI QIDQ3240573FDOQ3240573
Authors: Richard Bellman
Publication date: 1956
Title of this publication is not available (Why is that?)
Cited In (23)
- BANDIT STRATEGIES EVALUATED IN THE CONTEXT OF CLINICAL TRIALS IN RARE LIFE-THREATENING DISEASES
- Multi-armed bandit models for the optimal design of clinical trials: benefits and challenges
- Learning the distribution with largest mean: two bandit frameworks
- The apparent conflict between estimation and control - a survey of the two-armed bandit problem
- On the two armed bandit with one probability known
- Some problems of optimal sampling strategy
- Functional equations in the theory of dynamic programming. III
- Some statistical methods in machine intelligence research
- On Bayesian index policies for sequential resource allocation
- Four proofs of Gittins' multiarmed bandit theorem
- Herbert Robbins and sequential analysis
- A new approach to filtering and adaptive control: Stability results
- Response-adaptive designs for clinical trials: simultaneous learning from multiple patients
- A Bayesian-bandit adaptive design for \(N\)-of-1 clinical trials
- Strategic learning in teams
- Generalisations of a Bayesian decision-theoretic randomisation procedure and the impact of delayed responses
- The theory of dynamic programming
- Sequentielle Versuchspläne
- Dynamic priority allocation via restless bandit marginal productivity indices
- Title not available (Why is that?)
- Sequentielle Versuchs-Pläne
- Optimizing a Unimodal Response Function for Binary Variables
- Dynamic programming, generalized states, and switching systems
This page was built for publication:
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q3240573)