Pages that link to "Item:Q4337732"
From MaRDI portal
The following pages link to Asymptotically Efficient Adaptive Choice of Control Laws inControlled Markov Chains (Q4337732):
Displaying 7 items.
- The multi-armed bandit problem: an efficient nonparametric solution (Q2176624) (← links)
- Adaptive control design under structured model information limitation: a cost-biased maximum-likelihood approach (Q2258149) (← links)
- Adaptive policies for perimeter surveillance problems (Q2286935) (← links)
- Optimal strategies for a class of sequential control problems with precedence relations (Q2456018) (← links)
- Learning the distribution with largest mean: two bandit frameworks (Q4606431) (← links)
- Learning to Optimize via Information-Directed Sampling (Q4969321) (← links)
- Sequential Generalized Likelihood Ratios and Adaptive Treatment Allocation for Optimal Sequential Selection (Q5478885) (← links)