Pages that link to "Item:Q4974829"
From MaRDI portal
The following pages link to A Structured Multiarmed Bandit Problem and the Greedy Policy (Q4974829):
Displaying 6 items.
- Response-adaptive designs for clinical trials: simultaneous learning from multiple patients (Q320737) (← links)
- Bayesian policy reuse (Q1689554) (← links)
- Multi-objective multi-armed bandit with lexicographically ordered and satisficing objectives (Q2051318) (← links)
- Learning in Combinatorial Optimization: What and How to Explore (Q5144784) (← links)
- A linear response bandit problem (Q5168867) (← links)
- A Bayesian two-armed bandit model (Q6574583) (← links)