Pages that link to "Item:Q415624"
From MaRDI portal
The following pages link to An asymptotically optimal policy for finite support models in the multiarmed bandit problem (Q415624):
Displaying 4 items.
- Infomax strategies for an optimal balance between exploration and exploitation (Q310029)
- Kullback-Leibler upper confidence bounds for optimal sequential allocation (Q366995)
- A perpetual search for talents across overlapping generations: a learning process (Q898767)
- On Bayesian index policies for sequential resource allocation (Q1750289)