Multi-armed bandits with discount factor near one: The Bernoulli case (Q1161450): Difference between revisions
From MaRDI portal
Added link to MaRDI item. |
Set profile property. |
||
Property / MaRDI profile type | |||
Property / MaRDI profile type: MaRDI publication profile / rank | |||
Normal rank |
Revision as of 02:25, 5 March 2024
scientific article
Language | Label | Description | Also known as |
---|---|---|---|
English | Multi-armed bandits with discount factor near one: The Bernoulli case |
scientific article |
Statements
Multi-armed bandits with discount factor near one: The Bernoulli case (English)
0 references
1981
0 references
Bernoulli bandit process
0 references
limit rule
0 references
play-the-winner rule
0 references
least failures rule
0 references
discount optimality
0 references
expected average reward optimality
0 references
multi-armed bandit
0 references
optimal arm pulling strategy
0 references
infinite sequence of Bernoulli random variables
0 references
Gittins index
0 references
asymptotic bounds
0 references