Multi-armed bandits with discount factor near one: The Bernoulli case (Q1161450)
From MaRDI portal
Revision as of 04:42, 31 January 2024
Language | Label | Description | Also known as |
---|---|---|---|
English | Multi-armed bandits with discount factor near one: The Bernoulli case | scientific article | |
Statements
Multi-armed bandits with discount factor near one: The Bernoulli case (English)
1981
Bernoulli bandit process
limit rule
play-the-winner rule
least failures rule
discount optimality
expected average reward optimality
multi-armed bandit
optimal arm pulling strategy
infinite sequence of Bernoulli random variables
Gittins index
asymptotic bounds