Multi-armed bandits with discount factor near one: The Bernoulli case (Q1161450): Difference between revisions

From MaRDI portal
Added link to MaRDI item.
Set OpenAlex properties.
 
(One intermediate revision by one other user not shown)
Property / MaRDI profile type
 
Property / MaRDI profile type: MaRDI publication profile / rank
 
Normal rank
Property / full work available at URL
 
Property / full work available at URL: https://doi.org/10.1214/aos/1176345578 / rank
 
Normal rank
Property / OpenAlex ID
 
Property / OpenAlex ID: W2095160246 / rank
 
Normal rank

Latest revision as of 22:49, 19 March 2024

scientific article
Language Label Description Also known as
English
Multi-armed bandits with discount factor near one: The Bernoulli case
scientific article

    Statements

    Multi-armed bandits with discount factor near one: The Bernoulli case (English)
    0 references
    0 references
    1981
    0 references
    0 references
    0 references
    0 references
    0 references
    Bernoulli bandit process
    0 references
    limit rule
    0 references
    play-the-winner rule
    0 references
    least failures rule
    0 references
    discount optimality
    0 references
    expected average reward optimality
    0 references
    multi-armed bandit
    0 references
    optimal arm pulling strategy
    0 references
    infinite sequence of Bernoulli random variables
    0 references
    Gittins index
    0 references
    asymptotic bounds
    0 references
    0 references