Asymptotically efficient allocation rules for the multiarmed bandit problem with multiple plays-Part II: Markovian rewards (Q3780858): Difference between revisions

From MaRDI portal
RedirectionBot (talk | contribs)
Removed claim: author (P16): Item:Q1906866
Set OpenAlex properties.
 
(2 intermediate revisions by 2 users not shown)
Property / author
 
Property / author: Pravin P. Varaiya / rank
 
Normal rank
Property / MaRDI profile type
 
Property / MaRDI profile type: MaRDI publication profile / rank
 
Normal rank
Property / full work available at URL
 
Property / full work available at URL: https://doi.org/10.1109/tac.1987.1104485 / rank
 
Normal rank
Property / OpenAlex ID
 
Property / OpenAlex ID: W4232620022 / rank
 
Normal rank

Latest revision as of 09:28, 30 July 2024

scientific article
Language Label Description Also known as
English
Asymptotically efficient allocation rules for the multiarmed bandit problem with multiple plays-Part II: Markovian rewards
scientific article

    Statements

    Asymptotically efficient allocation rules for the multiarmed bandit problem with multiple plays-Part II: Markovian rewards (English)
    0 references
    0 references
    0 references
    0 references
    1987
    0 references
    learning scheme
    0 references
    multiarmed bandit
    0 references
    Markovian rewards
    0 references
    regret function
    0 references

    Identifiers

    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references