Asymptotically efficient allocation rules for the multiarmed bandit problem with multiple plays-Part II: Markovian rewards (Q3780858)

From MaRDI portal
Revision as of 10:28, 30 July 2024 by Openalex240730090724 (talk | contribs) (Set OpenAlex properties.)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
scientific article
Language Label Description Also known as
English
Asymptotically efficient allocation rules for the multiarmed bandit problem with multiple plays-Part II: Markovian rewards
scientific article

    Statements

    Asymptotically efficient allocation rules for the multiarmed bandit problem with multiple plays-Part II: Markovian rewards (English)
    0 references
    0 references
    0 references
    0 references
    1987
    0 references
    learning scheme
    0 references
    multiarmed bandit
    0 references
    Markovian rewards
    0 references
    regret function
    0 references

    Identifiers

    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references