Optimal Exploration–Exploitation in a Multi-armed Bandit Problem with Non-stationary Rewards (Q5113912): Difference between revisions

From MaRDI portal
ReferenceBot
Changed an Item
Normalize DOI.
 
(One intermediate revision by one other user not shown)
Property / DOI: 10.1287/stsy.2019.0033 / rank: Normal rank
Property / Wikidata QID: Q126855665 / rank: Normal rank
Property / DOI: 10.1287/STSY.2019.0033 / rank: Normal rank

Latest revision as of 16:58, 30 December 2024

scientific article; zbMATH DE number 7213023
Language: English
Label: Optimal Exploration–Exploitation in a Multi-armed Bandit Problem with Non-stationary Rewards
Description: scientific article; zbMATH DE number 7213023

    Statements

    Optimal Exploration–Exploitation in a Multi-armed Bandit Problem with Non-stationary Rewards (English)
    18 June 2020
    multi-armed bandit
    exploration/exploitation
    nonstationary
    dynamic oracle
    minimax regret
    dynamic regret

    Identifiers