Regret Analysis of Stochastic and Nonstochastic Multi-armed Bandit Problems (Q5396763): Difference between revisions

From MaRDI portal
Created claim: Wikidata QID (P12): Q59538563, #quickstatements; #temporary_batch_1712186161777
Created claim: DBLP publication ID (P1635): journals/ftml/BubeckC12, #quickstatements; #temporary_batch_1732530249250
 
Property / arXiv ID: 1204.5721 / rank: Normal rank
Property / DBLP publication ID: journals/ftml/BubeckC12 / rank: Normal rank

Latest revision as of 11:27, 25 November 2024

scientific article; zbMATH DE number 6254309
Language: English
Label: Regret Analysis of Stochastic and Nonstochastic Multi-armed Bandit Problems
Description: scientific article; zbMATH DE number 6254309

    Statements

    Regret Analysis of Stochastic and Nonstochastic Multi-armed Bandit Problems (English)
    3 February 2014
    learning and statistical methods
    game-theoretic learning
    online learning
    optimization
    reinforcement learning
