Regret bounds for restless Markov bandits (Q465253): Difference between revisions

From MaRDI portal
Property / Mathematics Subject Classification ID: 60G40 / rank: Normal rank
Property / Mathematics Subject Classification ID: 90C40 / rank: Normal rank
Property / Mathematics Subject Classification ID: 91A60 / rank: Normal rank
Property / zbMATH DE Number: 6362896 / rank: Normal rank
Property / zbMATH Keywords: restless bandits / rank: Normal rank
Property / zbMATH Keywords: Markov decision processes / rank: Normal rank
Property / zbMATH Keywords: regret / rank: Normal rank
Property / MaRDI profile type: MaRDI publication profile / rank: Normal rank
Property / OpenAlex ID: W2178643644 / rank: Normal rank
Property / arXiv ID: 1209.2693 / rank: Normal rank
Property / cites work: Asymptotically efficient adaptive allocation rules / rank: Normal rank
Property / cites work: Asymptotically efficient allocation rules for the multiarmed bandit problem with multiple plays-Part II: Markovian rewards / rank: Normal rank
Property / cites work: The Nonstochastic Multiarmed Bandit Problem / rank: Normal rank
Property / cites work: Q2896090 / rank: Normal rank
Property / cites work: Regret Analysis of Stochastic and Nonstochastic Multi-armed Bandit Problems / rank: Normal rank
Property / cites work: Q4197923 / rank: Normal rank
Property / cites work: Finite-time analysis of the multiarmed bandit problem / rank: Normal rank
Property / cites work: Q3815845 / rank: Normal rank
Property / cites work: Equivalence notions and model minimization in Markov decision processes / rank: Normal rank
Property / cites work: Q4737593 / rank: Normal rank
Property / cites work: Pseudometrics for State Aggregation in Average Reward Markov Decision Processes / rank: Normal rank
Property / cites work: Q4315289 / rank: Normal rank
Property / cites work: On Chebyshev-Type Inequalities for Primes / rank: Normal rank
Property / cites work: Threshold limits for cover times / rank: Normal rank
Property / cites work: On the possibility of learning in reactive environments with arbitrary dependence / rank: Normal rank
Latest revision as of 06:18, 9 July 2024

Language | Label | Description | Also known as
English | Regret bounds for restless Markov bandits | scientific article |

    Statements

    Regret bounds for restless Markov bandits (English)
    31 October 2014
    restless bandits
    Markov decision processes
    regret