An analysis of model-based interval estimation for Markov decision processes (Q959899): Difference between revisions

From MaRDI portal
Set OpenAlex properties.
ReferenceBot (talk | contribs)
Changed an Item
Property / cites work
 
Property / cites work: 10.1162/153244303321897663 / rank
 
Normal rank
Property / cites work
 
Property / cites work: 10.1162/153244303765208377 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3046711 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3093383 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Bounded-parameter Markov decision processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Near-optimal reinforcement learning in polynomial time / rank
 
Normal rank
Property / cites work
 
Property / cites work: Adaptive treatment allocation and the multi-armed bandit problem / rank
 
Normal rank
Property / cites work
 
Property / cites work: Robust Control of Markov Decision Processes with Uncertain Transition Matrices / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4315289 / rank
 
Normal rank
Property / cites work
 
Property / cites work: A Simple Distribution-Free Approach to the Max k-Armed Bandit Problem / rank
 
Normal rank
Property / cites work
 
Property / cites work: A theory of the learnable / rank
 
Normal rank

Revision as of 21:47, 28 June 2024

scientific article
Language Label Description Also known as
English
An analysis of model-based interval estimation for Markov decision processes
scientific article

    Statements

    An analysis of model-based interval estimation for Markov decision processes (English)
    0 references
    0 references
    0 references
    12 December 2008
    0 references
    reinforcement learning
    0 references
    learning theory
    0 references
    Markov decision processes
    0 references

    Identifiers

    0 references
    0 references
    0 references
    0 references
    0 references
    0 references