Bayesian optimistic Kullback-Leibler exploration (Q2425228): Difference between revisions

From MaRDI portal
Set OpenAlex properties.
ReferenceBot (talk | contribs)
Changed an Item
Property / cites work
 
Property / cites work: Exploration-exploitation tradeoff using variance estimates in multi-armed bandits / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4821526 / rank
 
Normal rank
Property / cites work
 
Property / cites work: 10.1162/153244303765208377 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q2896090 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Near-optimal reinforcement learning in polynomial time / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5305630 / rank
 
Normal rank
Property / cites work
 
Property / cites work: An analysis of model-based interval estimation for Markov decision processes / rank
 
Normal rank

Revision as of 16:33, 19 July 2024

scientific article
Language Label Description Also known as
English
Bayesian optimistic Kullback-Leibler exploration
scientific article

    Statements

    Bayesian optimistic Kullback-Leibler exploration (English)
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    26 June 2019
    0 references
    model-based Bayesian reinforcement learning
    0 references
    Bayes-adaptive Markov decision process
    0 references
    PAC-BAMDP
    0 references

    Identifiers