Bayesian optimistic Kullback-Leibler exploration (Q2425228): Difference between revisions

From MaRDI portal
Property / cites work (each with normal rank)
    Exploration-exploitation tradeoff using variance estimates in multi-armed bandits
    Q4821526
    10.1162/153244303765208377
    Q2896090
    Near-optimal reinforcement learning in polynomial time
    Q5305630
    An analysis of model-based interval estimation for Markov decision processes
Property / Wikidata QID (normal rank)
    Q128709924

Latest revision as of 02:16, 8 October 2024

Language: English
Label: Bayesian optimistic Kullback-Leibler exploration
Description: scientific article

    Statements

    Bayesian optimistic Kullback-Leibler exploration (English)
    26 June 2019
    model-based Bayesian reinforcement learning
    Bayes-adaptive Markov decision process
    PAC-BAMDP

    Identifiers