Hybrid least-squares algorithms for approximate policy evaluation (Q1959511): Difference between revisions

From MaRDI portal
Property / Wikidata QID: Q115146324 (Normal rank)
Property / cites work: Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path (Normal rank)
Property / cites work: Q5477859 (Normal rank)
Property / cites work: 10.1162/1532443041827907 (Normal rank)
Property / cites work: Q4737965 (Normal rank)
Property / cites work: Q4315289 (Normal rank)
Property / cites work: Generalized polynomial approximations in Markovian decision processes (Normal rank)

Latest revision as of 07:48, 3 July 2024

Language: English
Label: Hybrid least-squares algorithms for approximate policy evaluation
Description: scientific article

    Statements

    Hybrid least-squares algorithms for approximate policy evaluation (English)
    7 October 2010
    reinforcement learning
    Markov decision processes