Hybrid least-squares algorithms for approximate policy evaluation (Q1959511): Difference between revisions

From MaRDI portal
Created claim: Wikidata QID (P12): Q115146324, #quickstatements; #temporary_batch_1710884486334
ReferenceBot (talk | contribs)
Changed an Item
 
Property / cites work
 
Property / cites work: Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5477859 / rank
 
Normal rank
Property / cites work
 
Property / cites work: 10.1162/1532443041827907 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4737965 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4315289 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Generalized polynomial approximations in Markovian decision processes / rank
 
Normal rank

Latest revision as of 07:48, 3 July 2024

scientific article
Language Label Description Also known as
English
Hybrid least-squares algorithms for approximate policy evaluation
scientific article

    Statements

    Hybrid least-squares algorithms for approximate policy evaluation (English)
    0 references
    0 references
    0 references
    0 references
    7 October 2010
    0 references
    0 references
    reinforcement learning
    0 references
    Markov decision processes
    0 references
    0 references
    0 references