Hybrid least-squares algorithms for approximate policy evaluation (Q1959511): Difference between revisions

From MaRDI portal
Property / full work available at URL: https://doi.org/10.1007/s10994-009-5128-4 / rank: Normal rank
Property / OpenAlex ID: W4236439427 / rank: Normal rank
Property / Wikidata QID: Q115146324 / rank: Normal rank
Property / cites work: Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path / rank: Normal rank
Property / cites work: Q5477859 / rank: Normal rank
Property / cites work: 10.1162/1532443041827907 / rank: Normal rank
Property / cites work: Q4737965 / rank: Normal rank
Property / cites work: Q4315289 / rank: Normal rank
Property / cites work: Generalized polynomial approximations in Markovian decision processes / rank: Normal rank

Latest revision as of 07:48, 3 July 2024

scientific article
Language: English
Label: Hybrid least-squares algorithms for approximate policy evaluation
Description: scientific article
Also known as:

    Statements

    Hybrid least-squares algorithms for approximate policy evaluation (English)
    7 October 2010
    reinforcement learning
    Markov decision processes