Basis function adaptation in temporal difference reinforcement learning (Q2485935): Difference between revisions

From MaRDI portal
Created claim: MaRDI profile type (P1460): MaRDI publication profile (Q5976449), #quickstatements; #temporary_batch_1710461151948
ReferenceBot (talk | contribs)
Changed an Item
 
(One intermediate revision by one other user not shown)
Property / full work available at URL
 
Property / full work available at URL: https://doi.org/10.1007/s10479-005-5732-z / rank
 
Normal rank
Property / OpenAlex ID
 
Property / OpenAlex ID: W1998172110 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Application of the cross-entropy method to the buffer allocation problem in a simulation-based environment / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4368722 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3151174 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4257216 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Technical update: Least-squares temporal difference learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5477859 / rank
 
Normal rank
Property / cites work
 
Property / cites work: A tutorial on the cross-entropy method / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4001920 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4422978 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4353852 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4709211 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4315289 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4709223 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4828558 / rank
 
Normal rank
Property / cites work
 
Property / cites work: The cross-entropy method for combinatorial and continuous optimization / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5477860 / rank
 
Normal rank
Property / cites work
 
Property / cites work: An analysis of temporal-difference learning with function approximation / rank
 
Normal rank
Property / cites work
 
Property / cites work: An adaptive optimal controller for discrete-time Markov environments / rank
 
Normal rank

Latest revision as of 14:55, 10 June 2024

scientific article
Language Label Description Also known as
English
Basis function adaptation in temporal difference reinforcement learning
scientific article

    Statements

    Basis function adaptation in temporal difference reinforcement learning (English)
    0 references
    0 references
    0 references
    0 references
    5 August 2005
    0 references
    0 references
    0 references
    0 references
    0 references
    reinforcement learning
    0 references
    temporal difference algorithms
    0 references
    cross entropy method
    0 references
    radial basis functions
    0 references
    0 references