Practical issues in temporal difference learning (Q1812929): Difference between revisions

From MaRDI portal
Added link to MaRDI item.
ReferenceBot (talk | contribs)
Changed an Item
 
(One intermediate revision by one other user not shown)
Property / MaRDI profile type
 
Property / MaRDI profile type: MaRDI publication profile / rank
 
Normal rank
Property / cites work
 
Property / cites work: Learnability and the Vapnik-Chervonenkis dimension / rank
 
Normal rank
Property / cites work
 
Property / cites work: The convergence of \(TD(\lambda)\) for general \(\lambda\) / rank
 
Normal rank
Property / cites work
 
Property / cites work: A comparison and evaluation of three machine learning procedures as applied to the game of checkers / rank
 
Normal rank
Property / cites work
 
Property / cites work: Multilayer feedforward networks are universal approximators / rank
 
Normal rank
Property / cites work
 
Property / cites work: A pattern classification approach to evaluation function learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: A Stochastic Approximation Method / rank
 
Normal rank
Property / cites work
 
Property / cites work: Learning representations by back-propagating errors / rank
 
Normal rank
Property / cites work
 
Property / cites work: A parallel network that learns to play backgammon / rank
 
Normal rank
Property / cites work
 
Property / cites work: On the Uniform Convergence of Relative Frequencies of Events to Their Probabilities / rank
 
Normal rank
Property / cites work
 
Property / cites work: On Optimal Doubling in Backgammon / rank
 
Normal rank

Latest revision as of 16:52, 14 May 2024

scientific article
Language Label Description Also known as
English
Practical issues in temporal difference learning
scientific article

    Statements