Practical issues in temporal difference learning (Q1812929): Difference between revisions

From MaRDI portal
Import240304020342
Set profile property.
Created claim: DBLP publication ID (P1635): journals/ml/Tesauro92, #quickstatements; #temporary_batch_1731468600454
 
(One intermediate revision by one other user not shown)
Property / cites work: Learnability and the Vapnik-Chervonenkis dimension (Normal rank)
Property / cites work: The convergence of \(TD(\lambda)\) for general \(\lambda\) (Normal rank)
Property / cites work: A comparison and evaluation of three machine learning procedures as applied to the game of checkers (Normal rank)
Property / cites work: Multilayer feedforward networks are universal approximators (Normal rank)
Property / cites work: A pattern classification approach to evaluation function learning (Normal rank)
Property / cites work: A Stochastic Approximation Method (Normal rank)
Property / cites work: Learning representations by back-propagating errors (Normal rank)
Property / cites work: A parallel network that learns to play backgammon (Normal rank)
Property / cites work: On the Uniform Convergence of Relative Frequencies of Events to Their Probabilities (Normal rank)
Property / cites work: On Optimal Doubling in Backgammon (Normal rank)
Property / DBLP publication ID: journals/ml/Tesauro92 (Normal rank)

Latest revision as of 04:51, 13 November 2024

Language: English
Label: Practical issues in temporal difference learning
Description: scientific article

    Statements

    Identifiers