Practical issues in temporal difference learning (Q1812929): Difference between revisions

From MaRDI portal
Created claim: Wikidata QID (P12): Q56225518, #quickstatements; #temporary_batch_1705765412645
ReferenceBot (talk | contribs)
Changed an Item
 
(2 intermediate revisions by 2 users not shown)
Property / MaRDI profile type
 
Property / MaRDI profile type: MaRDI publication profile / rank
 
Normal rank
Property / cites work
 
Property / cites work: Learnability and the Vapnik-Chervonenkis dimension / rank
 
Normal rank
Property / cites work
 
Property / cites work: The convergence of \(TD(\lambda)\) for general \(\lambda\) / rank
 
Normal rank
Property / cites work
 
Property / cites work: A comparison and evaluation of three machine learning procedures as applied to the game of checkers / rank
 
Normal rank
Property / cites work
 
Property / cites work: Multilayer feedforward networks are universal approximators / rank
 
Normal rank
Property / cites work
 
Property / cites work: A pattern classification approach to evaluation function learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: A Stochastic Approximation Method / rank
 
Normal rank
Property / cites work
 
Property / cites work: Learning representations by back-propagating errors / rank
 
Normal rank
Property / cites work
 
Property / cites work: A parallel network that learns to play backgammon / rank
 
Normal rank
Property / cites work
 
Property / cites work: On the Uniform Convergence of Relative Frequencies of Events to Their Probabilities / rank
 
Normal rank
Property / cites work
 
Property / cites work: On Optimal Doubling in Backgammon / rank
 
Normal rank
links / mardi / namelinks / mardi / name
 

Latest revision as of 15:52, 14 May 2024

scientific article
Language Label Description Also known as
English
Practical issues in temporal difference learning
scientific article

    Statements

    Identifiers