Simple statistical gradient-following algorithms for connectionist reinforcement learning (Q1812928): Difference between revisions

From MaRDI portal
Importer (talk | contribs)
Created a new Item
 
Created claim: DBLP publication ID (P1635): journals/ml/Williams92, #quickstatements; #temporary_batch_1731547958265
 
(4 intermediate revisions by 4 users not shown)
Property / Wikidata QID
 
Property / Wikidata QID: Q39487141 / rank
 
Normal rank
Property / MaRDI profile type
 
Property / MaRDI profile type: MaRDI publication profile / rank
 
Normal rank
Property / cites work
 
Property / cites work: Pattern-recognizing stochastic learning automata / rank
 
Normal rank
Property / cites work
 
Property / cites work: Associative search network: A reinforcement learning associative memory / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3799870 / rank
 
Normal rank
Property / cites work
 
Property / cites work: An N-player sequential stochastic game with identical payoffs / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3856120 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4125549 / rank
 
Normal rank
Property / cites work
 
Property / cites work: A new approach to the design of reinforcement schemes for learning automata / rank
 
Normal rank
Property / cites work
 
Property / cites work: Decentralized learning in finite Markov chains / rank
 
Normal rank
Property / DBLP publication ID
 
Property / DBLP publication ID: journals/ml/Williams92 / rank
 
Normal rank
links / mardi / namelinks / mardi / name
 

Latest revision as of 02:46, 14 November 2024

scientific article
Language Label Description Also known as
English
Simple statistical gradient-following algorithms for connectionist reinforcement learning
scientific article

    Statements

    Simple statistical gradient-following algorithms for connectionist reinforcement learning (English)
    0 references
    0 references
    11 August 1992
    0 references
    gradient descent
    0 references
    reinforcement learning
    0 references
    connectionist networks
    0 references

    Identifiers