Simple statistical gradient-following algorithms for connectionist reinforcement learning (Q1812928): Difference between revisions
From MaRDI portal
Created claim: Wikidata QID (P12): Q39487141, #quickstatements; #temporary_batch_1706368214787 |
Created claim: DBLP publication ID (P1635): journals/ml/Williams92, #quickstatements; #temporary_batch_1731547958265 |
||
(3 intermediate revisions by 3 users not shown) | |||
Property / MaRDI profile type | |||
Property / MaRDI profile type: MaRDI publication profile / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: Pattern-recognizing stochastic learning automata / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: Associative search network: A reinforcement learning associative memory / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: Q3799870 / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: An N-player sequential stochastic game with identical payoffs / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: Q3856120 / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: Q4125549 / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: A new approach to the design of reinforcement schemes for learning automata / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: Decentralized learning in finite Markov chains / rank | |||
Normal rank | |||
Property / DBLP publication ID | |||
Property / DBLP publication ID: journals/ml/Williams92 / rank | |||
Normal rank | |||
links / mardi / name | links / mardi / name | ||
Latest revision as of 02:46, 14 November 2024
scientific article
Language | Label | Description | Also known as |
---|---|---|---|
English | Simple statistical gradient-following algorithms for connectionist reinforcement learning |
scientific article |
Statements
Simple statistical gradient-following algorithms for connectionist reinforcement learning (English)
0 references
11 August 1992
0 references
gradient descent
0 references
reinforcement learning
0 references
connectionist networks
0 references