Simple statistical gradient-following algorithms for connectionist reinforcement learning (Q1812928): Difference between revisions

@@ Property / cites work @@
+Pattern-recognizing stochastic learning automata
@@ Property / cites work: Pattern-recognizing stochastic learning automata / rank @@
+Normal rank
@@ Property / cites work @@
+Associative search network: A reinforcement learning associative memory
+Normal rank
@@ Property / cites work @@
+Q3799870
@@ Property / cites work: Q3799870 / rank @@
+Normal rank
@@ Property / cites work @@
+An N-player sequential stochastic game with identical payoffs
+Normal rank
@@ Property / cites work @@
+Q3856120
@@ Property / cites work: Q3856120 / rank @@
+Normal rank
@@ Property / cites work @@
+Q4125549
@@ Property / cites work: Q4125549 / rank @@
+Normal rank
@@ Property / cites work @@
+A new approach to the design of reinforcement schemes for learning automata
+Normal rank
@@ Property / cites work @@
+Decentralized learning in finite Markov chains
@@ Property / cites work: Decentralized learning in finite Markov chains / rank @@
+Normal rank
@@ Property / DBLP publication ID @@
+journals/ml/Williams92
@@ Property / DBLP publication ID: journals/ml/Williams92 / rank @@
+Normal rank