Analysis and improvement of policy gradient estimation (Q448295): Difference between revisions

@@ Property / author @@
-Masashi Sugiyama
@@ Property / author: Masashi Sugiyama / rank @@
-Normal rank
@@ Property / MaRDI profile type @@
+MaRDI publication profile
@@ Property / MaRDI profile type: MaRDI publication profile / rank @@
+Normal rank
@@ Property / full work available at URL @@
+https://doi.org/10.1016/j.neunet.2011.09.005
@@ Property / full work available at URL: https://doi.org/10.1016/j.neunet.2011.09.005 / rank @@
+Normal rank
@@ Property / OpenAlex ID @@
+W2148053762
@@ Property / OpenAlex ID: W2148053762 / rank @@
+Normal rank
@@ Property / Wikidata QID @@
+Q51513131
@@ Property / Wikidata QID: Q51513131 / rank @@
+Normal rank