On Generalized Bellman Equations and Temporal-Difference Learning (Q3305109): Difference between revisions

@@ description / en / description / en @@
+scientific article; zbMATH DE number 6982339
@@ Property / zbMATH Open document ID @@
+1465.90117
@@ Property / zbMATH Open document ID: 1465.90117 / rank @@
+Normal rank
@@ Property / publication date @@
+21 November 2018
Timestamp +2018-11-21T00:00:00Z
Timezone +00:00
Calendar Gregorian
Precision 1 day
Before 0
After 0
@@ Property / publication date: 21 November 2018 / rank @@
+Normal rank
@@ Property / full work available at URL @@
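+http://jmlr.csail.mit.edu/papers/v19/17-283.html
@@ Property / full work available at URL: http://jmlr.csail.mit.edu/papers/v19/17-283.html / rank @@
+Normal rank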
@@ Property / Mathematics Subject Classification ID @@
+60J20
@@ Property / Mathematics Subject Classification ID: 60J20 / rank @@
+Normal rank
@@ Property / Mathematics Subject Classification ID @@
+90C39
@@ Property / Mathematics Subject Classification ID: 90C39 / rank @@
+Normal rank
@@ Property / zbMATH DE Number @@
+6982339
@@ Property / zbMATH DE Number: 6982339 / rank @@
+Normal rank
@@ Property / zbMATH Keywords @@
+approximate policy evaluation
@@ Property / zbMATH Keywords: approximate policy evaluation / rank @@
+Normal rank
@@ Property / zbMATH Keywords @@
+reinforcement learning
@@ Property / zbMATH Keywords: reinforcement learning / rank @@
+Normal rank
@@ Property / zbMATH Keywords @@
+temporal-difference method
@@ Property / zbMATH Keywords: temporal-difference method / rank @@
+Normal rank
@@ Property / describes a project that uses @@
+SBEED
@@ Property / describes a project that uses: SBEED / rank @@
+Normal rank
@@ Property / arXiv classification @@
+cs.LG
@@ Property / arXiv classification: cs.LG / rank @@
+Normal rank
@@ Property / arXiv classification @@
+math.OC
@@ Property / arXiv classification: math.OC / rank @@
+Normal rank