Hybrid least-squares algorithms for approximate policy evaluation (Q1959511): Difference between revisions

@@ Property / full work available at URL @@
+https://doi.org/10.1007/s10994-009-5128-4
+Normal rank
@@ Property / OpenAlex ID @@
+W4236439427
@@ Property / OpenAlex ID: W4236439427 / rank @@
+Normal rank
@@ Property / Wikidata QID @@
+Q115146324
@@ Property / Wikidata QID: Q115146324 / rank @@
+Normal rank
@@ Property / cites work @@
+Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path
+Normal rank
@@ Property / cites work @@
+Q5477859
@@ Property / cites work: Q5477859 / rank @@
+Normal rank
@@ Property / cites work @@
+10.1162/1532443041827907
@@ Property / cites work: 10.1162/1532443041827907 / rank @@
+Normal rank
@@ Property / cites work @@
+Q4737965
@@ Property / cites work: Q4737965 / rank @@
+Normal rank
@@ Property / cites work @@
+Q4315289
@@ Property / cites work: Q4315289 / rank @@
+Normal rank
@@ Property / cites work @@
+Generalized polynomial approximations in Markovian decision processes
+Normal rank