Adaptive importance sampling for value function approximation in off-policy reinforcement learning (Q1784527): Difference between revisions

@@ Property / full work available at URL @@
+https://doi.org/10.1016/j.neunet.2009.01.002
+Normal rank
@@ Property / OpenAlex ID @@
+W2002748013
@@ Property / OpenAlex ID: W2002748013 / rank @@
+Normal rank
@@ Property / cites work @@
+Q4257216
@@ Property / cites work: Q4257216 / rank @@
+Normal rank
@@ Property / cites work @@
+Q4869639
@@ Property / cites work: Q4869639 / rank @@
+Normal rank
@@ Property / cites work @@
+The elements of statistical learning. Data mining, inference, and prediction
+Normal rank
@@ Property / cites work @@
+.1162/1532443041827907
@@ Property / cites work: 10.1162/1532443041827907 / rank @@
+Normal rank
@@ Property / cites work @@
+Linear Statistical Inference and its Applications
@@ Property / cites work: Linear Statistical Inference and its Applications / rank @@
+Normal rank
@@ Property / cites work @@
+Improving predictive inference under covariate shift by weighting the log-likelihood function
+Normal rank
@@ Property / cites work @@
+Trading Variance Reduction with Unbiasedness: The Regularized Subspace Information Criterion for Robust Model Selection in Kernel Regression
+Normal rank
@@ Property / cites work @@
+Q3174108
@@ Property / cites work: Q3174108 / rank @@
+Normal rank
@@ Property / DBLP publication ID @@
+journals/nn/HachiyaASP09
@@ Property / DBLP publication ID: journals/nn/HachiyaASP09 / rank @@
+Normal rank