A dynamic programming strategy to balance exploration and exploitation in the bandit problem (Q647433): Difference between revisions

@@ Property / describes a project that uses @@
+bootstrap
@@ Property / describes a project that uses: bootstrap / rank @@
+Normal rank
@@ Property / describes a project that uses @@
+PRMLT
@@ Property / describes a project that uses: PRMLT / rank @@
+Normal rank
@@ Property / MaRDI profile type @@
+MaRDI publication profile
@@ Property / MaRDI profile type: MaRDI publication profile / rank @@
+Normal rank
@@ Property / full work available at URL @@
+https://doi.org/10.1007/s10472-010-9190-1
@@ Property / full work available at URL: https://doi.org/10.1007/s10472-010-9190-1 / rank @@
+Normal rank
@@ Property / OpenAlex ID @@
+W2052471706
@@ Property / OpenAlex ID: W2052471706 / rank @@
+Normal rank
@@ Property / cites work @@
+Finite-time analysis of the multiarmed bandit problem
@@ Property / cites work: Finite-time analysis of the multiarmed bandit problem / rank @@
+Normal rank
@@ Property / cites work @@
+Q4252717
@@ Property / cites work: Q4252717 / rank @@
+Normal rank
@@ Property / cites work @@
+Q3795523
@@ Property / cites work: Q3795523 / rank @@
+Normal rank
@@ Property / cites work @@
+Q4257216
@@ Property / cites work: Q4257216 / rank @@
+Normal rank
@@ Property / cites work @@
+Q5483032
@@ Property / cites work: Q5483032 / rank @@
+Normal rank
@@ Property / cites work @@
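+A dynamic programming strategy to balance exploration and exploitation in the bandit problem
@@ Property / cites work: A dynamic programming strategy to balance exploration and exploitation in the bandit problem / rank @@
+Normal rank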
@@ Property / cites work @@
+Q4318617
@@ Property / cites work: Q4318617 / rank @@
+Normal rank
@@ Property / cites work @@
+Q4692329
@@ Property / cites work: Q4692329 / rank @@
+Normal rank
@@ Property / cites work @@
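+The Sample Average Approximation Method for Stochastic Discrete Optimization
@@ Property / cites work: The Sample Average Approximation Method for Stochastic Discrete Optimization / rank @@
+Normal rank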
@@ Property / cites work @@
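+Exploration of multi-state environments: Local measures and back-propagation of uncertainty
@@ Property / cites work: Exploration of multi-state environments: Local measures and back-propagation of uncertainty / rank @@
+Normal rank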
@@ Property / cites work @@
+Approximate Dynamic Programming
@@ Property / cites work: Approximate Dynamic Programming / rank @@
+Normal rank
@@ Property / cites work @@
+Q4315289
@@ Property / cites work: Q4315289 / rank @@
+Normal rank
@@ Property / cites work @@
+Some aspects of the sequential design of experiments
@@ Property / cites work: Some aspects of the sequential design of experiments / rank @@
+Normal rank