An asymptotically optimal policy for finite support models in the multiarmed bandit problem (Q415624): Difference between revisions

@@ Property / Mathematics Subject Classification ID @@
+91A15
@@ Property / Mathematics Subject Classification ID: 91A15 / rank @@
+Normal rank
@@ Property / Mathematics Subject Classification ID @@
+91A26
@@ Property / Mathematics Subject Classification ID: 91A26 / rank @@
+Normal rank
@@ Property / Mathematics Subject Classification ID @@
+90C25
@@ Property / Mathematics Subject Classification ID: 90C25 / rank @@
+Normal rank
@@ Property / zbMATH DE Number @@
+6031871
@@ Property / zbMATH DE Number: 6031871 / rank @@
+Normal rank
@@ Property / zbMATH Keywords @@
+bandit problems
@@ Property / zbMATH Keywords: bandit problems / rank @@
+Normal rank
@@ Property / zbMATH Keywords @@
+finite-time regret
@@ Property / zbMATH Keywords: finite-time regret / rank @@
+Normal rank
@@ Property / zbMATH Keywords @@
+MED policy
@@ Property / zbMATH Keywords: MED policy / rank @@
+Normal rank
@@ Property / zbMATH Keywords @@
+convex optimization
@@ Property / zbMATH Keywords: convex optimization / rank @@
+Normal rank
@@ Property / Wikidata QID @@
+Q56675674
@@ Property / Wikidata QID: Q56675674 / rank @@
+Normal rank
@@ Property / MaRDI profile type @@
+MaRDI publication profile
@@ Property / MaRDI profile type: MaRDI publication profile / rank @@
+Normal rank
@@ Property / OpenAlex ID @@
+W2131958277
@@ Property / OpenAlex ID: W2131958277 / rank @@
+Normal rank
@@ Property / arXiv ID @@
+0905.2776
@@ Property / arXiv ID: 0905.2776 / rank @@
+Normal rank
@@ Property / cites work @@
+The Continuum-Armed Bandit Problem
@@ Property / cites work: The Continuum-Armed Bandit Problem / rank @@
+Normal rank
@@ Property / cites work @@
+Sample mean based index policies by <i>O</i>(log <i>n</i>) regret for the multi-armed bandit problem
@@ Property / cites work: Sample mean based index policies by <i>O</i>(log <i>n</i>) regret for the multi-armed bandit problem / rank @@
+Normal rank
@@ Property / cites work @@
+Finite-time analysis of the multiarmed bandit problem
@@ Property / cites work: Finite-time analysis of the multiarmed bandit problem / rank @@
+Normal rank
@@ Property / cites work @@
+The Nonstochastic Multiarmed Bandit Problem
@@ Property / cites work: The Nonstochastic Multiarmed Bandit Problem / rank @@
+Normal rank
@@ Property / cites work @@
+Q4821526
@@ Property / cites work: Q4821526 / rank @@
+Normal rank
@@ Property / cites work @@
+Optimal adaptive policies for sequential allocation problems
@@ Property / cites work: Optimal adaptive policies for sequential allocation problems / rank @@
+Normal rank
@@ Property / cites work @@
+Elements of Information Theory
@@ Property / cites work: Elements of Information Theory / rank @@
+Normal rank
@@ Property / cites work @@
+Q3046711
@@ Property / cites work: Q3046711 / rank @@
+Normal rank
@@ Property / cites work @@
+Introduction to sensitivity and stability analysis in nonlinear programming
@@ Property / cites work: Introduction to sensitivity and stability analysis in nonlinear programming / rank @@
+Normal rank
@@ Property / cites work @@
+Q4692329
@@ Property / cites work: Q4692329 / rank @@
+Normal rank
@@ Property / cites work @@
+Multi-armed bandit problem revisited
@@ Property / cites work: Multi-armed bandit problem revisited / rank @@
+Normal rank
@@ Property / cites work @@
+The Multi-Armed Bandit Problem: Decomposition and Computation
@@ Property / cites work: The Multi-Armed Bandit Problem: Decomposition and Computation / rank @@
+Normal rank
@@ Property / cites work @@
+Asymptotically efficient adaptive allocation rules
@@ Property / cites work: Asymptotically efficient adaptive allocation rules / rank @@
+Normal rank
@@ Property / cites work @@
+Exploration of multi-state environments: Local measures and back-propagation of uncertainty
@@ Property / cites work: Exploration of multi-state environments: Local measures and back-propagation of uncertainty / rank @@
+Normal rank
@@ Property / cites work @@
+Convergence of stochastic processes
@@ Property / cites work: Convergence of stochastic processes / rank @@
+Normal rank
@@ Property / cites work @@
+Some aspects of the sequential design of experiments
@@ Property / cites work: Some aspects of the sequential design of experiments / rank @@
+Normal rank
@@ Property / cites work @@
+Non-overlapping domain decomposition for evolution operators
@@ Property / cites work: Non-overlapping domain decomposition for evolution operators / rank @@
+Normal rank
@@ Property / cites work @@
+Nonparametric bandit methods
@@ Property / cites work: Nonparametric bandit methods / rank @@
+Normal rank
@@ links / mardi / name @@
+Publication:415624