An asymptotically optimal strategy for constrained multi-armed bandit problems (Q784789): Difference between revisions

@@ Property / DOI @@
-.1007/s00186-019-00697-3
@@ Property / DOI: 10.1007/s00186-019-00697-3 / rank @@
-Normal rank
@@ Property / Wikidata QID @@
+Q126414170
@@ Property / Wikidata QID: Q126414170 / rank @@
+Normal rank
@@ Property / cites work @@
+Finite-time analysis of the multiarmed bandit problem
+Normal rank
@@ Property / cites work @@
+Randomised allocation of treatments in sequential trials
+Normal rank
@@ Property / cites work @@
+Q3809068
@@ Property / cites work: Q3809068 / rank @@
+Normal rank
@@ Property / cites work @@
+Pure exploration in finitely-armed and continuous-armed bandits
+Normal rank
@@ Property / cites work @@
+Prediction, Learning, and Games
@@ Property / cites work: Prediction, Learning, and Games / rank @@
+Normal rank
@@ Property / cites work @@
+The multi-armed bandit, with constraints
@@ Property / cites work: The multi-armed bandit, with constraints / rank @@
+Normal rank
@@ Property / cites work @@
+Multi‐Armed Bandit Allocation Indices
@@ Property / cites work: Multi‐Armed Bandit Allocation Indices / rank @@
+Normal rank
@@ Property / cites work @@
+Probability Inequalities for Sums of Bounded Random Variables
+Normal rank
@@ Property / cites work @@
+Asymptotically efficient adaptive allocation rules
+Normal rank
@@ Property / cites work @@
+Algorithms for stochastic optimization with function or expectation constraints
+Normal rank
@@ Property / cites work @@
+Penalty Function with Memory for Discrete Optimization via Simulation with Stochastic Constraints
+Normal rank
@@ Property / cites work @@
+Stochastically Constrained Ranking and Selection via SCORE
+Normal rank
@@ Property / cites work @@
+Some aspects of the sequential design of experiments
+Normal rank
@@ Property / cites work @@
+Introduction to Stochastic Search and Optimization
+Normal rank
@@ Property / cites work @@
+Online Learning Methods for Networking
@@ Property / cites work: Online Learning Methods for Networking / rank @@
+Normal rank
@@ Property / cites work @@
+Sample average approximation of expected value constrained stochastic programs
+Normal rank
@@ Property / DOI @@
+.1007/S00186-019-00697-3
@@ Property / DOI: 10.1007/S00186-019-00697-3 / rank @@
+Normal rank