Sleeping experts and bandits approach to constrained Markov decision processes (Q901196): Difference between revisions

@@ Property / arXiv ID @@
+.4898
@@ Property / arXiv ID: 1412.4898 / rank @@
+Normal rank
@@ Property / cites work @@
+Stochastic approximation algorithms for constrained optimization via simulation
+Normal rank
@@ Property / cites work @@
+An exact iterative search algorithm for constrained Markov decision processes
+Normal rank
@@ Property / cites work @@
+Simulation-based algorithms for Markov decision processes.
+Normal rank
@@ Property / cites work @@
+Non-randomized policies for constrained Markov decision processes
+Normal rank
@@ Property / cites work @@
+${Q}$-Learning Algorithms for Constrained Markov Decision Processes With Randomized Monotone Policies: Application to MIMO Transmission Control
+Normal rank
@@ Property / cites work @@
+Constrained Discounted Markov Decision Processes and Hamiltonian Cycles
+Normal rank
@@ Property / cites work @@
+Probability Inequalities for Sums of Bounded Random Variables
+Normal rank
@@ Property / cites work @@
+Regret bounds for sleeping experts and bandits
@@ Property / cites work: Regret bounds for sleeping experts and bandits / rank @@
+Normal rank
@@ Property / cites work @@
+The Sample Average Approximation Method for Stochastic Discrete Optimization
+Normal rank
@@ Property / cites work @@
+Simulation-Based Discrete Optimization of Stochastic Discrete Event Systems Subject to Non Closed-Form Constraints
+Normal rank
@@ Property / cites work @@
+Stochastically Constrained Ranking and Selection via SCORE
+Normal rank
@@ Property / cites work @@
+Approximate Dynamic Programming
@@ Property / cites work: Approximate Dynamic Programming / rank @@
+Normal rank
@@ Property / cites work @@
+Sample average approximation of expected value constrained stochastic programs
+Normal rank