An online actor-critic algorithm with function approximation for constrained Markov decision processes (Q438776): Difference between revisions

@@ Property / cites work @@
+Q4264741
@@ Property / cites work: Q4264741 / rank @@
+Normal rank
@@ Property / cites work @@
+Q4257216
@@ Property / cites work: Q4257216 / rank @@
+Normal rank
@@ Property / cites work @@
+OnActor-Critic Algorithms
@@ Property / cites work: OnActor-Critic Algorithms / rank @@
+Normal rank
@@ Property / cites work @@
+Natural actor-critic algorithms
@@ Property / cites work: Natural actor-critic algorithms / rank @@
+Normal rank
@@ Property / cites work @@
+Average cost temporal-difference learning
@@ Property / cites work: Average cost temporal-difference learning / rank @@
+Normal rank
@@ Property / cites work @@
+Simulation-based optimization of Markov reward processes
+Normal rank
@@ Property / cites work @@
+Optimal flow control of a class of queueing networks in equilibrium
+Normal rank
@@ Property / cites work @@
+An actor-critic algorithm with function approximation for discounted cost constrained Markov decision processes
+Normal rank
@@ Property / cites work @@
+Multivariate stochastic approximation using a simultaneous perturbation gradient approximation
+Normal rank
@@ Property / cites work @@
+An actor-critic algorithm for constrained Markov decision processes
+Normal rank
@@ Property / cites work @@
+Q4714399
@@ Property / cites work: Q4714399 / rank @@
+Normal rank
@@ Property / cites work @@
+The Borkar-Meyn theorem for asynchronous stochastic approximations
+Normal rank
@@ Property / cites work @@
+Asynchronous Stochastic Approximations
@@ Property / cites work: Asynchronous Stochastic Approximations / rank @@
+Normal rank
@@ Property / cites work @@
+Q3527701
@@ Property / cites work: Q3527701 / rank @@
+Normal rank
@@ Property / cites work @@
+Q3997575
@@ Property / cites work: Q3997575 / rank @@
+Normal rank
@@ Property / cites work @@
+The O.D.E. Method for Convergence of Stochastic Approximation and Reinforcement Learning
+Normal rank
@@ Property / cites work @@
+Perturbation theory and finite Markov chains
@@ Property / cites work: Perturbation theory and finite Markov chains / rank @@
+Normal rank