An online actor-critic algorithm with function approximation for constrained Markov decision processes (Q438776): Difference between revisions

From MaRDI portal
Set OpenAlex properties.
ReferenceBot (talk | contribs)
Changed an Item
 
Property / cites work
 
Property / cites work: Q4264741 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4257216 / rank
 
Normal rank
Property / cites work
 
Property / cites work: OnActor-Critic Algorithms / rank
 
Normal rank
Property / cites work
 
Property / cites work: Natural actor-critic algorithms / rank
 
Normal rank
Property / cites work
 
Property / cites work: Average cost temporal-difference learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: Simulation-based optimization of Markov reward processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Optimal flow control of a class of queueing networks in equilibrium / rank
 
Normal rank
Property / cites work
 
Property / cites work: An actor-critic algorithm with function approximation for discounted cost constrained Markov decision processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Multivariate stochastic approximation using a simultaneous perturbation gradient approximation / rank
 
Normal rank
Property / cites work
 
Property / cites work: An actor-critic algorithm for constrained Markov decision processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4714399 / rank
 
Normal rank
Property / cites work
 
Property / cites work: The Borkar-Meyn theorem for asynchronous stochastic approximations / rank
 
Normal rank
Property / cites work
 
Property / cites work: Asynchronous Stochastic Approximations / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3527701 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3997575 / rank
 
Normal rank
Property / cites work
 
Property / cites work: The O.D.E. Method for Convergence of Stochastic Approximation and Reinforcement Learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: Perturbation theory and finite Markov chains / rank
 
Normal rank

Latest revision as of 12:54, 5 July 2024

scientific article
Language Label Description Also known as
English
An online actor-critic algorithm with function approximation for constrained Markov decision processes
scientific article

    Statements

    An online actor-critic algorithm with function approximation for constrained Markov decision processes (English)
    0 references
    0 references
    0 references
    31 July 2012
    0 references
    actor critic algorithm
    0 references
    constrained Markov decision process
    0 references
    long-run average cost criterion
    0 references
    function approximation
    0 references

    Identifiers