An actor-critic algorithm for constrained Markov decision processes (Q2504518): Difference between revisions

From MaRDI portal
Set OpenAlex properties.
ReferenceBot (talk | contribs)
Changed an Item
 
Property / cites work
 
Property / cites work: Q4264741 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4547448 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Optimal control and viscosity solutions of Hamilton-Jacobi-Bellman equations / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3997575 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4257216 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Stochastic approximation with two time scales / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4547443 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Actor-Critic--Type Learning Algorithms for Markov Decision Processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: OnActor-Critic Algorithms / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3093188 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4902563 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Envelope Theorems for Arbitrary Choice Sets / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4377607 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4315289 / rank
 
Normal rank
Property / cites work
 
Property / cites work: An analysis of temporal-difference learning with function approximation / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4547446 / rank
 
Normal rank

Latest revision as of 20:43, 24 June 2024

scientific article
Language Label Description Also known as
English
An actor-critic algorithm for constrained Markov decision processes
scientific article

    Statements

    An actor-critic algorithm for constrained Markov decision processes (English)
    0 references
    0 references
    25 September 2006
    0 references
    0 references
    0 references
    0 references
    0 references
    actor-critic algorithms
    0 references
    reinforcement learning
    0 references
    constrained Markov decision processes
    0 references
    stochastic approximation
    0 references
    envelope theorem
    0 references
    0 references