Pages that link to "Item:Q616967"
From MaRDI portal
The following pages link to An actor-critic algorithm with function approximation for discounted cost constrained Markov decision processes (Q616967):
Displaying 10 items.
- Constrained Markov decision processes with first passage criteria (Q363565) (← links)
- An online actor-critic algorithm with function approximation for constrained Markov decision processes (Q438776) (← links)
- The Borkar-Meyn theorem for asynchronous stochastic approximations (Q553371) (← links)
- Variance-constrained actor-critic algorithms for discounted and average reward MDPs (Q1689603) (← links)
- Smoothed functional-based gradient algorithms for off-policy reinforcement learning: a non-asymptotic viewpoint (Q2242923) (← links)
- Risk-Constrained Reinforcement Learning with Percentile Risk Criteria (Q4558492) (← links)
- Risk-Sensitive Reinforcement Learning via Policy Gradient Search (Q5102286) (← links)
- Global Convergence of Policy Gradient Methods to (Almost) Locally Optimal Policies (Q5139670) (← links)
- Dimension reduction based adaptive dynamic programming for optimal control of discrete-time nonlinear control-affine systems (Q6052349) (← links)
- Recent advances in reinforcement learning in finance (Q6146668) (← links)