Pages that link to "Item:Q438776"
From MaRDI portal
The following pages link to An online actor-critic algorithm with function approximation for constrained Markov decision processes (Q438776):
- Multiscale Q-learning with linear function approximation (Q312650)
- Event-based optimization approach for solving stochastic decision problems with probabilistic constraint (Q828677)
- Variance-constrained actor-critic algorithms for discounted and average reward MDPs (Q1689603)
- Suboptimal control for nonlinear systems with disturbance via integral sliding mode control and policy iteration (Q2178900)
- Risk-Constrained Reinforcement Learning with Percentile Risk Criteria (Q4558492)
- Queueing Network Controls via Deep Reinforcement Learning (Q5084497)
- Optimal deterministic controller synthesis from steady-state distributions (Q6156635)