The following pages link to Natural actor-critic algorithms (Q1049136):
Displayed 7 items.
- An online actor-critic algorithm with function approximation for constrained Markov decision processes (Q438776) (← links)
- Hessian matrix distribution for Bayesian policy gradient reinforcement learning (Q545311) (← links)
- The Borkar-Meyn theorem for asynchronous stochastic approximations (Q553371) (← links)
- An actor-critic algorithm with function approximation for discounted cost constrained Markov decision processes (Q616967) (← links)
- The factored policy-gradient planner (Q835832) (← links)
- Natural actor-critic algorithms (Q1049136) (← links)
- Preference-based reinforcement learning: a formal framework and a policy iteration algorithm (Q1945130) (← links)