Pages that link to "Item:Q1932736"
From MaRDI portal
The following pages link to Error bounds for constant step-size \(Q\)-learning (Q1932736):
Displaying 7 items.
- Q-learning for continuous-time linear systems: A model-free infinite horizon optimal control approach (Q511735) (← links)
- Finite-sample analysis of nonlinear stochastic approximation with applications in reinforcement learning (Q2097782) (← links)
- Convergence of Recursive Stochastic Algorithms Using Wasserstein Divergence (Q5018894) (← links)
- Some Limit Properties of Markov Chains Induced by Recursive Stochastic Algorithms (Q5037552) (← links)
- A Discrete-Time Switching System Analysis of Q-Learning (Q6107867) (← links)
- Recent advances in reinforcement learning in finance (Q6146668) (← links)
- Settling the sample complexity of model-based offline reinforcement learning (Q6192326) (← links)