The following pages link to (Q3093180):
Displaying 22 items.
- Cooperative and geometric learning algorithm (CGLA) for path planning of UAVs with limited information (Q462353)
- Adaptive stepsizes for recursive estimation with applications in approximate dynamic programming (Q851872)
- Error bounds for constant step-size \(Q\)-learning (Q1932736)
- Fundamental design principles for reinforcement learning algorithms (Q2094028)
- Finite-sample analysis of nonlinear stochastic approximation with applications in reinforcement learning (Q2097782)
- Unified reinforcement Q-learning for mean field game and control problems (Q2153488)
- Reinforcement learning-based design of side-channel countermeasures (Q2154063)
- Reinforcement learning with algorithms from probabilistic structure estimation (Q2165986)
- A concentration bound for \(\operatorname{LSPE}( \lambda )\) (Q2677709)
- Integrated condition-based maintenance and multi-item lot-sizing with stochastic demand (Q2698605)
- Empirical Dynamic Programming (Q2806811)
- Mean-Field Controls with Q-Learning for Cooperative MARL: Convergence and Complexity Analysis (Q5018896)
- One-dimensional system arising in stochastic gradient descent (Q5022277)
- (Q5053203)
- Risk-Averse Approximate Dynamic Programming with Quantile-Based Risk Measures (Q5219554)
- Convergence Rates and Decoupling in Linear Stochastic Approximation Algorithms (Q5254881)
- Speedy Categorical Distributional Reinforcement Learning and Complexity Analysis (Q5865896)
- Concentration of Contractive Stochastic Approximation and Reinforcement Learning (Q5870773)
- A Discrete-Time Switching System Analysis of Q-Learning (Q6107867)
- Recent advances in reinforcement learning in finance (Q6146668)
- A stochastic contraction mapping theorem (Q6161346)
- Settling the sample complexity of model-based offline reinforcement learning (Q6192326)