A study of reinfrocement learning in the continuous case by the means of viscosity solutions
From MaRDI portal
Publication:1584837
DOI10.1023/A:1007686309208zbMATH Open0962.68144MaRDI QIDQ1584837FDOQ1584837
Authors: Rémi Munos
Publication date: 5 November 2000
Published in: Machine Learning (Search for Journal in Brave)
Recommendations
- Reinforcement learning solution for HJB equation arising in constrained optimal control problem
- scientific article; zbMATH DE number 4002910
- Reinforcement learning for a class of continuous-time input constrained optimal control problems
- Convergence of a Q-learning variant for continuous states and actions
- Policy iterations for reinforcement learning problems in continuous time and space -- fundamental theory and methods
Cited In (9)
- Reinforcement learning for a class of continuous-time input constrained optimal control problems
- A model for system uncertainty in reinforcement learning
- Exploratory HJB equations and their convergence
- Reinforcement learning solution for HJB equation arising in constrained optimal control problem
- Multilevel preconditioners for temporal-difference learning methods related to recommendation engines
- Optimal harvesting policy for biological resources with uncertain heterogeneity for application in fisheries management
- Convergence results for an averaged LQR problem with applications to reinforcement learning
- On a multilevel preconditioner and its condition numbers for the discretized Laplacian on full and sparse grids in higher dimensions
- Title not available (Why is that?)
This page was built for publication: A study of reinfrocement learning in the continuous case by the means of viscosity solutions
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q1584837)