Richard S. Sutton

From MaRDI portal



List of research outcomes

This list is incomplete and currently includes only items from zbMATH Open and arXiv. We are working on adding further sources; please check back soon.

| Publication | Venue | Date of Publication | Type |
|---|---|---|---|
| Reward-respecting subtasks for model-based reinforcement learning | Artificial Intelligence | 2023-11-16 | Paper |
| Reward is enough | Artificial Intelligence | 2021-11-02 | Paper |
| Policy iterations for reinforcement learning problems in continuous time and space -- fundamental theory and methods | Automatica | 2021-04-20 | Paper |
| On generalized Bellman equations and temporal-difference learning | Lecture Notes in Computer Science | 2020-08-05 | Paper |
| Reinforcement learning. An introduction | | 2019-02-27 | Paper |
| On generalized Bellman equations and temporal-difference learning | Journal of Machine Learning Research (JMLR) | 2018-11-21 | Paper |
| True online temporal-difference learning | Journal of Machine Learning Research (JMLR) | 2016-11-22 | Paper |
| True online temporal-difference learning | Journal of Machine Learning Research (JMLR) | 2016-11-22 | Paper |
| An emphatic approach to the problem of off-policy temporal-difference learning | Journal of Machine Learning Research (JMLR) | 2016-06-06 | Paper |
| An emphatic approach to the problem of off-policy temporal-difference learning | Journal of Machine Learning Research (JMLR) | 2016-06-06 | Paper |
| Temporal-difference search in Computer Go | Machine Learning | 2012-05-23 | Paper |
| Natural actor-critic algorithms | Automatica | 2010-01-08 | Paper |
| Stimulus Representation and the Timing of Reward-Prediction Errors in Models of the Dopamine System | Neural Computation | 2008-12-05 | Paper |
| Reinforcement learning with replacing eligibility traces | Machine Learning | 2006-06-29 | Paper |
| Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning | Artificial Intelligence | 2002-07-24 | Paper |
| Reinforcement learning with replacing eligibility traces | Machine Learning | 1996-08-13 | Paper |
| Synthesis of nonlinear control surfaces by a layered associative search network | Biological Cybernetics | 1982-01-01 | Paper |
| Associative search network: A reinforcement learning associative memory | Biological Cybernetics | 1981-01-01 | Paper |
| Landmark learning: An illustration of associative search | Biological Cybernetics | 1981-01-01 | Paper |


This page was built for person: Richard S. Sutton