Richard S. Sutton

From MaRDI portal
Person:1049133


List of research outcomes

This list is not complete; at the moment it contains only items from zbMATH Open and arXiv. We are working on additional sources - please check back here soon!

| Publication | Venue | Date of Publication | Type |
|---|---|---|---|
| Reward-respecting subtasks for model-based reinforcement learning | Artificial Intelligence | 2023-11-16 | Paper |
| Reward is enough | Artificial Intelligence | 2021-11-02 | Paper |
| Policy iterations for reinforcement learning problems in continuous time and space -- fundamental theory and methods | Automatica | 2021-04-20 | Paper |
| On generalized Bellman equations and temporal-difference learning | Lecture Notes in Computer Science | 2020-08-05 | Paper |
| Reinforcement learning. An introduction | | 2019-02-27 | Paper |
| On generalized Bellman equations and temporal-difference learning | Journal of Machine Learning Research (JMLR) | 2018-11-21 | Paper |
| True online temporal-difference learning | Journal of Machine Learning Research (JMLR) | 2016-11-22 | Paper |
| An emphatic approach to the problem of off-policy temporal-difference learning | Journal of Machine Learning Research (JMLR) | 2016-06-06 | Paper |
| Temporal-difference search in Computer Go | Machine Learning | 2012-05-23 | Paper |
| Natural actor-critic algorithms | Automatica | 2010-01-08 | Paper |
| Stimulus Representation and the Timing of Reward-Prediction Errors in Models of the Dopamine System | Neural Computation | 2008-12-05 | Paper |
| Reinforcement learning with replacing eligibility traces | Machine Learning | 2006-06-29 | Paper |
| Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning | Artificial Intelligence | 2002-07-24 | Paper |
| Reinforcement learning with replacing eligibility traces | Machine Learning | 1996-08-13 | Paper |
| Synthesis of nonlinear control surfaces by a layered associative search network | Biological Cybernetics | 1982-01-01 | Paper |
| Associative search network: A reinforcement learning associative memory | Biological Cybernetics | 1981-01-01 | Paper |
| Landmark learning: An illustration of associative search | Biological Cybernetics | 1981-01-01 | Paper |



