Richard S. Sutton

MaRDI QID: Q1049133

List of research outcomes

This list is incomplete and currently includes only items from zbMATH Open and arXiv. We are working on additional sources; please check back soon!

Publication	Venue	Date of Publication	Type
Reward-respecting subtasks for model-based reinforcement learning	Artificial Intelligence	2023-11-16	Paper
Reward is enough	Artificial Intelligence	2021-11-02	Paper
Policy iterations for reinforcement learning problems in continuous time and space -- fundamental theory and methods	Automatica	2021-04-20	Paper
On generalized Bellman equations and temporal-difference learning	Lecture Notes in Computer Science	2020-08-05	Paper
Reinforcement learning. An introduction		2019-02-27	Paper
On generalized Bellman equations and temporal-difference learning	Journal of Machine Learning Research (JMLR)	2018-11-21	Paper
True online temporal-difference learning	Journal of Machine Learning Research (JMLR)	2016-11-22	Paper
An emphatic approach to the problem of off-policy temporal-difference learning	Journal of Machine Learning Research (JMLR)	2016-06-06	Paper
Temporal-difference search in Computer Go	Machine Learning	2012-05-23	Paper
Natural actor-critic algorithms	Automatica	2010-01-08	Paper
Stimulus Representation and the Timing of Reward-Prediction Errors in Models of the Dopamine System	Neural Computation	2008-12-05	Paper
Reinforcement learning with replacing eligibility traces	Machine Learning	2006-06-29	Paper
Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning	Artificial Intelligence	2002-07-24	Paper
Reinforcement learning with replacing eligibility traces	Machine Learning	1996-08-13	Paper
Synthesis of nonlinear control surfaces by a layered associative search network	Biological Cybernetics	1982-01-01	Paper
Associative search network: A reinforcement learning associative memory	Biological Cybernetics	1981-01-01	Paper
Landmark learning: An illustration of associative search	Biological Cybernetics	1981-01-01	Paper

This page was built for person: Richard S. Sutton