Satinder Pal Singh

From MaRDI portal
(Redirected from Person:1812932)



List of research outcomes

This list is not complete and representing at the moment only items from zbMATH Open and arXiv. We are working on additional sources - please check back here soon!

PublicationDate of PublicationType
Reward is enough
Artificial Intelligence
2021-11-02Paper
scientific article; zbMATH DE number 7014212 (Why is no real title available?)2019-02-06Paper
scientific article; zbMATH DE number 7014212 (Why is no real title available?)
(available as arXiv preprint)
2019-02-06Paper
Learning payoff functions in infinite games
Machine Learning
2007-09-20Paper
Reinforcement learning with replacing eligibility traces
Machine Learning
2006-06-29Paper
Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
Artificial Intelligence
2002-07-24Paper
Near-optimal reinforcement learning in polynomial time
Machine Learning
2002-07-08Paper
scientific article; zbMATH DE number 1753137 (Why is no real title available?)2002-06-10Paper
Convergence results for single-step on-policy reinforcement-learning algorithms
Machine Learning
2000-06-21Paper
Analytical mean squared error curves for temporal difference learning
Machine Learning
1998-09-07Paper
Reinforcement learning with replacing eligibility traces
Machine Learning
1996-08-13Paper
On the Convergence of Stochastic Iterative Dynamic Programming Algorithms
Neural Computation
1995-10-18Paper
An upper bound on the loss from approximate optimal-value functions
Machine Learning
1995-02-26Paper
Transfer of learning by composing solutions of elemental sequential tasks
Machine Learning
1992-08-11Paper


Research outcomes over time


This page was built for person: Satinder Pal Singh