The following pages link to Satinder Pal Singh (Q1812932):
Displaying 15 items.
- (Q1266171) (redirect page) (← links)
- Analytical mean squared error curves for temporal difference learning (Q1266172) (← links)
- (Q1345142) (redirect page) (← links)
- An upper bound on the loss from approximate optimal-value functions (Q1345144) (← links)
- Convergence results for single-step on-policy reinforcement-learning algorithms (Q1568533) (← links)
- Near-optimal reinforcement learning in polynomial time (Q1604817) (← links)
- Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning (Q1606316) (← links)
- Transfer of learning by composing solutions of elemental sequential tasks (Q1812933) (← links)
- Reinforcement learning with replacing eligibility traces (Q1911343) (← links)
- Reward is enough (Q2238710) (← links)
- Learning payoff functions in infinite games (Q2384147) (← links)
- On the Convergence of Stochastic Iterative Dynamic Programming Algorithms (Q4323346) (← links)
- (Q4533345) (← links)
- (Q4617629) (← links)
- (Q5477862) (← links)