Asymptotic analysis of value prediction by well-specified and misspecified models
From MaRDI portal
Publication:448322
Recommendations
- Generalized TD learning
- Analytical mean squared error curves for temporal difference learning
- On the convergence of temporal-difference learning with linear function approximation
- A finite time analysis of temporal difference learning with linear function approximation
- Advances in Artificial Intelligence
Cites work
- scientific article; zbMATH DE number 1321699 (Why is no real title available?)
- A new look at the statistical model identification
- Algorithms for reinforcement learning.
- An analysis of temporal-difference learning with function approximation
- Asymptotic analysis of value prediction by well-specified and misspecified models
- Generalised information criteria in model selection
- Generalized TD learning
- Linear least-squares algorithms for temporal difference learning
- Stochastic optimal control. The discrete time case
- Technical update: Least-squares temporal difference learning
- The optimal unbiased value estimator and its relation to LSTD, TD and MC
This page was built for publication: Asymptotic analysis of value prediction by well-specified and misspecified models
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q448322)