Asymptotic analysis of value prediction by well-specified and misspecified models
DOI10.1016/J.NEUNET.2012.03.004zbMATH Open1251.62034DBLPjournals/nn/UenoMI12OpenAlexW2033534582WikidataQ51610451 ScholiaQ51610451MaRDI QIDQ448322FDOQ448322
Authors: Tsuyoshi Ueno, Shin Ishii, Shin-Ichi Maeda
Publication date: 30 August 2012
Published in: Neural Networks (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1016/j.neunet.2012.03.004
Recommendations
- Generalized TD learning
- Analytical mean squared error curves for temporal difference learning
- On the convergence of temporal-difference learning with linear function approximation
- A finite time analysis of temporal difference learning with linear function approximation
- Advances in Artificial Intelligence
M-estimatorsasymptotic analysispolicy evaluationreinforcement learningsemiparametric statistical inference
Nonparametric estimation (62G05) Asymptotic properties of nonparametric inference (62G20) Markov processes: estimation; hidden Markov models (62M05)
Cites Work
- A new look at the statistical model identification
- Title not available (Why is that?)
- Stochastic optimal control. The discrete time case
- Generalised information criteria in model selection
- Linear least-squares algorithms for temporal difference learning
- An analysis of temporal-difference learning with function approximation
- Algorithms for reinforcement learning.
- Technical update: Least-squares temporal difference learning
- The optimal unbiased value estimator and its relation to LSTD, TD and MC
- Asymptotic analysis of value prediction by well-specified and misspecified models
- Title not available (Why is that?)
Cited In (1)
This page was built for publication: Asymptotic analysis of value prediction by well-specified and misspecified models
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q448322)