Asymptotic analysis of value prediction by well-specified and misspecified models
From MaRDI portal
Publication:448322
DOI10.1016/j.neunet.2012.03.004zbMath1251.62034OpenAlexW2033534582WikidataQ51610451 ScholiaQ51610451MaRDI QIDQ448322
Tsuyoshi Ueno, Shin Ishii, Shin-ichi Maeda
Publication date: 30 August 2012
Published in: Neural Networks (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1016/j.neunet.2012.03.004
M-estimatorsasymptotic analysisreinforcement learningpolicy evaluationsemiparametric statistical inference
Asymptotic properties of nonparametric inference (62G20) Nonparametric estimation (62G05) Markov processes: estimation; hidden Markov models (62M05)
Related Items (1)
Cites Work
- Unnamed Item
- Unnamed Item
- Unnamed Item
- The optimal unbiased value estimator and its relation to LSTD, TD and MC
- Asymptotic analysis of value prediction by well-specified and misspecified models
- Stochastic optimal control. The discrete time case
- Technical update: Least-squares temporal difference learning
- Algorithms for Reinforcement Learning
- An analysis of temporal-difference learning with function approximation
- Generalised information criteria in model selection
- A new look at the statistical model identification
This page was built for publication: Asymptotic analysis of value prediction by well-specified and misspecified models