Eligibility traces and forgetting factor in recursive least-squares-based temporal difference
From MaRDI portal
Publication:6495643
Recommendations
Cites work
- scientific article; zbMATH DE number 1753141 (Why is no real title available?)
- Adaptive Control Design Based on Adaptive Optimization Principles
- Adaptive Control Tutorial
- Adaptive Dynamic Programming and Adaptive Optimal Output Regulation of Linear Systems
- Adaptive Optimal Control for Large-Scale Nonlinear Systems
- Adaptive critic design with graph Laplacian for online learning control of nonlinear systems
- Adaptive dynamic programming for model-free tracking of trajectories with time-varying parameters
- An adaptive optimization scheme with satisfactory transient performance
- An analysis of temporal-difference learning with function approximation
- Composite Model Reference Adaptive Control with Parameter Convergence Under Finite Excitation
- Convergence results for single-step on-policy reinforcement-learning algorithms
- Initial Excitation-Based Iterative Algorithm for Approximate Optimal Control of Completely Unknown LTI Systems
- Linear Quadratic Tracking Control of Partially-Unknown Continuous-Time Systems Using Reinforcement Learning
- Linear least-squares algorithms for temporal difference learning
- Optimal tracking control of nonlinear partially-unknown constrained-input systems using integral reinforcement learning
- Practical issues in temporal difference learning
- Reinforcement learning for adaptive optimal control of continuous-time linear periodic systems
- Stability of Stochastic Approximations With “Controlled Markov” Noise and Temporal Difference Learning
- Technical update: Least-squares temporal difference learning
- The convergence of \(TD(\lambda)\) for general \(\lambda\)
- \(Q(\lambda )\)-learning adaptive fuzzy logic controllers for pursuit-evasion differential games
This page was built for publication: Eligibility traces and forgetting factor in recursive least-squares-based temporal difference
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q6495643)