A review of stochastic algorithms with continuous value function approximation and some new approximate policy iteration algorithms for multidimensional continuous applications (Q2887630): Difference between revisions

From MaRDI portal
Added link to MaRDI item.
ReferenceBot
Changed an Item
 
(5 intermediate revisions by 4 users not shown)
Property / describes a project that uses
 
Property / describes a project that uses: ElemStatLearn / rank
 
Normal rank
Property / describes a project that uses
 
Property / describes a project that uses: Approxrl / rank
 
Normal rank
Property / MaRDI profile type
 
Property / MaRDI profile type: MaRDI publication profile / rank
 
Normal rank
Property / full work available at URL
 
Property / full work available at URL: https://doi.org/10.1007/s11768-011-0313-y / rank
 
Normal rank
Property / OpenAlex ID
 
Property / OpenAlex ID: W2044287460 / rank
 
Normal rank
Property / Wikidata QID
 
Property / Wikidata QID: Q115144927 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3241581 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3266141 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4315289 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Neuro-Dynamic Programming: An Overview and Recent Results / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4257216 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4936225 / rank
 
Normal rank
Property / cites work
 
Property / cites work: The elements of statistical learning. Data mining, inference, and prediction / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4845461 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Approximate Dynamic Programming / rank
 
Normal rank
Property / cites work
 
Property / cites work: Simulation-based algorithms for Markov decision processes. / rank
 
Normal rank
Property / cites work
 
Property / cites work: Simulation-based optimization: Parametric optimization techniques and reinforcement learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: Functional Approximations and Dynamic Programming / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4160185 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Approximations of Dynamic Programs, I / rank
 
Normal rank
Property / cites work
 
Property / cites work: Generalized polynomial approximations in Markovian decision processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: \({\mathcal Q}\)-learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: Practical issues in temporal difference learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: Feature-based methods for large scale dynamic programming / rank
 
Normal rank
Property / cites work
 
Property / cites work: An analysis of temporal-difference learning with function approximation / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5477859 / rank
 
Normal rank
Property / cites work
 
Property / cites work: 10.1162/1532443041827907 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3174155 / rank
 
Normal rank
Property / cites work
 
Property / cites work: 10.1162/153244303768966102 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Model-free \(Q\)-learning designs for linear discrete-time zero-sum games with application to \(H^\infty\) control / rank
 
Normal rank
Property / cites work
 
Property / cites work: Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3096132 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Kernel-based reinforcement learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: The Kernel Recursive Least-Squares Algorithm / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q2834459 / rank
 
Normal rank
Property / cites work
 
Property / cites work: The policy iteration algorithm for average reward Markov decision processes with general state space / rank
 
Normal rank
Property / cites work
 
Property / cites work: Policy Iterations on the Hamilton–Jacobi–Isaacs Equation for \(H_{\infty}\) State Feedback Control With Input Saturation / rank
 
Normal rank
Property / cites work
 
Property / cites work: Neural network approach to continuous-time direct adaptive optimal control for partially unknown nonlinear systems / rank
 
Normal rank
Property / cites work
 
Property / cites work: Adaptive optimal control for continuous-time linear systems based on policy iteration / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4225048 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Markov chains and stochastic stability / rank
 
Normal rank
Property / cites work
 
Property / cites work: Stochastic optimal control. The discrete time case / rank
 
Normal rank
Property / cites work
 
Property / cites work: Some results on Tchebycheffian spline functions and stochastic processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4776665 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Recursive estimation of regression functions by local polynomial fitting / rank
 
Normal rank

Latest revision as of 06:45, 5 July 2024

scientific article
Language: English
Label: A review of stochastic algorithms with continuous value function approximation and some new approximate policy iteration algorithms for multidimensional continuous applications
Description: scientific article
Also known as: (none listed)

    Statements

    A review of stochastic algorithms with continuous value function approximation and some new approximate policy iteration algorithms for multidimensional continuous applications (English)
    1 June 2012
    approximate dynamic programming
    reinforcement learning
    optimal control
    approximation algorithms

    Identifiers