A review of stochastic algorithms with continuous value function approximation and some new approximate policy iteration algorithms for multidimensional continuous applications (Q2887630): Difference between revisions

Revision as of 07:45, 5 July 2024

scientific article

Language	Label	Description	Also known as
English	A review of stochastic algorithms with continuous value function approximation and some new approximate policy iteration algorithms for multidimensional continuous applications	scientific article

Statements

instance of

scholarly article

0 references

title

A review of stochastic algorithms with continuous value function approximation and some new approximate policy iteration algorithms for multidimensional continuous applications (English)

0 references

0 references

0 references

Journal of Control Theory and Applications

0 references

publication date

1 June 2012

0 references

zbMATH Keywords

approximate dynamic programming

0 references

reinforcement learning

0 references

optimal control

0 references

approximation algorithms

0 references

describes a project that uses

0 references

0 references

0 references

full work available at URL

https://doi.org/10.1007/s11768-011-0313-y

0 references

0 references

0 references

0 references

Neuro-Dynamic Programming: An Overview and Recent Results

0 references

Q4257216

0 references

Q4936225

0 references

The elements of statistical learning. Data mining, inference, and prediction

0 references

Q4845461

0 references

Approximate Dynamic Programming

0 references

Simulation-based algorithms for Markov decision processes.

0 references

Simulation-based optimization: Parametric optimization techniques and reinforcement learning

0 references

Functional Approximations and Dynamic Programming

0 references

Q4160185

0 references

Approximations of Dynamic Programs, I

0 references

Generalized polynomial approximations in Markovian decision processes

0 references

${\mathcal Q}$-learning

0 references

Practical issues in temporal difference learning

0 references

Feature-based methods for large scale dynamic programming

0 references

An analysis of temporal-difference learning with function approximation

0 references

Q5477859

0 references

10.1162/1532443041827907

0 references

Q3174155

0 references

10.1162/153244303768966102

0 references

Model-free $Q$-learning designs for linear discrete-time zero-sum games with application to $H^\infty$ control

0 references

Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path

0 references

Q3096132

0 references

Kernel-based reinforcement learning

0 references

The Kernel Recursive Least-Squares Algorithm

0 references

Q2834459

0 references

The policy iteration algorithm for average reward Markov decision processes with general state space

0 references

Policy Iterations on the Hamilton–Jacobi–Isaacs Equation for $H_{\infty}$ State Feedback Control With Input Saturation

0 references

Neural network approach to continuous-time direct adaptive optimal control for partially unknown nonlinear systems

0 references

Adaptive optimal control for continuous-time linear systems based on policy iteration

0 references

Q4225048

0 references

Markov chains and stochastic stability

0 references

Stochastic optimal control. The discrete time case

0 references

Some results on Tchebycheffian spline functions and stochastic processes

0 references

Q4776665

0 references

Recursive estimation of regression functions by local polynomial fitting

0 references

Identifiers

zbMATH Open document ID

1249.90306

0 references

DOI

10.1007/s11768-011-0313-y

0 references

Mathematics Subject Classification ID

0 references

0 references

0 references

0 references

0 references

0 references

Sitelinks

Mathematics(1 entry)

mardi Publication:2887630

@@ Property / cites work @@
+Q3241581
@@ Property / cites work: Q3241581 / rank @@
+Normal rank
@@ Property / cites work @@
+Q3266141
@@ Property / cites work: Q3266141 / rank @@
+Normal rank
@@ Property / cites work @@
+Q4315289
@@ Property / cites work: Q4315289 / rank @@
+Normal rank
@@ Property / cites work @@
+Neuro-Dynamic Programming: An Overview and Recent Results
+Normal rank
@@ Property / cites work @@
+Q4257216
@@ Property / cites work: Q4257216 / rank @@
+Normal rank
@@ Property / cites work @@
+Q4936225
@@ Property / cites work: Q4936225 / rank @@
+Normal rank
@@ Property / cites work @@
+The elements of statistical learning. Data mining, inference, and prediction
+Normal rank
@@ Property / cites work @@
+Q4845461
@@ Property / cites work: Q4845461 / rank @@
+Normal rank
@@ Property / cites work @@
+Approximate Dynamic Programming
@@ Property / cites work: Approximate Dynamic Programming / rank @@
+Normal rank
@@ Property / cites work @@
+Simulation-based algorithms for Markov decision processes.
+Normal rank
@@ Property / cites work @@
+Simulation-based optimization: Parametric optimization techniques and reinforcement learning
+Normal rank
@@ Property / cites work @@
+Functional Approximations and Dynamic Programming
@@ Property / cites work: Functional Approximations and Dynamic Programming / rank @@
+Normal rank
@@ Property / cites work @@
+Q4160185
@@ Property / cites work: Q4160185 / rank @@
+Normal rank
@@ Property / cites work @@
+Approximations of Dynamic Programs, I
@@ Property / cites work: Approximations of Dynamic Programs, I / rank @@
+Normal rank
@@ Property / cites work @@
+Generalized polynomial approximations in Markovian decision processes
+Normal rank
@@ Property / cites work @@
+\({\mathcal Q}\)-learning
@@ Property / cites work: \({\mathcal Q}\)-learning / rank @@
+Normal rank
@@ Property / cites work @@
+Practical issues in temporal difference learning
@@ Property / cites work: Practical issues in temporal difference learning / rank @@
+Normal rank
@@ Property / cites work @@
+Feature-based methods for large scale dynamic programming
+Normal rank
@@ Property / cites work @@
+An analysis of temporal-difference learning with function approximation
+Normal rank
@@ Property / cites work @@
+Q5477859
@@ Property / cites work: Q5477859 / rank @@
+Normal rank
@@ Property / cites work @@
+.1162/1532443041827907
@@ Property / cites work: 10.1162/1532443041827907 / rank @@
+Normal rank
@@ Property / cites work @@
+Q3174155
@@ Property / cites work: Q3174155 / rank @@
+Normal rank
@@ Property / cites work @@
+.1162/153244303768966102
@@ Property / cites work: 10.1162/153244303768966102 / rank @@
+Normal rank
@@ Property / cites work @@
+Model-free \(Q\)-learning designs for linear discrete-time zero-sum games with application to \(H^\infty\) control
+Normal rank
@@ Property / cites work @@
+Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path
+Normal rank
@@ Property / cites work @@
+Q3096132
@@ Property / cites work: Q3096132 / rank @@
+Normal rank
@@ Property / cites work @@
+Kernel-based reinforcement learning
@@ Property / cites work: Kernel-based reinforcement learning / rank @@
+Normal rank
@@ Property / cites work @@
+The Kernel Recursive Least-Squares Algorithm
@@ Property / cites work: The Kernel Recursive Least-Squares Algorithm / rank @@
+Normal rank
@@ Property / cites work @@
+Q2834459
@@ Property / cites work: Q2834459 / rank @@
+Normal rank
@@ Property / cites work @@
+The policy iteration algorithm for average reward Markov decision processes with general state space
+Normal rank
@@ Property / cites work @@
+Policy Iterations on the Hamilton–Jacobi–Isaacs Equation for $H_{\infty}$ State Feedback Control With Input Saturation
+Normal rank
@@ Property / cites work @@
+Neural network approach to continuous-time direct adaptive optimal control for partially unknown nonlinear systems
+Normal rank
@@ Property / cites work @@
+Adaptive optimal control for continuous-time linear systems based on policy iteration
+Normal rank
@@ Property / cites work @@
+Q4225048
@@ Property / cites work: Q4225048 / rank @@
+Normal rank
@@ Property / cites work @@
+Markov chains and stochastic stability
@@ Property / cites work: Markov chains and stochastic stability / rank @@
+Normal rank
@@ Property / cites work @@
+Stochastic optimal control. The discrete time case
+Normal rank
@@ Property / cites work @@
+Some results on Tchebycheffian spline functions and stochastic processes
+Normal rank
@@ Property / cites work @@
+Q4776665
@@ Property / cites work: Q4776665 / rank @@
+Normal rank
@@ Property / cites work @@
+Recursive estimation of regression functions by local polynomial fitting
+Normal rank