Approximate receding horizon approach for Markov decision processes: average reward case (Q1414220)

From MaRDI portal

Language: English
Label: Approximate receding horizon approach for Markov decision processes: average reward case
Description: scientific article

    Statements

    title: Approximate receding horizon approach for Markov decision processes: average reward case (English)
    publication date: 20 November 2003
    The authors consider an approximation scheme for solving Markov decision processes (MDPs) with countable state space, finite action space, and bounded rewards. The scheme uses an approximate solution of a fixed finite-horizon sub-MDP of the given infinite-horizon MDP to construct a stationary policy, an approach they call "approximate receding horizon control". They analyze the performance of approximate receding horizon control under certain conditions, study two examples, provide a simple proof of a policy-improvement result for countable state spaces, and discuss practical implementation of the scheme via simulation.
    keywords: Markov decision process; receding horizon control; infinite-horizon average reward; policy improvement; ergodicity