Notes on average Markov decision processes with a minimum-variance criterion (Q1612012): Difference between revisions

In Markov decision processes (here with countable state and action spaces), one of the main objectives is the average reward per unit of time, the expectation of which is to be maximized. For a risk-averse decision-maker, a policy that is optimal under this objective may have an unacceptably high variance, so variance minimization has attracted growing research interest. The author carefully analyses two relevant papers by \textit{M. Kurano} [J. Math. Anal. Appl. 123, 572--583 (1987; Zbl 0619.90080)] and \textit{X. Guo} [Math. Meth. Oper. Res. 49, 87--96 (1999; Zbl 1016.90071)], and detects mistakes in the proofs of their main theorems, so that these theorems must be regarded as not yet proved. Using a slightly modified variance criterion and assuming a mild condition, the author proves the existence of a Markov policy which is \(\varepsilon\)-strong variance optimal for any \(\varepsilon>0\).
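To illustrate why an average-optimal policy can be unattractive to a risk-averse decision-maker, the following minimal Python sketch compares two hypothetical stationary policies on a two-state chain. It assumes a finite ergodic chain and uses one common notion of average variance, the stationary variance of the one-step reward, which is not necessarily the modified criterion of the reviewed paper. All names (`stationary_distribution`, `average_reward_and_variance`) and all transition matrices and rewards are illustrative assumptions, not taken from the source.

```python
def stationary_distribution(P, iters=10_000, tol=1e-12):
    """Approximate the stationary distribution of an ergodic chain by power iteration."""
    n = len(P)
    pi = [1.0 / n] * n  # start from the uniform distribution
    for _ in range(iters):
        # One step of pi <- pi P
        nxt = [sum(pi[i] * P[i][j] for i in range(n)) for j in range(n)]
        if max(abs(a - b) for a, b in zip(nxt, pi)) < tol:
            return nxt
        pi = nxt
    return pi


def average_reward_and_variance(P, r):
    """Long-run average reward g and stationary variance of the one-step reward.

    Assumption: variance is taken as sum_i pi_i * (r_i - g)^2, one common
    definition; the reviewed paper uses a modified criterion not given here.
    """
    pi = stationary_distribution(P)
    g = sum(p * x for p, x in zip(pi, r))
    var = sum(p * (x - g) ** 2 for p, x in zip(pi, r))
    return g, var


# Two hypothetical stationary policies, each given as (transition matrix, rewards).
risky = ([[0.5, 0.5], [0.5, 0.5]], [0.0, 10.0])
safe = ([[0.9, 0.1], [0.2, 0.8]], [4.0, 5.0])

g_risky, v_risky = average_reward_and_variance(*risky)
g_safe, v_safe = average_reward_and_variance(*safe)

print(f"risky: mean={g_risky:.4f}, variance={v_risky:.4f}")
print(f"safe:  mean={g_safe:.4f}, variance={v_safe:.4f}")
```

Under these assumed numbers the risky policy has the higher average reward (5 versus about 4.33) but also a far higher variance (25 versus about 0.22), which is the kind of trade-off a minimum-variance criterion is meant to address.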

Mathematics Subject Classification ID

90C40

Markov decision process

average criterion

variance minimization

\(\varepsilon\)-strong variance optimal policy

MaRDI profile type

MaRDI publication profile

cites work

Markov Decision Problems and State-Action Frequencies
Variability Sensitive Markov Decision Processes
Discounted MDP’s: Distribution Functions and Exponential Utility Maximization
A note on maximal mean/standard deviation ratio in an undiscounted MDP
Mean-Variance Tradeoffs in an Undiscounted MDP: The Unichain Case
Finite-horizon variance penalised Markov decision processes
Q5822308
Variance-Penalized Markov Decision Processes
Nonstationary denumerable state Markov decision processes -- with average variance criterion
Q4501351
Nonhomogeneous Markov Decision Processes with Borel State Space—The Average Criterion with Nonuniformly Bounded Rewards
Markov Decision Processes with a New Optimality Criterion: Small Interest Rates
Markov decision processes with a new optimality criterion: Discrete time
A variance minimization problem for a Markov decision process
VARIANCE CONSTRAINED MARKOV DECISION PROCESS
Markov decision processes with a minimum-variance criterion
Markov decision programming–the moment optimal problem for the first-passage model
Q5618142
Estimation and control in Markov chains
Q5284147
Q4315289
Dynamic programming of expectation and variance
The variance of discounted Markov decision processes
Maximal mean/standard deviation ratio in an undiscounted MDP
Mean-Variance Tradeoffs in an Undiscounted MDP
Mean, variance and probabilistic criteria in finite Markov decision processes: A review
Mean-Variance Analysis in Infinite Horizon Non-Discounted Markov Decision Processes: Technical Note

Sitelinks

Mathematics (1 entry)

mardi Publication:1612012

@@ Property / MaRDI profile type @@
+MaRDI publication profile
@@ Property / MaRDI profile type: MaRDI publication profile / rank @@
+Normal rank
@@ Property / cites work @@
+Markov Decision Problems and State-Action Frequencies
+Normal rank
@@ Property / cites work @@
+Variability Sensitive Markov Decision Processes
@@ Property / cites work: Variability Sensitive Markov Decision Processes / rank @@
+Normal rank
@@ Property / cites work @@
+Discounted MDP’s: Distribution Functions and Exponential Utility Maximization
+Normal rank
@@ Property / cites work @@
+A note on maximal mean/standard deviation ratio in an undiscounted MDP
+Normal rank
@@ Property / cites work @@
+Mean-Variance Tradeoffs in an Undiscounted MDP: The Unichain Case
+Normal rank
@@ Property / cites work @@
+Finite-horizon variance penalised Markov decision processes
+Normal rank
@@ Property / cites work @@
+Q5822308
@@ Property / cites work: Q5822308 / rank @@
+Normal rank
@@ Property / cites work @@
+Variance-Penalized Markov Decision Processes
@@ Property / cites work: Variance-Penalized Markov Decision Processes / rank @@
+Normal rank
@@ Property / cites work @@
+Nonstationary denumerable state Markov decision processes -- with average variance criterion
+Normal rank
@@ Property / cites work @@
+Q4501351
@@ Property / cites work: Q4501351 / rank @@
+Normal rank
@@ Property / cites work @@
+Nonhomogeneous Markov Decision Processes with Borel State Space—The Average Criterion with Nonuniformly Bounded Rewards
+Normal rank
@@ Property / cites work @@
+Markov Decision Processes with a New Optimality Criterion: Small Interest Rates
+Normal rank
@@ Property / cites work @@
+Markov decision processes with a new optimality criterion: Discrete time
+Normal rank
@@ Property / cites work @@
+A variance minimization problem for a Markov decision process
+Normal rank
@@ Property / cites work @@
+VARIANCE CONSTRAINED MARKOV DECISION PROCESS
@@ Property / cites work: VARIANCE CONSTRAINED MARKOV DECISION PROCESS / rank @@
+Normal rank
@@ Property / cites work @@
+Markov decision processes with a minimum-variance criterion
+Normal rank
@@ Property / cites work @@
+Markov decision programming–the moment optimal problem for the first-passage model
+Normal rank
@@ Property / cites work @@
+Q5618142
@@ Property / cites work: Q5618142 / rank @@
+Normal rank
@@ Property / cites work @@
+Estimation and control in Markov chains
@@ Property / cites work: Estimation and control in Markov chains / rank @@
+Normal rank
@@ Property / cites work @@
+Q5284147
@@ Property / cites work: Q5284147 / rank @@
+Normal rank
@@ Property / cites work @@
+Q4315289
@@ Property / cites work: Q4315289 / rank @@
+Normal rank
@@ Property / cites work @@
+Dynamic programming of expectation and variance
@@ Property / cites work: Dynamic programming of expectation and variance / rank @@
+Normal rank
@@ Property / cites work @@
+The variance of discounted Markov decision processes
+Normal rank
@@ Property / cites work @@
+Maximal mean/standard deviation ratio in an undiscounted MDP
+Normal rank
@@ Property / cites work @@
+Mean-Variance Tradeoffs in an Undiscounted MDP
@@ Property / cites work: Mean-Variance Tradeoffs in an Undiscounted MDP / rank @@
+Normal rank
@@ Property / cites work @@
+Mean, variance and probabilistic criteria in finite Markov decision processes: A review
+Normal rank
@@ Property / cites work @@
+Mean-Variance Analysis in Infinite Horizon Non-Discounted Markov Decision Processes: Technical Note
+Normal rank
@@ links / mardi / name / links / mardi / name @@
+Publication:1612012