Notes on average Markov decision processes with a minimum-variance criterion (Q1612012): Difference between revisions

From MaRDI portal
Importer (talk | contribs)
Created a new Item
 
ReferenceBot (talk | contribs)
Changed an Item
(2 intermediate revisions by 2 users not shown)
Property / MaRDI profile type
 
Property / MaRDI profile type: MaRDI publication profile / rank
 
Normal rank
Property / cites work
 
Property / cites work: Markov Decision Problems and State-Action Frequencies / rank
 
Normal rank
Property / cites work
 
Property / cites work: Variability Sensitive Markov Decision Processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Discounted MDP’s: Distribution Functions and Exponential Utility Maximization / rank
 
Normal rank
Property / cites work
 
Property / cites work: A note on maximal mean/standard deviation ratio in an undiscounted MDP / rank
 
Normal rank
Property / cites work
 
Property / cites work: Mean-Variance Tradeoffs in an Undiscounted MDP: The Unichain Case / rank
 
Normal rank
Property / cites work
 
Property / cites work: Finite-horizon variance penalised Markov decision processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5822308 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Variance-Penalized Markov Decision Processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Nonstationary denumerable state Markov decision processes -- with average variance criterion / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4501351 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Nonhomogeneous Markov Decision Processes with Borel State Space—The Average Criterion with Nonuniformly Bounded Rewards / rank
 
Normal rank
Property / cites work
 
Property / cites work: Markov Decision Processes with a New Optimality Criterion: Small Interest Rates / rank
 
Normal rank
Property / cites work
 
Property / cites work: Markov decision processes with a new optimality criterion: Discrete time / rank
 
Normal rank
Property / cites work
 
Property / cites work: A variance minimization problem for a Markov decision process / rank
 
Normal rank
Property / cites work
 
Property / cites work: VARIANCE CONSTRAINED MARKOV DECISION PROCESS / rank
 
Normal rank
Property / cites work
 
Property / cites work: Markov decision processes with a minimum-variance criterion / rank
 
Normal rank
Property / cites work
 
Property / cites work: Markov decision programming–the moment optimal problem for the first-passage model / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5618142 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Estimation and control in Markov chains / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5284147 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4315289 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Dynamic programming of expectation and variance / rank
 
Normal rank
Property / cites work
 
Property / cites work: The variance of discounted Markov decision processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Maximal mean/standard deviation ratio in an undiscounted MDP / rank
 
Normal rank
Property / cites work
 
Property / cites work: Mean-Variance Tradeoffs in an Undiscounted MDP / rank
 
Normal rank
Property / cites work
 
Property / cites work: Mean, variance and probabilistic criteria in finite Markov decision processes: A review / rank
 
Normal rank
Property / cites work
 
Property / cites work: Mean-Variance Analysis in Infinite Horizon Non-Discounted Markov Decision Processes: Technical Note / rank
 
Normal rank
links / mardi / namelinks / mardi / name
 

Revision as of 16:16, 4 June 2024

scientific article
Language Label Description Also known as
English
Notes on average Markov decision processes with a minimum-variance criterion
scientific article

    Statements

    Notes on average Markov decision processes with a minimum-variance criterion (English)
    0 references
    0 references
    28 August 2002
    0 references
    In Markov decision processes (here with countable state and action spaces), one of the main objectives is the average reward per unit of time, the expectation of which is to be maximized. For a risk-aversing decision-maker, an optimal policy under this objective may have an unacceptably high variance. So the variance minimization became more and more interesting for research. The author carefully analyses two relevant papers by \textit{M. Kurano} [J. Math. Anal. Appl. 123, 572--583 (1987; Zbl 0619.90080)] and \textit{X. Guo} [Math. Meth. Oper. Res. 49, 87--96 (1999; Zbl 1016.90071)], and detected mistakes in the proofs of the main theorems so that they appeared as not yet proved. Using a slightly modified variance criterion and postulating a mild condition, the author proves the existence of a Markov policy which is \(\varepsilon\)-strong variance optimal for any \(\varepsilon>0\).
    0 references
    0 references
    Markov decision process
    0 references
    average criterion
    0 references
    variance minimization
    0 references
    \(\varepsilon\)-strong variance optimal policy
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references