On terminating Markov decision processes with a risk-averse objective function (Q5947647): Difference between revisions

This paper deals with terminating risk-sensitive finite states Markov decision processes with an absorbing and cost-free extra state. So the terminating problem is to seek stochastic shortest paths. Introducing two dynamic programming operators, the author gives the following results. (i) The existence and characterization of an optimal policy. (ii) Convergence properties for value iteration and policy iteration. Moreover, he illustrates the results with two computational examples.

0 references

reviewed by

Makiko Nisio

0 references

zbMATH Keywords

risk-sensitive finite states Markov decision processes

0 references

terminating problem

0 references

stochastic shortest paths

0 references

dynamic programming

0 references

convergence

0 references

value iteration

0 references

policy iteration

0 references

MaRDI profile type

MaRDI publication profile

0 references

cites work

Q4001523

0 references

An Analysis of Stochastic Shortest Path Problems

0 references

Discounted MDP’s: Distribution Functions and Exponential Utility Maximization

0 references

Risk-sensitive and minimax control of discrete-time, finite-state Markov decision processes

0 references

Q4186114

0 references

Q3140691

0 references

State-space formulae for all stabilizing controllers that satisfy an \(H_{\infty}\)-norm bound and relations to risk sensitivity

0 references

Risk sensitive control of Markov processes in countable state space

0 references

Risk-Sensitive Markov Decision Processes

0 references

Optimal stochastic linear systems with exponential performance criteria and their relation to deterministic differential games

0 references

Q5284147

0 references

Stochastic Shortest Path Games

0 references

Multiplicative Markov Decision Chains

0 references

The equivalence between infinite-horizon optimal control of stochastic systems with exponential-of-integral performance index and stochastic differential games

0 references

Risk-sensitive linear/quadratic/gaussian control

0 references

Q3997540

0 references

Q4339077

0 references

Q4714003

0 references

Identifiers

zbMATH Open document ID

0995.93075

0 references

DOI

10.1016/S0005-1098(01)00084-X

0 references

Mathematics Subject Classification ID

0 references

0 references

0 references

0 references

0 references

0 references

Sitelinks

Mathematics(1 entry)

mardi Publication:5947647

Revision as of 20:01, 3 June 2024 ReferenceBot (talk \| contribs) Bots 1,903,984 edits ‎Changed an Item ← Older edit	Revision as of 21:47, 29 July 2024 Daniel (talk \| contribs) Bureaucrats, Interface administrators, private, Suppressors, Administrators 626,773 edits ‎Created claim: Wikidata QID (P12): Q126742945, #quickstatements; #temporary_batch_1722284575798 Tag: QuickStatements [1.0.4] Newer edit →
	Property / Wikidata QID
		Q126742945
	Property / Wikidata QID: Q126742945 / rank
		Normal rank