The discounted method and equivalence of average criteria for risk-sensitive Markov decision processes on Borel spaces (Q964743): Difference between revisions

From MaRDI portal
Importer (talk | contribs)
Created a new Item
 
ReferenceBot (talk | contribs)
Changed an Item
 
(7 intermediate revisions by 6 users not shown)
Property / author
 
Property / author: Francisco Sergio Salem-Silva / rank
Normal rank
 
Property / author
 
Property / author: Francisco Sergio Salem-Silva / rank
 
Normal rank
Property / MaRDI profile type
 
Property / MaRDI profile type: MaRDI publication profile / rank
 
Normal rank
Property / full work available at URL
 
Property / full work available at URL: https://doi.org/10.1007/s00245-009-9080-2 / rank
 
Normal rank
Property / OpenAlex ID
 
Property / OpenAlex ID: W2004904423 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Discrete-Time Controlled Markov Processes with Average Cost Criterion: A Survey / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4386538 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Controlled Markov chains with risk-sensitive criteria: Average cost, optimality equations, and optimal solutions / rank
 
Normal rank
Property / cites work
 
Property / cites work: Solution to the risk-sensitive average cost optimality equation in a class of Markov decision processes with finite state space / rank
 
Normal rank
Property / cites work
 
Property / cites work: Infinite horizon risk sensitive control of discrete time Markov processes with small risk / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4388221 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4382287 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Risk-Sensitive Control of Finite State Machines on an Infinite Horizon I / rank
 
Normal rank
Property / cites work
 
Property / cites work: Risk-Sensitive Control on an Infinite Time Horizon / rank
 
Normal rank
Property / cites work
 
Property / cites work: A forecast horizon and a stopping rule for general Markov decision processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4863593 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Existence of risk-sensitive optimal stationary policies for controlled Markov processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Optimal Plans for Dynamic Programming Problems / rank
 
Normal rank
Property / cites work
 
Property / cites work: Risk-Sensitive Markov Decision Processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Average optimality for risk-sensitive control with general state space / rank
 
Normal rank
Property / cites work
 
Property / cites work: On the optimality equation for average cost Markov control processes with Feller transition probabilities / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3220809 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4324881 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Conditions for optimality in dynamic programming and for the limit of n-stage optimal policies to be optimal / rank
 
Normal rank
Property / cites work
 
Property / cites work: A new condition for the existence of optimal stationary policies in average cost Markov decision processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Another set of conditions for average optimality in Markov control processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3684346 / rank
 
Normal rank
links / mardi / namelinks / mardi / name
 

Latest revision as of 17:36, 2 July 2024

scientific article
Language Label Description Also known as
English
The discounted method and equivalence of average criteria for risk-sensitive Markov decision processes on Borel spaces
scientific article

    Statements

    The discounted method and equivalence of average criteria for risk-sensitive Markov decision processes on Borel spaces (English)
    0 references
    20 April 2010
    0 references
    The work concerns discrete-time Markov decision processes evolving on a Borel space. The system is driven by a risk-averse decision maker with a constant risk sensitivity coefficient \(\lambda > 0\), and the performance of a control policy is measured by the (superior limit) risk-sensitive average cost criterion associated with a nonnegative cost function. Under mild (semi-) continuity and compactness conditions, the following problems are studied via the discounted approach: (i) establishing the existence of optimal stationary policies, and (ii) determination of conditions under which the equality of the optimal value functions associated with the inferior limit and superior limit average criteria can be ensured. The approach of the paper relies on standard dynamic programming ideas and on a simple analytical derivation of a Tauberian relation.
    0 references
    0 references
    Hölder's inequality
    0 references
    contractive operators
    0 references
    generalized Fatou's lemma
    0 references
    risk-sensitive discount approach
    0 references
    weak continuity
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references