Stationary policies and Markov policies in Borel dynamic programming (Q1071658): Difference between revisions

From MaRDI portal
Importer (talk | contribs)
Created a new Item
 
ReferenceBot (talk | contribs)
Changed an Item
 
(2 intermediate revisions by 2 users not shown)
Property / MaRDI profile type
 
Property / MaRDI profile type: MaRDI publication profile / rank
 
Normal rank
Property / cites work
 
Property / cites work: Stochastic optimal control. The discrete time case / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5583572 / rank
 
Normal rank
Property / cites work
 
Property / cites work: The stochastic processes of Borel gambling and dynamic programming / rank
 
Normal rank
Property / cites work
 
Property / cites work: The optimal reward operator in dynamic programming / rank
 
Normal rank
Property / cites work
 
Property / cites work: Measurable selections of extrema / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3695031 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3316968 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5343895 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Persistently ϵ-Optimal Strategies / rank
 
Normal rank
Property / cites work
 
Property / cites work: Countably additive gambling and optimal stopping / rank
 
Normal rank
Property / cites work
 
Property / cites work: On stationary strategies for absolutely continuous houses / rank
 
Normal rank
Property / cites work
 
Property / cites work: Optimal stopping and almost sure convergence of random sequences / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3329244 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3696892 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Denumerable state semi-Markov decision processes with unbounded costs, average cost criterion / rank
 
Normal rank
Property / cites work
 
Property / cites work: On a Problem of D. Lackwell from the Theory of Dynamic Programming / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5599448 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Renewal Plans and Persistent Optimality in Countably Additive Gambling / rank
 
Normal rank
Property / cites work
 
Property / cites work: On the Existence of Stationary Optimal Strategies / rank
 
Normal rank
Property / cites work
 
Property / cites work: Conditions for optimality in dynamic programming and for the limit of n-stage optimal policies to be optimal / rank
 
Normal rank
Property / cites work
 
Property / cites work: Stationary Policies in Dynamic Programming Models Under Compactness Assumptions / rank
 
Normal rank
Property / cites work
 
Property / cites work: Negative Dynamic Programming / rank
 
Normal rank
Property / cites work
 
Property / cites work: On the Existence of Good Stationary Strategies / rank
 
Normal rank
Property / cites work
 
Property / cites work: A "Fatou Equation" for Randomly Stopped Variables / rank
 
Normal rank
Property / cites work
 
Property / cites work: On Stationary Strategies in Countable State Total Reward Markov Decision Processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3040956 / rank
 
Normal rank
links / mardi / namelinks / mardi / name
 

Latest revision as of 12:55, 17 June 2024

scientific article
Language Label Description Also known as
English
Stationary policies and Markov policies in Borel dynamic programming
scientific article

    Statements

    Stationary policies and Markov policies in Borel dynamic programming (English)
    0 references
    0 references
    0 references
    1986
    0 references
    The question of the existence of good Markov (good stationary) policies is studied for a general class of Borel (stationary) dynamic programming models. It is shown, for example, that Markov (stationary) policies are uniformly adequate if every transition law is absolutely continuous with respect to a fixed measure (and the reward function is positive or the model satisfies certain compactness and continuity conditions).
    0 references
    gambling
    0 references
    Markov policy
    0 references
    stationary policy
    0 references
    persistently optimal
    0 references

    Identifiers

    0 references
    0 references
    0 references
    0 references
    0 references