The convergence of value iteration in average cost Markov decision chains (Q2564235): Difference between revisions

From MaRDI portal
Importer (talk | contribs)
Created a new Item
 
ReferenceBot (talk | contribs)
Changed an Item
 
(3 intermediate revisions by 3 users not shown)
Property / MaRDI profile type
 
Property / MaRDI profile type: MaRDI publication profile / rank
 
Normal rank
Property / full work available at URL
 
Property / full work available at URL: https://doi.org/10.1016/0167-6377(96)00018-1 / rank
 
Normal rank
Property / OpenAlex ID
 
Property / OpenAlex ID: W2054561613 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Discrete-Time Controlled Markov Processes with Average Cost Criterion: A Survey / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3795523 / rank
 
Normal rank
Property / cites work
 
Property / cites work: On Minimum Cost Per Unit Time Control of Markov Chains / rank
 
Normal rank
Property / cites work
 
Property / cites work: Control of Markov Chains with Long-Run Average Cost Criterion: The Dynamic Programming Equations / rank
 
Normal rank
Property / cites work
 
Property / cites work: Comparing recent assumptions for the existence of average optimal stationary policies / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4888202 / rank
 
Normal rank
Property / cites work
 
Property / cites work: On strong average optimality of Markov decision processes with unbounded costs / rank
 
Normal rank
Property / cites work
 
Property / cites work: Linear Programming and Average Optimality of Markov Control Processes on Borel Spaces—Unbounded Costs / rank
 
Normal rank
Property / cites work
 
Property / cites work: Optimal control of diffusion processes with reflection / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3683893 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Average Cost Optimal Stationary Policies in Infinite State Markov Decision Processes with Unbounded Costs / rank
 
Normal rank
Property / cites work
 
Property / cites work: Another set of conditions for average optimality in Markov control processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4851818 / rank
 
Normal rank
links / mardi / namelinks / mardi / name
 

Latest revision as of 10:06, 27 May 2024

scientific article
Language Label Description Also known as
English
The convergence of value iteration in average cost Markov decision chains
scientific article

    Statements

    The convergence of value iteration in average cost Markov decision chains (English)
    0 references
    0 references
    7 January 1997
    0 references
    0 references
    stochastic dynamic programming
    0 references
    value iteration
    0 references
    minimum long-run expected average cost
    0 references
    Markov decision chain
    0 references
    countable state space
    0 references
    0 references