Target-level criterion in Markov decision processes (Q1904951): Difference between revisions

From MaRDI portal
Property / cites work (each with normal rank):
    Q4131339
    Q3251780
    The Newsboy Problem under Alternative Optimization Objectives
    Note—Note on “Optimal Ordering Quantity to Realize a Pre-Determined Level of Profit”
    A Preference Order Dynamic Program for a Stochastic Traveling Salesman Problem
    Percentiles and Markovian decision processes
    Q3313617
    The variance of discounted Markov decision processes
    Discounted MDP’s: Distribution Functions and Exponential Utility Maximization
    Q5615108

Property / full work available at URL (normal rank):
    https://doi.org/10.1007/bf02193458

Property / OpenAlex ID (normal rank):
    W2066118288

Latest revision as of 11:11, 30 July 2024

Language: English
Label: Target-level criterion in Markov decision processes
Description: scientific article

    Statements

    Target-level criterion in Markov decision processes (English)
    15 January 1996
    target-level criterion
    successive approximations
    total discounted rewards
    optimal return operator
    infinite planning-horizon
    optimal value function
    maximal fixed point
    turnpike results

    Identifiers