A Mixed Value and Policy Iteration Method for Stochastic Control with Universally Measurable Policies (Q3465941): Difference between revisions

From MaRDI portal
Importer (talk | contribs)
Created a new Item
 
Import241208061232 (talk | contribs)
Normalize DOI.
 
(5 intermediate revisions by 5 users not shown)
Property / DOI
 
Property / DOI: 10.1287/moor.2014.0704 / rank
Normal rank
 
Property / MaRDI profile type
 
Property / MaRDI profile type: Publication / rank
 
Normal rank
Property / OpenAlex ID
 
Property / OpenAlex ID: W2126905415 / rank
 
Normal rank
Property / arXiv ID
 
Property / arXiv ID: 1308.3814 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Discrete-Time Controlled Markov Processes with Average Cost Criterion: A Survey / rank
 
Normal rank
Property / cites work
 
Property / cites work: Infinite time reachability of state-space regions by using feedback control / rank
 
Normal rank
Property / cites work
 
Property / cites work: Monotone Mappings with Application in Dynamic Programming / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q-Learning and Enhanced Policy Iteration in Discounted Dynamic Programming / rank
 
Normal rank
Property / cites work
 
Property / cites work: Memoryless Strategies in Finite-Stage Dynamic Programming / rank
 
Normal rank
Property / cites work
 
Property / cites work: Discounted Dynamic Programming / rank
 
Normal rank
Property / cites work
 
Property / cites work: A Borel Set Not Containing a Graph / rank
 
Normal rank
Property / cites work
 
Property / cites work: Borel-programmable functions / rank
 
Normal rank
Property / cites work
 
Property / cites work: Non-Existence of Everywhere Proper Conditional Distributions / rank
 
Normal rank
Property / cites work
 
Property / cites work: The optimal reward operator in dynamic programming / rank
 
Normal rank
Property / cites work
 
Property / cites work: Value iteration and optimization of multiclass queueing networks / rank
 
Normal rank
Property / cites work
 
Property / cites work: Handbook of Markov decision processes. Methods and applications / rank
 
Normal rank
Property / cites work
 
Property / cites work: Average Cost Markov Decision Processes with Weakly Continuous Transition Probabilities / rank
 
Normal rank
Property / cites work
 
Property / cites work: The optimal reward operator in special classes of dynamic programming problems / rank
 
Normal rank
Property / cites work
 
Property / cites work: Markovian Decision Processes with Compact Action Spaces / rank
 
Normal rank
Property / cites work
 
Property / cites work: A simple proof of Whittle's bridging condition in dynamic programming / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4255598 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5599448 / rank
 
Normal rank
Property / cites work
 
Property / cites work: On the Optimality of Structured Policies in Countable Stage Decision Processes. II: Positive and Negative Problems / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5578711 / rank
 
Normal rank
Property / cites work
 
Property / cites work: The Optimal Reward Operator in Negative Dynamic Programming / rank
 
Normal rank
Property / cites work
 
Property / cites work: Discrete Dynamic Programming with a Small Interest Rate / rank
 
Normal rank
Property / cites work
 
Property / cites work: Conditions for optimality in dynamic programming and for the limit of n-stage optimal policies to be optimal / rank
 
Normal rank
Property / cites work
 
Property / cites work: Borel-approachable functions / rank
 
Normal rank
Property / cites work
 
Property / cites work: Alternative Theoretical Frameworks for Finite Horizon Discrete-Time Stochastic Optimal Control / rank
 
Normal rank
Property / cites work
 
Property / cites work: Universally Measurable Policies in Dynamic Programming / rank
 
Normal rank
Property / cites work
 
Property / cites work: Negative Dynamic Programming / rank
 
Normal rank
Property / cites work
 
Property / cites work: On Finding Optimal Policies in Discrete Dynamic Programming with No Discounting / rank
 
Normal rank
Property / cites work
 
Property / cites work: Discrete Dynamic Programming with Sensitive Discount Optimality Criteria / rank
 
Normal rank
Property / cites work
 
Property / cites work: A simple condition for regularity in negative programming / rank
 
Normal rank
Property / cites work
 
Property / cites work: Stability and characterisation conditions in negative programming / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q-learning and policy iteration algorithms for stochastic shortest path problems / rank
 
Normal rank
Property / DOI
 
Property / DOI: 10.1287/MOOR.2014.0704 / rank
 
Normal rank
links / mardi / namelinks / mardi / name
 

Latest revision as of 09:42, 21 December 2024

scientific article
Language Label Description Also known as
English
A Mixed Value and Policy Iteration Method for Stochastic Control with Universally Measurable Policies
scientific article

    Statements

    A Mixed Value and Policy Iteration Method for Stochastic Control with Universally Measurable Policies (English)
    0 references
    0 references
    0 references
    29 January 2016
    0 references
    discrete-time stochastic control
    0 references
    Borel spaces Markov decision process
    0 references
    total cost criteria
    0 references
    measurability
    0 references
    value iteration
    0 references
    policy iteration
    0 references
    convergence
    0 references
    0 references
    0 references
    0 references

    Identifiers

    0 references
    0 references
    0 references
    0 references
    0 references
    0 references