A perturbation approach to a class of discounted approximate value iteration algorithms with Borel spaces (Q330284): Difference between revisions

From MaRDI portal
Added link to MaRDI item.
ReferenceBot (talk | contribs)
Changed an Item
 
(2 intermediate revisions by 2 users not shown)
Property / MaRDI profile type
 
Property / MaRDI profile type: MaRDI publication profile / rank
 
Normal rank
Property / full work available at URL
 
Property / full work available at URL: https://doi.org/10.3934/jdg.2016014 / rank
 
Normal rank
Property / OpenAlex ID
 
Property / OpenAlex ID: W2507723477 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Approximate Fixed Point Iteration with an Application to Infinite Horizon Markov Decision Processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Approximate dynamic programming via direct search in the space of value function approximations / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3795523 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Approximate policy iteration: a survey and some new methods / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4257216 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4433637 / rank
 
Normal rank
Property / cites work
 
Property / cites work: The approximation of continuous functions by positive linear operators / rank
 
Normal rank
Property / cites work
 
Property / cites work: Approximation of Infinite Horizon Discounted Cost Markov Decision Processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: On the existence of fixed points for approximate value iteration and temporal-difference learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: Adaptive Markov control processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4255598 / rank
 
Normal rank
Property / cites work
 
Property / cites work: An Approximate Dynamic Programming Algorithm for Monotone Value Functions / rank
 
Normal rank
Property / cites work
 
Property / cites work: Performance Bounds in $L_p$‐norm for Approximate Value Iteration / rank
 
Normal rank
Property / cites work
 
Property / cites work: Approximate Dynamic Programming / rank
 
Normal rank
Property / cites work
 
Property / cites work: A review of stochastic algorithms with continuous value function approximation and some new approximate policy iteration algorithms for multidimensional continuous applications / rank
 
Normal rank
Property / cites work
 
Property / cites work: Perspectives of approximate dynamic programming / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4315289 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4369442 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Continuous state dynamic programming via nonexpansive approximation / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3630866 / rank
 
Normal rank
Property / cites work
 
Property / cites work: A survey of Markov decision models for control of networks of queues / rank
 
Normal rank
Property / cites work
 
Property / cites work: Performance Loss Bounds for Approximate Value Iteration with State Aggregation / rank
 
Normal rank
Property / cites work
 
Property / cites work: Application of average dynamic programming to inventory systems / rank
 
Normal rank
Property / cites work
 
Property / cites work: A Survey of Applications of Markov Decision Processes / rank
 
Normal rank

Latest revision as of 18:54, 12 July 2024

scientific article
Language Label Description Also known as
English
A perturbation approach to a class of discounted approximate value iteration algorithms with Borel spaces
scientific article

    Statements

    A perturbation approach to a class of discounted approximate value iteration algorithms with Borel spaces (English)
    0 references
    25 October 2016
    0 references
    Markov decision processes
    0 references
    discounted criterion
    0 references
    approximate value iteration algorithm
    0 references
    perturbed models
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references

    Identifiers

    0 references
    0 references
    0 references
    0 references
    0 references
    0 references