A perturbation approach to a class of discounted approximate value iteration algorithms with Borel spaces (Q330284): Difference between revisions

From MaRDI portal
Property / Mathematics Subject Classification ID: 93E20
Property / Mathematics Subject Classification ID: 90C59
Property / Mathematics Subject Classification ID: 90C40
Property / zbMATH DE Number: 6643067
Property / zbMATH Keywords: Markov decision processes
Property / zbMATH Keywords: discounted criterion
Property / zbMATH Keywords: approximate value iteration algorithm
Property / zbMATH Keywords: perturbed models
Property / MaRDI profile type: MaRDI publication profile
Property / full work available at URL: https://doi.org/10.3934/jdg.2016014
Property / OpenAlex ID: W2507723477
Property / cites work: Approximate Fixed Point Iteration with an Application to Infinite Horizon Markov Decision Processes
Property / cites work: Approximate dynamic programming via direct search in the space of value function approximations
Property / cites work: Q3795523
Property / cites work: Approximate policy iteration: a survey and some new methods
Property / cites work: Q4257216
Property / cites work: Q4433637
Property / cites work: The approximation of continuous functions by positive linear operators
Property / cites work: Approximation of Infinite Horizon Discounted Cost Markov Decision Processes
Property / cites work: On the existence of fixed points for approximate value iteration and temporal-difference learning
Property / cites work: Adaptive Markov control processes
Property / cites work: Q4255598
Property / cites work: An Approximate Dynamic Programming Algorithm for Monotone Value Functions
Property / cites work: Performance Bounds in $L_p$-norm for Approximate Value Iteration
Property / cites work: Approximate Dynamic Programming
Property / cites work: A review of stochastic algorithms with continuous value function approximation and some new approximate policy iteration algorithms for multidimensional continuous applications
Property / cites work: Perspectives of approximate dynamic programming
Property / cites work: Q4315289
Property / cites work: Q4369442
Property / cites work: Continuous state dynamic programming via nonexpansive approximation
Property / cites work: Q3630866
Property / cites work: A survey of Markov decision models for control of networks of queues
Property / cites work: Performance Loss Bounds for Approximate Value Iteration with State Aggregation
Property / cites work: Application of average dynamic programming to inventory systems
Property / cites work: A Survey of Applications of Markov Decision Processes

Latest revision as of 19:54, 12 July 2024

Language: English
Label: A perturbation approach to a class of discounted approximate value iteration algorithms with Borel spaces
Description: scientific article

    Statements

    A perturbation approach to a class of discounted approximate value iteration algorithms with Borel spaces (English)
    25 October 2016
    Markov decision processes
    discounted criterion
    approximate value iteration algorithm
    perturbed models