Approximate dynamic programming via direct search in the space of value function approximations (Q713118): Difference between revisions

From MaRDI portal
Set OpenAlex properties.
ReferenceBot (talk | contribs)
Changed an Item
 
Property / cites work
 
Property / cites work: Q3134873 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3241581 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4209222 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4257216 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Projected equation methods for approximate solution of large linear systems / rank
 
Normal rank
Property / cites work
 
Property / cites work: A new learning algorithm for optimal stopping / rank
 
Normal rank
Property / cites work
 
Property / cites work: Performance Loss Bounds for Approximate Value Iteration with State Aggregation / rank
 
Normal rank
Property / cites work
 
Property / cites work: Discrete Dynamic Programming with Unbounded Rewards / rank
 
Normal rank
Property / cites work
 
Property / cites work: 10.1162/1532443041827907 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5630824 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Direct search methods: Then and now / rank
 
Normal rank
Property / cites work
 
Property / cites work: Basis function adaptation in temporal difference reinforcement learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: Control Techniques for Complex Networks / rank
 
Normal rank
Property / cites work
 
Property / cites work: Approximate Dynamic Programming / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4315289 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Practical issues in temporal difference learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: On the Convergence of Pattern Search Algorithms / rank
 
Normal rank
Property / cites work
 
Property / cites work: An empirical study of policy convergence in Markov decision process value iteration / rank
 
Normal rank
Property / cites work
 
Property / cites work: Convergence Results for Some Temporal Difference Methods Based on Least Squares / rank
 
Normal rank

Latest revision as of 19:06, 5 July 2024

scientific article
Language Label Description Also known as
English
Approximate dynamic programming via direct search in the space of value function approximations
scientific article

    Statements

    Approximate dynamic programming via direct search in the space of value function approximations (English)
    0 references
    26 October 2012
    0 references
    dynamic programming
    0 references
    Markov decision processes
    0 references
    convex optimization
    0 references
    direct search methods
    0 references
    0 references
    0 references
    0 references

    Identifiers