(Approximate) iterated successive approximations algorithm for sequential decision processes (Q378751): Difference between revisions

From MaRDI portal
Importer (talk | contribs)
Created a new Item
 
ReferenceBot (talk | contribs)
Changed an Item
 
(4 intermediate revisions by 4 users not shown)
Property / Mathematics Subject Classification ID
 
Property / Mathematics Subject Classification ID: 90C40 / rank
 
Normal rank
Property / Mathematics Subject Classification ID
 
Property / Mathematics Subject Classification ID: 90C59 / rank
 
Normal rank
Property / zbMATH DE Number
 
Property / zbMATH DE Number: 6225980 / rank
 
Normal rank
Property / zbMATH Keywords
 
sequential decision processes
Property / zbMATH Keywords: sequential decision processes / rank
 
Normal rank
Property / zbMATH Keywords
 
Markov decision chains
Property / zbMATH Keywords: Markov decision chains / rank
 
Normal rank
Property / zbMATH Keywords
 
successive approximations
Property / zbMATH Keywords: successive approximations / rank
 
Normal rank
Property / zbMATH Keywords
 
modified policy iteration
Property / zbMATH Keywords: modified policy iteration / rank
 
Normal rank
Property / MaRDI profile type
 
Property / MaRDI profile type: MaRDI publication profile / rank
 
Normal rank
Property / full work available at URL
 
Property / full work available at URL: https://doi.org/10.1007/s10479-012-1073-x / rank
 
Normal rank
Property / OpenAlex ID
 
Property / OpenAlex ID: W2014981566 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q-Learning and Enhanced Policy Iteration in Discounted Dynamic Programming / rank
 
Normal rank
Property / cites work
 
Property / cites work: Truncated policy iteration methods / rank
 
Normal rank
Property / cites work
 
Property / cites work: Contraction Mappings in the Theory Underlying Dynamic Programming / rank
 
Normal rank
Property / cites work
 
Property / cites work: Affine Structure and Invariant Policies for Dynamic Programs / rank
 
Normal rank
Property / cites work
 
Property / cites work: Block-successive approximation for a discounted Markov decision model / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3313617 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3266141 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4547434 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Some Bounds for Discounted Sequential Decision Processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Improved iterative computation of the expected discounted return in Markov and semi-Markov chains / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4315289 / rank
 
Normal rank
Property / cites work
 
Property / cites work: On the Convergence of Policy Iteration in Stationary Dynamic Programming / rank
 
Normal rank
Property / cites work
 
Property / cites work: Modified Policy Iteration Algorithms for Discounted Markov Decision Problems / rank
 
Normal rank
Property / cites work
 
Property / cites work: Action Elimination Procedures for Modified Policy Iteration Algorithms / rank
 
Normal rank
Property / cites work
 
Property / cites work: Discounted Markov games: Generalized policy iteration method / rank
 
Normal rank
Property / cites work
 
Property / cites work: A set of successive approximation methods for discounted Markovian decision problems / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4190426 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Approximations of Dynamic Programs, I / rank
 
Normal rank
links / mardi / namelinks / mardi / name
 

Latest revision as of 01:48, 7 July 2024

scientific article
Language Label Description Also known as
English
(Approximate) iterated successive approximations algorithm for sequential decision processes
scientific article

    Statements

    (Approximate) iterated successive approximations algorithm for sequential decision processes (English)
    0 references
    0 references
    0 references
    12 November 2013
    0 references
    0 references
    sequential decision processes
    0 references
    Markov decision chains
    0 references
    successive approximations
    0 references
    modified policy iteration
    0 references
    0 references