A polynomial time bound for Howard's policy improvement algorithm (Q1079511)

From MaRDI portal

Revision as of 01:29, 31 January 2024

scientific article
Language: English
Label: A polynomial time bound for Howard's policy improvement algorithm
Description: scientific article
Also known as: none listed

    Statements

    A polynomial time bound for Howard's policy improvement algorithm (English)
    1986
    discounted Markovian Decision Process
    finite state and action space
    Howard's policy improvement
    optimal policy