Policy set iteration for Markov decision processes (Q2350853)
From MaRDI portal
| This is the item page for this Wikibase entity, intended for internal use and editing purposes. Please use this page instead for the normal view: Policy set iteration for Markov decision processes |
scientific article; zbMATH DE number 6450331
| Language | Label | Description | Also known as |
|---|---|---|---|
| default for all languages | No label defined |
||
| English | Policy set iteration for Markov decision processes |
scientific article; zbMATH DE number 6450331 |
Statements
Policy set iteration for Markov decision processes (English)
0 references
25 June 2015
0 references
Markov decision processes
0 references
policy iteration
0 references
dynamic programming
0 references
randomization
0 references
0.8487301468849182
0 references
0.7811881303787231
0 references
0.7730043530464172
0 references
0.769660472869873
0 references