No label defined (Q3745652)
From MaRDI portal
scientific article; zbMATH DE number 3980959
| Language | Label | Description | Also known as |
|---|---|---|---|
| default for all languages | No label defined | | |
| English | No label defined | scientific article; zbMATH DE number 3980959 | |
Statements
1985
naive feedback controller
average reward adaptive Markov decision processes
countable state space
compact feasible action sets
strong scrambling condition
successive approximation
nonstationary value-iteration
0.867232620716095
0.8553093671798706
0.8466004729270935