Simulation‐based Uniform Value Function Estimates of Markov Decision Processes (Q3593009)
From MaRDI portal
scientific article; zbMATH DE number 5194905
| Language | Label | Description | Also known as |
|---|---|---|---|
| default for all languages | No label defined | | |
| English | Simulation‐based Uniform Value Function Estimates of Markov Decision Processes | scientific article; zbMATH DE number 5194905 | |
Statements
Simulation‐based Uniform Value Function Estimates of Markov Decision Processes (English)
24 September 2007
Markov decision processes
Markov games
empirical process theory
PAC learning
value function estimation
uniform rate of convergence
0.88816375
0.88769597
0.8842571
0.88249123
0.88166106
0.86942315