An incremental off-policy search in a model-free Markov decision process using a single sample path (Q1621868)
From MaRDI portal
scientific article
Language | Label | Description | Also known as |
---|---|---|---|
English | An incremental off-policy search in a model-free Markov decision process using a single sample path |
scientific article |
Statements
An incremental off-policy search in a model-free Markov decision process using a single sample path (English)
0 references
12 November 2018
0 references
Markov decision process
0 references
off-policy prediction
0 references
control problem
0 references
stochastic approximation method
0 references
cross entropy method
0 references
linear function approximation
0 references
ODE method
0 references
global optimization
0 references
0 references
0 references
0 references