An incremental off-policy search in a model-free Markov decision process using a single sample path (Q1621868): Difference between revisions
From MaRDI portal
Set profile property. |
Set OpenAlex properties. |
||
Property / OpenAlex ID | |||
Property / OpenAlex ID: W2963057120 / rank | |||
Normal rank |
Revision as of 20:01, 19 March 2024
scientific article
Language | Label | Description | Also known as |
---|---|---|---|
English | An incremental off-policy search in a model-free Markov decision process using a single sample path |
scientific article |
Statements
An incremental off-policy search in a model-free Markov decision process using a single sample path (English)
0 references
12 November 2018
0 references
Markov decision process
0 references
off-policy prediction
0 references
control problem
0 references
stochastic approximation method
0 references
cross entropy method
0 references
linear function approximation
0 references
ODE method
0 references
global optimization
0 references