An incremental off-policy search in a model-free Markov decision process using a single sample path (Q1621868): Difference between revisions
From MaRDI portal
Set OpenAlex properties. |
Changed an Item |
||
Property / arXiv ID | |||
Property / arXiv ID: 1801.10287 / rank | |||
Normal rank |
Revision as of 19:57, 18 April 2024
scientific article
Language | Label | Description | Also known as |
---|---|---|---|
English | An incremental off-policy search in a model-free Markov decision process using a single sample path |
scientific article |
Statements
An incremental off-policy search in a model-free Markov decision process using a single sample path (English)
0 references
12 November 2018
0 references
Markov decision process
0 references
off-policy prediction
0 references
control problem
0 references
stochastic approximation method
0 references
cross entropy method
0 references
linear function approximation
0 references
ODE method
0 references
global optimization
0 references