An incremental off-policy search in a model-free Markov decision process using a single sample path (Q1621868): Difference between revisions
From MaRDI portal
Changed an Item |
Set profile property. |
||
Property / MaRDI profile type | |||
Property / MaRDI profile type: MaRDI publication profile / rank | |||
Normal rank |
Revision as of 04:06, 5 March 2024
scientific article
Language | Label | Description | Also known as |
---|---|---|---|
English | An incremental off-policy search in a model-free Markov decision process using a single sample path |
scientific article |
Statements
An incremental off-policy search in a model-free Markov decision process using a single sample path (English)
0 references
12 November 2018
0 references
Markov decision process
0 references
off-policy prediction
0 references
control problem
0 references
stochastic approximation method
0 references
cross entropy method
0 references
linear function approximation
0 references
ODE method
0 references
global optimization
0 references