Policy Iteration Reinforcement Learning Based on Geodesic Gaussian Basis Defined on State-action Graph (Q3109039): Difference between revisions
From MaRDI portal
Added link to MaRDI item. |
Set profile property. |
||
Property / MaRDI profile type | |||
Property / MaRDI profile type: MaRDI publication profile / rank | |||
Normal rank |
Latest revision as of 10:43, 5 March 2024
scientific article
Language | Label | Description | Also known as |
---|---|---|---|
English | Policy Iteration Reinforcement Learning Based on Geodesic Gaussian Basis Defined on State-action Graph |
scientific article |
Statements
Policy Iteration Reinforcement Learning Based on Geodesic Gaussian Basis Defined on State-action Graph (English)
0 references
27 January 2012
0 references
state-action graph
0 references
geodesic Gaussian kernel
0 references
basis function
0 references
policy iteration
0 references
reinforcement learning
0 references