Policy Iteration Reinforcement Learning Based on Geodesic Gaussian Basis Defined on State-action Graph (Q3109039): Difference between revisions
From MaRDI portal
Created claim: Wikidata QID (P12): Q114882168, #quickstatements; #temporary_batch_1705098335825 |
Added link to MaRDI item. |
||
links / mardi / name | links / mardi / name | ||
Revision as of 22:50, 3 February 2024
scientific article
Language | Label | Description | Also known as |
---|---|---|---|
English | Policy Iteration Reinforcement Learning Based on Geodesic Gaussian Basis Defined on State-action Graph |
scientific article |
Statements
Policy Iteration Reinforcement Learning Based on Geodesic Gaussian Basis Defined on State-action Graph (English)
0 references
27 January 2012
0 references
state-action graph
0 references
geodesic Gaussian kernel
0 references
basis function
0 references
policy iteration
0 references
reinforcement learning
0 references