Policy Iteration Reinforcement Learning Based on Geodesic Gaussian Basis Defined on State-action Graph (Q3109039): Difference between revisions
From MaRDI portal
Created a new Item |
Set profile property. |
||
(2 intermediate revisions by 2 users not shown) | |||
Property / Wikidata QID | |||
Property / Wikidata QID: Q114882168 / rank | |||
Normal rank | |||
Property / MaRDI profile type | |||
Property / MaRDI profile type: MaRDI publication profile / rank | |||
Normal rank | |||
links / mardi / name | links / mardi / name | ||
Latest revision as of 10:43, 5 March 2024
scientific article
Language | Label | Description | Also known as |
---|---|---|---|
English | Policy Iteration Reinforcement Learning Based on Geodesic Gaussian Basis Defined on State-action Graph |
scientific article |
Statements
Policy Iteration Reinforcement Learning Based on Geodesic Gaussian Basis Defined on State-action Graph (English)
0 references
27 January 2012
0 references
state-action graph
0 references
geodesic Gaussian kernel
0 references
basis function
0 references
policy iteration
0 references
reinforcement learning
0 references