Policy Iteration Reinforcement Learning Based on Geodesic Gaussian Basis Defined on State-action Graph (Q3109039)

From MaRDI portal
Property / Wikidata QID: Q114882168 (normal rank)
Property / MaRDI profile type: MaRDI publication profile (normal rank)

Latest revision as of 10:43, 5 March 2024

Language: English
Label: Policy Iteration Reinforcement Learning Based on Geodesic Gaussian Basis Defined on State-action Graph
Description: scientific article

    Statements

    Policy Iteration Reinforcement Learning Based on Geodesic Gaussian Basis Defined on State-action Graph (English)
    27 January 2012
    state-action graph
    geodesic Gaussian kernel
    basis function
    policy iteration
    reinforcement learning

    Identifiers