Chaotic dynamics and convergence analysis of temporal difference algorithms with bang-bang control (Q2800471): Difference between revisions

From MaRDI portal
Import240304020342 (talk | contribs)
Set profile property.
ReferenceBot (talk | contribs)
Changed an Item
 
(One intermediate revision by one other user not shown)
Property / full work available at URL
 
Property / full work available at URL: https://doi.org/10.1002/oca.2156 / rank
 
Normal rank
Property / OpenAlex ID
 
Property / OpenAlex ID: W1520148289 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Optimal chaos control through reinforcement learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: Temporal Difference Methods for General Projected Equations / rank
 
Normal rank
Property / cites work
 
Property / cites work: 10.1162/153244303768966102 / rank
 
Normal rank
Property / cites work
 
Property / cites work: An analysis of temporal-difference learning with function approximation / rank
 
Normal rank
Property / cites work
 
Property / cites work: The convergence of \(TD(\lambda)\) for general \(\lambda\) / rank
 
Normal rank
Property / cites work
 
Property / cites work: On the worst-case analysis of temporal-difference learning algorithms / rank
 
Normal rank
Property / cites work
 
Property / cites work: First-Order System Least Squares for Second-Order Partial Differential Equations: Part II / rank
 
Normal rank
Property / cites work
 
Property / cites work: Micro-chaotic dynamics due to digital sampling in hybrid systems of Filippov type / rank
 
Normal rank

Latest revision as of 20:01, 11 July 2024

scientific article
Language Label Description Also known as
English
Chaotic dynamics and convergence analysis of temporal difference algorithms with bang-bang control
scientific article

    Statements

    Chaotic dynamics and convergence analysis of temporal difference algorithms with bang-bang control (English)
    0 references
    15 April 2016
    0 references
    badly conditioned learning
    0 references
    polynomial basis functions
    0 references
    rate of convergence
    0 references
    temporal difference learning
    0 references
    value function approximation
    0 references

    Identifiers

    0 references
    0 references
    0 references
    0 references
    0 references
    0 references