Two Time-Scale Stochastic Approximation with Controlled Markov Noise and Off-Policy Temporal-Difference Learning (Q5219302): Difference between revisions

From MaRDI portal
 
Property / cites work: Q3324260 (Normal rank)
Property / cites work: Q4938927 (Normal rank)
Property / cites work: Stochastic Approximations and Differential Inclusions (Normal rank)
Property / cites work: Q3997575 (Normal rank)
Property / cites work: Q4858374 (Normal rank)
Property / cites work: Stochastic approximation with two time scales (Normal rank)
Property / cites work: Stochastic approximation with 'controlled Markov' noise (Normal rank)
Property / cites work: Linear stochastic approximation driven by slowly varying Markov chains (Normal rank)
Property / cites work: On Actor-Critic Algorithms (Normal rank)
Property / cites work: Stochastic approximations for finite-state Markov chains (Normal rank)
Property / cites work: Basis function adaptation in temporal difference reinforcement learning (Normal rank)
Property / cites work: Applications of a Kushner and Clark lemma to general classes of stochastic algorithms (Normal rank)
Property / cites work: Q5526189 (Normal rank)
Property / cites work: Convergence and convergence rate of stochastic gradient search in the case of multiple and non-isolated extrema (Normal rank)
Property / cites work: Least Squares Temporal Difference Methods: An Analysis under General Conditions (Normal rank)
Property / cites work: Q2953645 (Normal rank)

Latest revision as of 02:28, 22 July 2024

Language: English
Label: Two Time-Scale Stochastic Approximation with Controlled Markov Noise and Off-Policy Temporal-Difference Learning
Description: scientific article; zbMATH DE number 7179328

    Statements

    Two Time-Scale Stochastic Approximation with Controlled Markov Noise and Off-Policy Temporal-Difference Learning (English)
    11 March 2020
    Markov noise
    two time-scale stochastic approximation
    asymptotic convergence
    temporal-difference learning