Asymptotics of Reinforcement Learning with Neural Networks (Q5084496): Difference between revisions

From MaRDI portal
RedirectionBot (talk | contribs)
Changed an Item
ReferenceBot (talk | contribs)
Changed an Item
 
(3 intermediate revisions by 3 users not shown)
Property / MaRDI profile type
 
Property / MaRDI profile type: MaRDI publication profile / rank
 
Normal rank
Property / OpenAlex ID
 
Property / OpenAlex ID: W3211566408 / rank
 
Normal rank
Property / Wikidata QID
 
Property / Wikidata QID: Q114925237 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4257216 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Asynchronous Stochastic Approximations / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3721531 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5270493 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Nonlinearity creates linear independence / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4421713 / rank
 
Normal rank
Property / cites work
 
Property / cites work: A mean field view of the landscape of two-layer neural networks / rank
 
Normal rank
Property / cites work
 
Property / cites work: Mean field analysis of neural networks: a central limit theorem / rank
 
Normal rank
Property / cites work
 
Property / cites work: Mean Field Analysis of Neural Networks: A Law of Large Numbers / rank
 
Normal rank
Property / cites work
 
Property / cites work: Asynchronous stochastic approximation and Q-learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: \({\mathcal Q}\)-learning / rank
 
Normal rank

Latest revision as of 11:29, 29 July 2024

scientific article; zbMATH DE number 7547882
Language Label Description Also known as
English
Asymptotics of Reinforcement Learning with Neural Networks
scientific article; zbMATH DE number 7547882

    Statements

    Asymptotics of Reinforcement Learning with Neural Networks (English)
    0 references
    24 June 2022
    0 references
    0 references
    reinforcement learning
    0 references
    neural networks
    0 references
    Q-learning
    0 references
    deep reinforcement learning
    0 references
    weak convergence
    0 references
    0 references
    0 references