Error bounds for constant step-size \(Q\)-learning (Q1932736): Difference between revisions

From MaRDI portal
Changed an Item
ReferenceBot (talk | contribs)
Changed an Item
 
(2 intermediate revisions by 2 users not shown)
Property / MaRDI profile type
 
Property / MaRDI profile type: MaRDI publication profile / rank
 
Normal rank
Property / full work available at URL
 
Property / full work available at URL: https://doi.org/10.1016/j.sysconle.2012.08.014 / rank
 
Normal rank
Property / OpenAlex ID
 
Property / OpenAlex ID: W1999254175 / rank
 
Normal rank
Property / cites work
 
Property / cites work: \({\mathcal Q}\)-learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: Asynchronous stochastic approximation and Q-learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: On the Convergence of Stochastic Iterative Dynamic Programming Algorithms / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3093180 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4346705 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3527701 / rank
 
Normal rank
Property / cites work
 
Property / cites work: The O.D.E. Method for Convergence of Stochastic Approximation and Reinforcement Learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: Boundedness of iterates in \(Q\)-learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q-learning and policy iteration algorithms for stochastic shortest path problems / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q-Learning and Enhanced Policy Iteration in Discounted Dynamic Programming / rank
 
Normal rank

Latest revision as of 02:27, 6 July 2024

scientific article
Language Label Description Also known as
English
Error bounds for constant step-size \(Q\)-learning
scientific article

    Statements

    Identifiers