New algorithms of the Q-learning type (Q2440701): Difference between revisions

From MaRDI portal
Importer (talk | contribs)
Created a new Item
 
ReferenceBot (talk | contribs)
Changed an Item
 
(3 intermediate revisions by 3 users not shown)
Property / MaRDI profile type
 
Property / MaRDI profile type: MaRDI publication profile / rank
 
Normal rank
Property / full work available at URL
 
Property / full work available at URL: https://doi.org/10.1016/j.automatica.2007.09.009 / rank
 
Normal rank
Property / OpenAlex ID
 
Property / OpenAlex ID: W2118458590 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4257216 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Two-timescale simultaneous perturbation stochastic approximation using deterministic perturbation sequences / rank
 
Normal rank
Property / cites work
 
Property / cites work: Stochastic approximation with two time scales / rank
 
Normal rank
Property / cites work
 
Property / cites work: Actor-Critic--Type Learning Algorithms for Markov Decision Processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Multivariate stochastic approximation using a simultaneous perturbation gradient approximation / rank
 
Normal rank
Property / cites work
 
Property / cites work: A one-measurement form of simultaneous perturbation stochastic approximation / rank
 
Normal rank
Property / cites work
 
Property / cites work: Asynchronous stochastic approximation and Q-learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: \({\mathcal Q}\)-learning / rank
 
Normal rank
links / mardi / namelinks / mardi / name
 

Latest revision as of 12:29, 7 July 2024

scientific article
Language Label Description Also known as
English
New algorithms of the Q-learning type
scientific article

    Statements