\({\mathcal Q}\)-learning (Q1812931): Difference between revisions

From MaRDI portal
Importer (talk | contribs)
Created a new Item
 
ReferenceBot (talk | contribs)
Changed an Item
 
(3 intermediate revisions by 3 users not shown)
Property / MaRDI profile type
 
Property / MaRDI profile type: MaRDI publication profile / rank
 
Normal rank
Property / Wikidata QID
 
Property / Wikidata QID: Q57424214 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3292915 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4013741 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Stochastic approximation methods for constrained and unconstrained systems / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3683893 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Learning control of finite Markov chains with an explicit trade-off between estimation and control / rank
 
Normal rank
links / mardi / namelinks / mardi / name
 

Latest revision as of 15:52, 14 May 2024

scientific article
Language Label Description Also known as
English
\({\mathcal Q}\)-learning
scientific article

    Statements

    \({\mathcal Q}\)-learning (English)
    0 references
    0 references
    11 August 1992
    0 references
    reinforcement learning
    0 references
    temporal differences
    0 references
    asynchronous dynamic programming
    0 references
    \({\mathcal Q}\)-learning
    0 references

    Identifiers