Offline reinforcement learning with task hierarchies (Q1698854): Difference between revisions

From MaRDI portal
Set OpenAlex properties.
ReferenceBot (talk | contribs)
Changed an Item
 
Property / cites work
 
Property / cites work: 10.1162/1532443041827907 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Recent advances in hierarchical reinforcement learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: Recent advances in hierarchical reinforcement learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4527272 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3174040 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5405216 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4878667 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5305630 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4737595 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: \({\mathcal Q}\)-learning / rank
 
Normal rank

Latest revision as of 04:43, 15 July 2024

scientific article
Language Label Description Also known as
English
Offline reinforcement learning with task hierarchies
scientific article

    Statements

    Offline reinforcement learning with task hierarchies (English)
    0 references
    0 references
    0 references
    16 February 2018
    0 references
    0 references
    0 references
    0 references
    0 references
    reinforcement learning
    0 references
    hierarchical reinforcement learning
    0 references
    MAXQ
    0 references
    least-squares policy iteration (LSPI)
    0 references
    0 references