Probabilistic inference for determining options in reinforcement learning (Q331688): Difference between revisions

From MaRDI portal
Set OpenAlex properties.
ReferenceBot (talk | contribs)
Changed an Item
Property / cites work
 
Property / cites work: Q5483032 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3188018 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Using Expectation-Maximization for Reinforcement Learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4527272 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3174169 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Policy search for motor primitives in robotics / rank
 
Normal rank
Property / cites work
 
Property / cites work: 10.1162/1532443041827907 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4709211 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Acquisition of stand-up behavior by a real robot using hierarchical reinforcement learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4315289 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4737595 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q2896181 / rank
 
Normal rank
Property / cites work
 
Property / cites work: \({\mathcal Q}\)-learning / rank
 
Normal rank

Revision as of 19:23, 12 July 2024

scientific article
Language Label Description Also known as
English
Probabilistic inference for determining options in reinforcement learning
scientific article

    Statements

    Probabilistic inference for determining options in reinforcement learning (English)
    0 references
    0 references
    0 references
    0 references
    0 references
    27 October 2016
    0 references
    reinforcement learning
    0 references
    robot learning
    0 references
    options
    0 references
    semi-Markov decision process
    0 references

    Identifiers