Dealing with multiple experts and non-stationarity in inverse reinforcement learning: an application to real-life problems (Q2071401): Difference between revisions

From MaRDI portal
Added link to MaRDI item.
ReferenceBot (talk | contribs)
Changed an Item
 
(4 intermediate revisions by 3 users not shown)
Property / describes a project that uses
 
Property / describes a project that uses: SUMO / rank
 
Normal rank
Property / describes a project that uses
 
Property / describes a project that uses: CARLA / rank
 
Normal rank
Property / MaRDI profile type
 
Property / MaRDI profile type: MaRDI publication profile / rank
 
Normal rank
Property / full work available at URL
 
Property / full work available at URL: https://doi.org/10.1007/s10994-020-05939-8 / rank
 
Normal rank
Property / OpenAlex ID
 
Property / OpenAlex ID: W3137303899 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4533362 / rank
 
Normal rank
Property / cites work
 
Property / cites work: On a routing problem / rank
 
Normal rank
Property / cites work
 
Property / cites work: Multi-robot inverse reinforcement learning under occlusion with estimation of state transitions / rank
 
Normal rank
Property / cites work
 
Property / cites work: Deep hedging / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3996777 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4708017 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Policy search for motor primitives in robotics / rank
 
Normal rank
Property / cites work
 
Property / cites work: A well-conditioned estimator for large-dimensional covariance matrices / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q2810828 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5491447 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4315289 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3093299 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4626283 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Simple statistical gradient-following algorithms for connectionist reinforcement learning / rank
 
Normal rank

Latest revision as of 21:36, 27 July 2024

scientific article
Language Label Description Also known as
English
Dealing with multiple experts and non-stationarity in inverse reinforcement learning: an application to real-life problems
scientific article

    Statements

    Dealing with multiple experts and non-stationarity in inverse reinforcement learning: an application to real-life problems (English)
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    28 January 2022
    0 references
    inverse reinforcement learning
    0 references
    model-free IRL
    0 references
    truly batch IRL
    0 references
    IRL for real life
    0 references
    multiple experts IRL
    0 references
    non-stationary IRL
    0 references
    0 references
    0 references

    Identifiers