Preference-based reinforcement learning: a formal framework and a policy iteration algorithm (Q1945130)

From MaRDI portal
Revision as of 03:05, 4 April 2024 by Daniel (created claim: Wikidata QID (P12): Q59195227)
scientific article

    Statements

    Preference-based reinforcement learning: a formal framework and a policy iteration algorithm (English)
    2 April 2013
    reinforcement learning
    preference learning