Preference-based reinforcement learning: a formal framework and a policy iteration algorithm (Q1945130): Difference between revisions
From MaRDI portal
Set OpenAlex properties. |
Created claim: Wikidata QID (P12): Q59195227, #quickstatements; #temporary_batch_1712190744730 |
||
Property / Wikidata QID | |||
Property / Wikidata QID: Q59195227 / rank | |||
Normal rank |
Revision as of 03:05, 4 April 2024
scientific article
Language | Label | Description | Also known as |
---|---|---|---|
English | Preference-based reinforcement learning: a formal framework and a policy iteration algorithm |
scientific article |
Statements
Preference-based reinforcement learning: a formal framework and a policy iteration algorithm (English)
0 references
2 April 2013
0 references
reinforcement learning
0 references
preference learning
0 references