Preference-based reinforcement learning: a formal framework and a policy iteration algorithm

From MaRDI portal
Revision as of 15:58, 1 February 2024 by Import240129110113 (talk | contribs) (Created automatically from import240129110113)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

Publication:1945130

DOI10.1007/S10994-012-5313-8zbMath1260.68328OpenAlexW2154023516WikidataQ59195227 ScholiaQ59195227MaRDI QIDQ1945130

Weiwei Cheng, Johannes Fürnkranz, Sang-Hyeun Park, Eyke Hüllermeier

Publication date: 2 April 2013

Published in: Machine Learning (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1007/s10994-012-5313-8




Related Items (7)


Uses Software



Cites Work




This page was built for publication: Preference-based reinforcement learning: a formal framework and a policy iteration algorithm