Preference-based reinforcement learning: a formal framework and a policy iteration algorithm

Publication:1945130

DOI: 10.1007/s10994-012-5313-8
zbMath: 1260.68328
OpenAlex: W2154023516
Wikidata: Q59195227
Scholia: Q59195227
MaRDI QID: Q1945130

Weiwei Cheng, Johannes Fürnkranz, Sang-Hyeun Park, Eyke Hüllermeier

Publication date: 2 April 2013

Published in: Machine Learning

Full work available at URL: https://doi.org/10.1007/s10994-012-5313-8


