Joint learning of reward machines and policies in environments with partially known semantics

From MaRDI portal
Publication:6579299

DOI10.1016/J.ARTINT.2024.104146zbMATH Open1543.6832MaRDI QIDQ6579299FDOQ6579299


Authors: Christos K. Verginis, Cevahir Koprulu, Sandeep Chinchali, Ufuk Topcu Edit this on Wikidata


Publication date: 25 July 2024

Published in: Artificial Intelligence (Search for Journal in Brave)





Recommendations




Cites Work






This page was built for publication: Joint learning of reward machines and policies in environments with partially known semantics

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q6579299)