Joint learning of reward machines and policies in environments with partially known semantics

Publication:6579299

DOI 10.1016/J.ARTINT.2024.104146, MaRDI QID Q6579299

Authors Christos K. Verginis, Cevahir Koprulu, Sandeep Chinchali, Ufuk Topcu

Publication date 25 July 2024

Published in Artificial Intelligence

zbMATH Keywords

reinforcement learning, reward machines, perception limitations

Mathematics Subject Classification ID

Learning and adaptive systems in artificial intelligence (68T05); Formal languages and automata (68Q45); Computational learning theory (68Q32)
