Joint learning of reward machines and policies in environments with partially known semantics

From MaRDI portal
Publication:6579299

DOI: 10.1016/J.ARTINT.2024.104146
zbMATH Open: 1543.6832
MaRDI QID: Q6579299
FDO: Q6579299


Authors: Christos K. Verginis, Cevahir Koprulu, Sandeep Chinchali, Ufuk Topcu


Publication date: 25 July 2024

Published in: Artificial Intelligence






zbMATH Keywords

reinforcement learning, reward machines, perception limitations


Mathematics Subject Classification ID

  • Learning and adaptive systems in artificial intelligence (68T05)
  • Formal languages and automata (68Q45)
  • Computational learning theory (68Q32)


Cites Work

  • PySAT: a Python toolkit for prototyping with SAT oracles
  • \({\mathcal Q}\)-learning
  • Title not available
  • Complexity of automaton identification from given data
  • Exact DFA Identification Using SAT Solvers
  • Omega-Regular Objectives in Model-Free Reinforcement Learning
  • Reward Machines: Exploiting Reward Function Structure in Reinforcement Learning







Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:6579299&oldid=40121238"
This page was last edited on 13 February 2025, at 17:40. Warning: Page may not contain recent updates.