Planning for potential: efficient safe reinforcement learning
Publication:2163254
DOI: 10.1007/s10994-022-06143-6, OpenAlex: W4220904784, MaRDI QID: Q2163254
Mark Hoogendoorn, V. François-Lavet, Floris den Hengst, Frank van Harmelen
Publication date: 10 August 2022
Published in: Machine Learning
Full work available at URL: https://doi.org/10.1007/s10994-022-06143-6
Cites Work
- \({\mathcal Q}\)-learning
- Deep reinforcement learning with temporal logics
- Challenges of real-world reinforcement learning: definitions, benchmarks and analysis
- Infinite Games
- Shield Synthesis:
- A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play