scientific article; zbMATH DE number 7559459
From MaRDI portal
Publication:5089265
DOI10.4230/LIPICS.CONCUR.2020.3MaRDI QIDQ5089265FDOQ5089265
Authors: Nils Jansen, Bettina Könighofer, Sebastian Junges, Alex Serban, Roderick Bloem
Publication date: 18 July 2022
Full work available at URL: https://arxiv.org/abs/1807.06096
Title of this publication is not available (Why is that?)
Cites Work
- dtControl
- Title not available (Why is that?)
- Path-constrained Markov decision processes: bridging the gap between probabilistic model-checking and decision-theoretic planning
- Title not available (Why is that?)
- Reinforcement learning. An introduction
- A survey of multi-objective sequential decision-making
- Title not available (Why is that?)
- Shield synthesis
- Deep reinforcement learning with temporal logics
- Permissive controller synthesis for probabilistic systems
- Graph Games and Reactive Synthesis
- Verification of Markov decision processes using learning algorithms
- Learning-based mean-payoff optimization in an unknown MDP under omega-regular constraints
- Shield synthesis: runtime enforcement for reactive systems
- A comprehensive survey on safe reinforcement learning
- Estimator-based reactive synthesis under incomplete information
- The probabilistic model checking landscape
- Safety-aware apprenticeship learning
Cited In (10)
- Lifted model checking for relational MDPs
- Enforcing almost-sure reachability in POMDPs
- Runtime monitors for Markov decision processes
- Safe Exploration of State and Action Spaces in Reinforcement Learning
- A learner-verifier framework for neural network controllers and certificates of stochastic systems
- DSMC evaluation stages: fostering robust and safe behavior in deep reinforcement learning -- extended version
- Risk-aware shielding of partially observable Monte Carlo planning policies
- Dynamic shielding for reinforcement learning in black-box environments
- Safety-constrained reinforcement learning with a distributional safety critic
- Deductive controller synthesis for probabilistic hyperproperties
This page was built for publication:
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q5089265)