Learning-Based Mean-Payoff Optimization in an Unknown MDP under Omega-Regular Constraints (Q5009420)

From MaRDI portal

Jump to:navigation, search

scientific article; zbMATH DE number 7378552

Language	Label	Description	Also known as
English	Learning-Based Mean-Payoff Optimization in an Unknown MDP under Omega-Regular Constraints	scientific article; zbMATH DE number 7378552

Statements

scholarly article

0 references

Jan Křetínský

0 references

Guillermo A. Pérez

0 references

Jean-François Raskin

0 references

publication date

4 August 2021

0 references

full work available at URL

https://arxiv.org/abs/1804.08924

0 references

zbMATH Keywords

Markov decision processes

0 references

reinforcement learning

0 references

beyond worst case

0 references

MaRDI profile type

MaRDI publication profile

0 references

0 references

First-cycle games

0 references

0 references

0 references

Permissive strategies: from parity games to safety games

0 references

Threshold Constraints with Guarantees for Parity Objectives in Markov Decision Processes

0 references

Verification of Markov Decision Processes Using Learning Algorithms

0 references

Automated technology for verification and analysis. 14th international symposium, ATVA 2016, Chiba, Japan, October 17--20, 2016. Proceedings

0 references

Meet Your Expectations With Guarantees: Beyond Worst-Case Synthesis in Quantitative Games

0 references

Deciding parity games in quasipolynomial time

0 references

Concurrent games with tail objectives

0 references

Robustness of Structurally Equivalent Concurrent Parity Games

0 references

Mathematical foundations of computer science 2011. 36th international symposium, MFCS 2011, Warsaw, Poland, August 22--26, 2011. Proceedings

0 references

Multidimensional beyond Worst-Case and Almost-Sure Problems for Mean-Payoff Objectives

0 references

Tools and algorithms for the construction and analysis of systems. 22nd international conference, TACAS 2016, held as part of the European joint conferences on theory and practice of software, ETAPS 2016, Eindhoven, The Netherlands, April 2--8, 2016. Proceedings

0 references

On Time with Minimal Expected Cost!

0 references

Pure Stationary Optimal Strategies in Markov Decision Processes

0 references

0 references

Shortest paths without a map

0 references

0 references

Continuity of the value of competitive Markov decision processes

0 references

0 references

On the synthesis of strategies in infinite games

0 references

\({\mathcal Q}\)-learning

0 references

Learning-Based Mean-Payoff Optimization in an Unknown MDP under Omega-Regular Constraints (English)

0 references

Identifiers

10.4230/LIPIcs.CONCUR.2018.8

0 references

Mathematics Subject Classification ID

0 references

zbMATH DE Number

0 references

0 references

Sitelinks

Mathematics(1 entry)

mardi Publication:5009420

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Item:Q5009420&oldid=37338204"