Learning-Based Mean-Payoff Optimization in an Unknown MDP under Omega-Regular Constraints (Q5009420)

From MaRDI portal
scientific article; zbMATH DE number 7378552

    Statements

    4 August 2021
    Markov decision processes
    reinforcement learning
    beyond worst case
    Learning-Based Mean-Payoff Optimization in an Unknown MDP under Omega-Regular Constraints (English)

    Identifiers