Policy Gradient for Continuing Tasks in Discounted Markov Decision Processes (Q6075992)

From MaRDI portal
scientific article; zbMATH DE number 7740953

    Statements

    Policy Gradient for Continuing Tasks in Discounted Markov Decision Processes (English)
    21 September 2023
    adaptive systems
    gradient methods
    reinforcement learning
    stochastic systems

    Identifiers