A reinforcement learning algorithm based on policy iteration for average reward: Empirical results with yield management and convergence analysis (Q1771225)

From MaRDI portal
Revision as of 21:44, 4 April 2024 by Daniel (talk | contribs) (‎Created claim: Wikidata QID (P12): Q115010233, #quickstatements; #temporary_batch_1712261475387)





scientific article
Language Label Description Also known as
English
A reinforcement learning algorithm based on policy iteration for average reward: Empirical results with yield management and convergence analysis
scientific article

    Statements

    A reinforcement learning algorithm based on policy iteration for average reward: Empirical results with yield management and convergence analysis (English)
    0 references
    0 references
    7 April 2005
    0 references
    reinforcement learning
    0 references
    average reward
    0 references
    policy iteration
    0 references

    Identifiers