Adaptive aggregation for reinforcement learning in average reward Markov decision processes (Q378753): Difference between revisions

From MaRDI portal
Normalize DOI.
Import241208061232 (talk | contribs)
Normalize DOI.
 
Property / DOI
 
Property / DOI: 10.1007/S10479-012-1064-Y / rank
Normal rank
 
Property / DOI
 
Property / DOI: 10.1007/S10479-012-1064-Y / rank
 
Normal rank

Latest revision as of 15:46, 9 December 2024

scientific article
Language Label Description Also known as
English
Adaptive aggregation for reinforcement learning in average reward Markov decision processes
scientific article

    Statements

    Adaptive aggregation for reinforcement learning in average reward Markov decision processes (English)
    0 references
    0 references
    12 November 2013
    0 references
    reinforcement learning
    0 references
    Markov decision process
    0 references
    bounded parameter MDP
    0 references
    regret
    0 references

    Identifiers