Adaptive aggregation for reinforcement learning in average reward Markov decision processes (Q378753)

scientific article

    Statements

    Adaptive aggregation for reinforcement learning in average reward Markov decision processes (English)
    12 November 2013
    reinforcement learning
    Markov decision process
    bounded parameter MDP
    regret

    Identifiers