Adaptive aggregation for reinforcement learning in average reward Markov decision processes (Q378753)

From MaRDI portal





scientific article; zbMATH DE number 6225981
Language Label Description Also known as
default for all languages
No label defined
    English
    Adaptive aggregation for reinforcement learning in average reward Markov decision processes
    scientific article; zbMATH DE number 6225981

      Statements

      Adaptive aggregation for reinforcement learning in average reward Markov decision processes (English)
      0 references
      0 references
      12 November 2013
      0 references
      reinforcement learning
      0 references
      Markov decision process
      0 references
      bounded parameter MDP
      0 references
      regret
      0 references

      Identifiers