Exploiting the structural properties of the underlying Markov decision problem in the Q-learning algorithm (Q2901012)

From MaRDI portal

scientific article; zbMATH DE number 6060380

      Statements

      28 July 2012
      Markov decision processes
      Q-learning
      stochastic approximation methods
      Exploiting the structural properties of the underlying Markov decision problem in the Q-learning algorithm (English)

      Identifiers