An emphatic approach to the problem of off-policy temporal-difference learning (Q2810885)

From MaRDI portal





scientific article; zbMATH DE number 6589487
Language Label Description Also known as
default for all languages
No label defined
    English
    An emphatic approach to the problem of off-policy temporal-difference learning
    scientific article; zbMATH DE number 6589487

      Statements

      0 references
      0 references
      0 references
      6 June 2016
      0 references
      temporal-difference learning
      0 references
      off-policy learning
      0 references
      function approximation
      0 references
      stability
      0 references
      convergence
      0 references
      An emphatic approach to the problem of off-policy temporal-difference learning (English)
      0 references

      Identifiers