Adaptive playouts for online learning of policies during Monte Carlo tree search (Q307776)

From MaRDI portal





scientific article; zbMATH DE number 6623300
Language Label Description Also known as
default for all languages
No label defined
    English
    Adaptive playouts for online learning of policies during Monte Carlo tree search
    scientific article; zbMATH DE number 6623300

      Statements

      Adaptive playouts for online learning of policies during Monte Carlo tree search (English)
      0 references
      0 references
      0 references
      5 September 2016
      0 references
      Monte Carlo tree search
      0 references
      adaptive playouts
      0 references
      computer Go
      0 references
      reinforcement learning
      0 references

      Identifiers