Adaptive importance sampling for value function approximation in off-policy reinforcement learning (Q1784527)

From MaRDI portal





scientific article; zbMATH DE number 6944328
Language Label Description Also known as
default for all languages
No label defined
    English
    Adaptive importance sampling for value function approximation in off-policy reinforcement learning
    scientific article; zbMATH DE number 6944328

      Statements

      Adaptive importance sampling for value function approximation in off-policy reinforcement learning (English)
      0 references
      0 references
      0 references
      0 references
      0 references
      27 September 2018
      0 references
      off-policy reinforcement learning
      0 references
      value function approximation
      0 references
      policy iteration
      0 references
      adaptive importance sampling
      0 references
      importance-weighted cross-validation
      0 references
      efficient sample reuse
      0 references

      Identifiers