Reward tampering problems and solutions in reinforcement learning: a causal influence diagram perspective (Q6182771)

From MaRDI portal

scientific article; zbMATH DE number 7795126

      Statements

      Reward tampering problems and solutions in reinforcement learning: a causal influence diagram perspective (English)
      26 January 2024
      AGI safety
      reinforcement learning
      Bayesian learning
      causality
      decision theory
      causal influence diagrams
