Reward tampering problems and solutions in reinforcement learning: a causal influence diagram perspective (Q6182771)

!

WARNING

This is the item page for this Wikibase entity, intended for internal use and editing purposes.

Please use the normal view instead:

Reward tampering problems and solutions in reinforcement learning: a causal influence diagram perspective

scientific article; zbMATH DE number 7795126

Language	Label	Description	Also known as
default for all languages	No label defined
English	Reward tampering problems and solutions in reinforcement learning: a causal influence diagram perspective	scientific article; zbMATH DE number 7795126

Statements

instance of

scholarly article

0 references

title

Reward tampering problems and solutions in reinforcement learning: a causal influence diagram perspective (English)

0 references

0 references

0 references

0 references

0 references

0 references

26 January 2024

0 references

full work available at URL

https://arxiv.org/abs/1908.04734

0 references

zbMATH Keywords

AGI safety

0 references

reinforcement learning

0 references

Bayesian learning

0 references

causality

0 references

decision theory

0 references

causal influence diagrams

0 references

MaRDI profile type

MaRDI publication profile

0 references

cites work

Planning and acting in partially observable stochastic domains

0 references

Multi-agent influence diagrams for representing and solving games.

0 references

General time consistent discounting

0 references

Representing and solving decision problems with limited information

0 references

Artificial general intelligence 2008. Proceedings of the 1st AGI conference.

0 references

Causality. Models, reasoning, and inference

0 references

A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play

0 references

Complete identification methods for the causal hierarchy

0 references

Reinforcement learning. An introduction

0 references

Identifiers

zbMATH Open document ID

1529.68309

0 references

Mathematics Subject Classification ID

0 references

0 references

0 references

0 references

0 references

0 references

10.1007/S11229-021-03141-4

0 references

Sitelinks

Mathematics(1 entry)

mardi Publication:6182771