Pessimistic value iteration for multi-task data sharing in offline reinforcement learning (Q6152665)

scientific article; zbMATH DE number 7803985

Language	Label	Description	Also known as
default for all languages	No label defined
English	Pessimistic value iteration for multi-task data sharing in offline reinforcement learning	scientific article; zbMATH DE number 7803985

Statements

instance of

scholarly article

0 references

title

Pessimistic value iteration for multi-task data sharing in offline reinforcement learning (English)

0 references

published in

Artificial Intelligence

0 references

publication date

13 February 2024

0 references

zbMATH Keywords

uncertainty quantification

0 references

data sharing

0 references

pessimistic value iteration

0 references

offline reinforcement learning

0 references

MaRDI profile type

MaRDI publication profile

0 references

cites work

Reinforcement learning. An introduction

0 references

A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play

0 references

Entropy-SGD: biasing gradient descent into wide valleys

0 references

Near-optimal regret bounds for reinforcement learning

0 references

Identifiers

Mathematics Subject Classification ID

0 references

0 references

10.1016/J.ARTINT.2023.104048

0 references

Sitelinks

Mathematics(1 entry)

mardi Publication:6152665