Pessimistic value iteration for multi-task data sharing in offline reinforcement learning (Q6152665)

scientific article; zbMATH DE number 7803985

Language	Label	Description	Also known as
English	Pessimistic value iteration for multi-task data sharing in offline reinforcement learning	scientific article; zbMATH DE number 7803985

Statements

instance of

scholarly article

0 references

title

Pessimistic value iteration for multi-task data sharing in offline reinforcement learning (English)

0 references

published in

Artificial Intelligence

0 references

publication date

13 February 2024

0 references

zbMATH Keywords

uncertainty quantification

0 references

data sharing

0 references

pessimistic value iteration

0 references

offline reinforcement learning

0 references

MaRDI profile type

MaRDI publication profile

0 references

cites work

Q4626283

0 references

A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play

0 references

Entropy-SGD: biasing gradient descent into wide valleys

0 references

Q2896090

0 references

Identifiers

DOI

10.1016/j.artint.2023.104048

0 references

Mathematics Subject Classification ID

68T05

0 references

zbMATH DE Number

7803985

0 references

Sitelinks

Mathematics(1 entry)

mardi Publication:6152665