Reliable off-policy evaluation for reinforcement learning (Q6579655)

From MaRDI portal





scientific article; zbMATH DE number 7887720
Language Label Description Also known as
default for all languages
No label defined
    English
    Reliable off-policy evaluation for reinforcement learning
    scientific article; zbMATH DE number 7887720

      Statements

      Identifiers