Anchor-Changing Regularized Natural Policy Gradient for Multi-Objective Reinforcement Learning (Q6401740)

From MaRDI portal





preprint article from arXiv
Language Label Description Also known as
default for all languages
No label defined
    English
    Anchor-Changing Regularized Natural Policy Gradient for Multi-Objective Reinforcement Learning
    preprint article from arXiv

      Statements

      10 June 2022
      0 references
      cs.LG
      0 references
      math.OC
      0 references
      Ruida Zhou
      0 references
      Tao Liu
      0 references
      Dileep Kalathil
      0 references
      P. R. Kumar
      0 references
      Chao Tian
      0 references

      Identifiers

      0 references