Variance-penalized Markov decision processes: dynamic programming and reinforcement learning techniques (Q5166474)

From MaRDI portal

scientific article; zbMATH DE number 6309105

      Statements

      Variance-penalized Markov decision processes: dynamic programming and reinforcement learning techniques (English)
      27 June 2014
      variance-penalized MDPs
      dynamic programming
      risk penalties
      reinforcement learning
      Bellman equation

      Identifiers