Variance-penalized Markov decision processes: dynamic programming and reinforcement learning techniques (Q5166474)

From MaRDI portal

scientific article; zbMATH DE number 6309105
Language: default for all languages
Label: No label defined

Language: English
Label: Variance-penalized Markov decision processes: dynamic programming and reinforcement learning techniques
Description: scientific article; zbMATH DE number 6309105

      Statements

      Variance-penalized Markov decision processes: dynamic programming and reinforcement learning techniques (English)
      27 June 2014
      variance-penalized MDPs
      dynamic programming
      risk penalties
      reinforcement learning
      Bellman equation

      Identifiers