Finding optimal memoryless policies of POMDPs under the expected average reward criterion (Q418072)

From MaRDI portal





scientific article; zbMATH DE number 6034961
Language Label Description Also known as
default for all languages
No label defined
    English
    Finding optimal memoryless policies of POMDPs under the expected average reward criterion
    scientific article; zbMATH DE number 6034961

      Statements

      Finding optimal memoryless policies of POMDPs under the expected average reward criterion (English)
      0 references
      0 references
      0 references
      0 references
      14 May 2012
      0 references
      POMDPs
      0 references
      performance difference
      0 references
      policy iteration with step sizes
      0 references
      correlated actions
      0 references
      memoryless policy
      0 references

      Identifiers