Finding optimal memoryless policies of POMDPs under the expected average reward criterion (Q418072)

scientific article; zbMATH DE number 6034961

Statements

instance of

scholarly article

title

Finding optimal memoryless policies of POMDPs under the expected average reward criterion (English)

published in

European Journal of Operational Research

publication date

14 May 2012

zbMATH Keywords

POMDPs

performance difference

policy iteration with step sizes

correlated actions

memoryless policy

describes a project that uses

POMDP

MaRDI profile type

MaRDI publication profile

full work available at URL

https://doi.org/10.1016/j.ejor.2010.12.014

cites work

Basic ideas for event-based optimization of Markov systems

Stochastic learning and optimization. A sensitivity-based approach.

Perturbation realization, potentials, and sensitivity analysis of Markov processes

Event-Based Optimization of Markov Systems

The $n$th-Order Bias Optimality for Multichain Markov Decision Processes

CONVERGENCE OF SIMULATION-BASED POLICY ITERATION

Performance optimization algorithms based on potentials for semi-Markov control processes

Potential-Based Online Policy Iteration Algorithms for Markov Decision Processes

A survey of algorithmic methods for partially observed Markov decision processes

Simulation-based optimization of Markov reward processes

Q4315289

The Optimal Control of Partially Observable Markov Processes over a Finite Horizon

Optimization of a special case of continuous-time Markov decision processes with compact action set

Identifiers

zbMATH Open document ID

1237.90250

Mathematics Subject Classification ID

DOI

10.1016/J.EJOR.2010.12.014
