A counterexample on sample-path optimality in stable Markov decision chains with the average reward criterion (Q481787)

!

WARNING

This is the item page for this Wikibase entity, intended for internal use and editing purposes.

Please use the normal view instead:

A counterexample on sample-path optimality in stable Markov decision chains with the average reward criterion

scientific article; zbMATH DE number 6380451

Language	Label	Description	Also known as
default for all languages	No label defined
English	A counterexample on sample-path optimality in stable Markov decision chains with the average reward criterion	scientific article; zbMATH DE number 6380451

Statements

instance of

scholarly article

0 references

title

A counterexample on sample-path optimality in stable Markov decision chains with the average reward criterion (English)

0 references

author

Rolando Cavazos-Cadena

0 references

Karel Sladký

0 references

Raúl Montes-de-Oca

0 references

published in

Journal of Optimization Theory and Applications

0 references

publication date

15 December 2014

0 references

zbMATH Keywords

strong sample-path optimality

0 references

Lyapunov function condition

0 references

stationary policy

0 references

expected average reward criterion

0 references

MaRDI profile type

MaRDI publication profile

0 references

full work available at URL

https://doi.org/10.1007/s10957-013-0474-6

0 references

cites work

Q4771778

0 references

Sample-path optimality in average Markov decision chains under a double Lyapunov function condition

0 references

Q4315289

0 references

Denumerable controlled Markov chains with average reward criterion: Sample path optimality

0 references

Sample-path average optimality for Markov control processes

0 references

Sample path optimality for a Markov optimization problem

0 references

Q5615108

0 references

Identifiers

zbMATH Open document ID

1302.90241

0 references

Mathematics Subject Classification ID

0 references

0 references

0 references

0 references

10.1007/S10957-013-0474-6

0 references

Sitelinks

Mathematics(1 entry)

mardi A counterexample on sample-path optimality in stable Markov decision chains with the average reward criterion