On bidecision processes (Q1340581): Difference between revisions

The author studies a so-called Markov bidecision process, obtained from the standard Markov decision process by incorporating maximization steps as well as minimization steps. With the help of an extended optimality equation, he constructs a pair of policies that maximize (resp. minimize) the total reward in some sense. The pair of policies is found by a policy iteration method.
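
To make the idea of mixing maximization and minimization steps concrete, the following is a minimal sketch under stated assumptions, not the construction used in the reviewed paper. It assumes a finite, discounted model in which every state is tagged as either a "max" or a "min" step, and it solves the resulting optimality equation by value iteration rather than by the policy iteration method mentioned in the review. The function name `solve_bidecision`, the state tagging, the discount factor `beta`, and the toy data are all hypothetical.

```python
# Hypothetical toy model: each state is tagged "max" or "min". The tagging,
# the discount factor and every name below are illustrative assumptions,
# not taken from the reviewed paper.

def solve_bidecision(states, actions, mode, reward, trans,
                     beta=0.9, tol=1e-10, max_iter=10_000):
    """Value iteration on v(s) = opt_a [ r(s,a) + beta * sum_t p(t|s,a) v(t) ],
    where opt is max in "max" states and min in "min" states."""

    def q(s, a, v):
        # One-step lookahead value of taking action a in state s.
        return reward[s, a] + beta * sum(p * v[t] for t, p in trans[s, a].items())

    def opt(s):
        # Pick the built-in max or min according to the state's tag.
        return max if mode[s] == "max" else min

    v = {s: 0.0 for s in states}
    for _ in range(max_iter):
        new_v = {s: opt(s)(q(s, a, v) for a in actions[s]) for s in states}
        gap = max(abs(new_v[s] - v[s]) for s in states)
        v = new_v
        if gap < tol:
            break

    # Greedy pair of policies: maximizing choices in "max" states,
    # minimizing choices in "min" states.
    policy = {s: opt(s)(actions[s], key=lambda a: q(s, a, v)) for s in states}
    return v, policy


# Toy instance with one "max" state (0) and one "min" state (1).
states = [0, 1]
mode = {0: "max", 1: "min"}
actions = {0: ["stay", "go"], 1: ["stay", "back"]}
reward = {(0, "stay"): 1.0, (0, "go"): 0.0,
          (1, "stay"): 2.0, (1, "back"): 0.0}
trans = {(0, "stay"): {0: 1.0}, (0, "go"): {1: 1.0},
         (1, "stay"): {1: 1.0}, (1, "back"): {0: 1.0}}

v, policy = solve_bidecision(states, actions, mode, reward, trans, beta=0.5)
```

With a discount factor below 1 the update is a contraction, so the iteration converges from any starting point; this is the reason value iteration was chosen for the sketch instead of a policy iteration scheme, whose details the review does not give.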

reviewed by

Karl-Heinz Waldmann

zbMATH Keywords

Markov bidecision process

extended optimality equation

policy iteration

MaRDI profile type

MaRDI publication profile

full work available at URL

https://doi.org/10.1006/jmaa.1994.1382

Identifiers

zbMATH Open document ID

0829.90135

DOI

10.1006/jmaa.1994.1382

Mathematics Subject Classification ID

Sitelinks

Mathematics (1 entry)

mardi Publication:1340581

@@ Property / full work available at URL @@
+https://doi.org/10.1006/jmaa.1994.1382
+Normal rank
@@ Property / OpenAlex ID @@
+W1976430771
@@ Property / OpenAlex ID: W1976430771 / rank @@
+Normal rank