New average optimality conditions for semi-Markov decision processes in Borel spaces (Q438786): Difference between revisions
From MaRDI portal
Property / reviewed by: Anna Jaśkiewicz / rank: Normal rank
Property / Mathematics Subject Classification ID: 90C40 / rank: Normal rank
Property / Mathematics Subject Classification ID: 90C46 / rank: Normal rank
Property / zbMATH DE Number: 6062513 / rank: Normal rank
Revision as of 00:34, 30 June 2023
scientific article
Language | Label | Description | Also known as |
---|---|---|---|
English | New average optimality conditions for semi-Markov decision processes in Borel spaces | scientific article | |
Statements
New average optimality conditions for semi-Markov decision processes in Borel spaces (English)
31 July 2012
This paper deals with semi-Markov decision processes on a Borel space with the so-called ratio-average expected cost criterion. The objective is to provide a set of conditions under which an optimal stationary policy exists. The key assumption is that the so-called relative difference of the optimal discounted cost value function is bounded by two integrable functions. Further, the authors make use of the well-known data transformation of \textit{P. J. Schweitzer} [J. Math. Anal. Appl. 34, 495--501 (1971; Zbl 0218.90070)] and analyze two optimality inequalities. The optimal policy is obtained as a selector of the minima in one of the inequalities. Similar techniques for semi-Markov decision processes with unbounded cost functions have been used by other authors; see, for example, [\textit{F. Luque-Vásquez} and \textit{O. Hernández-Lerma}, Appl. Math. 26, No. 3, 315--331 (1999; Zbl 1050.90566)]; [\textit{A. Jaśkiewicz}, Math. Methods Oper. Res. 54, No. 1, 1--19 (2001; Zbl 1031.90062)]; [\textit{A. Federgruen}, \textit{P. J. Schweitzer} and \textit{H. C. Tijms}, Math. Oper. Res. 8, 298--313 (1983; Zbl 0513.90085)].
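As a rough illustration of the data transformation mentioned in the review, the sketch below applies the textbook finite-state form of Schweitzer's transformation to a two-state semi-Markov chain under one fixed stationary policy. It is not the Borel-space construction of the paper: the function names, the numerical data, and the constant `eta` are all assumptions chosen for the example. It checks that the ratio-average cost of the original chain agrees with the ordinary average cost of the transformed chain.

```python
def schweitzer_transform(P, c, tau, eta):
    """Finite-state data transformation under a fixed stationary policy.

    P[i][j]: embedded transition probabilities (hypothetical data)
    c[i]:    expected cost incurred per visit to state i
    tau[i]:  mean holding time in state i (must be positive)
    eta:     constant with 0 < eta < tau[i] / (1 - P[i][i]) for every i
    """
    n = len(P)
    if eta <= 0:
        raise ValueError("eta must be positive")
    for i in range(n):
        if P[i][i] < 1 and eta >= tau[i] / (1 - P[i][i]):
            raise ValueError("eta too large for state %d" % i)
    # New kernel: eta * (P - I) / tau + I, computed row by row.
    P_new = [
        [eta * (P[i][j] - (i == j)) / tau[i] + (i == j) for j in range(n)]
        for i in range(n)
    ]
    # New one-stage cost: cost per unit of time.
    c_new = [c[i] / tau[i] for i in range(n)]
    return P_new, c_new


def stationary_distribution(P, iterations=2000):
    """Approximate the stationary distribution of an irreducible chain.

    Iterates the lazy chain (I + P) / 2, which has the same stationary
    distribution as P and avoids periodicity problems.
    """
    n = len(P)
    pi = [1.0 / n] * n
    for _ in range(iterations):
        step = [sum(pi[i] * P[i][j] for i in range(n)) for j in range(n)]
        pi = [0.5 * pi[j] + 0.5 * step[j] for j in range(n)]
    return pi


# Hypothetical two-state example.
P = [[0.2, 0.8], [0.6, 0.4]]
c = [3.0, 1.0]
tau = [2.0, 0.5]

# Ratio-average cost of the semi-Markov chain: expected cost per visit
# divided by expected holding time per visit, under the stationary law.
pi = stationary_distribution(P)
ratio_average = sum(p * ci for p, ci in zip(pi, c)) / sum(
    p * ti for p, ti in zip(pi, tau)
)

# Ordinary long-run average cost of the transformed discrete-time chain.
P_new, c_new = schweitzer_transform(P, c, tau, eta=0.5)
pi_new = stationary_distribution(P_new)
transformed_average = sum(p * ci for p, ci in zip(pi_new, c_new))

print(ratio_average, transformed_average)
```

For these numbers both quantities equal 13/8, which reflects the general fact that the stationary law of the transformed chain is proportional to `pi[i] * tau[i]`.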
semi-Markov decision process
ratio-average cost criterion
optimality inequality
optimal stationary policy