New average optimality conditions for semi-Markov decision processes in Borel spaces (Q438786): Difference between revisions

This paper deals with semi-Markov decision processes on a Borel space with the so-called ratio-average expected cost criterion. The objective is to provide a set of conditions under which there exists an optimal stationary policy. The key assumption is that the so-called relative difference of the optimal discounted cost value function is bounded by integrable two functions. Further, the authors make use of the well-known \textit{P. J. Schweitzer's} data transformation [J. Math. Anal. Appl. 34, 495--501 (1971; Zbl 0218.90070)] and analyze two optimality inequalities. The optimal policy is obtained as a selector of the minima in one of the inequalities. Similar techniques of semi-Markov decision processes with unbounded cost functions were used by different authors, see for example [\textit{F. Luque-Vásquez} and \textit{O. Hernández-Lerma}, Appl. Math. 26, No. 3, 315--331 (1999; Zbl 1050.90566)]; [\textit{A. Jaśkiewicz}, Math. Methods Oper. Res. 54, No. 1, 1--19 (2001; Zbl 1031.90062)]; [\textit{A. Federgruen}, \textit{P. J. Schweitzer} and \textit{H. C. Tijms}, Math. Oper. Res. 8, 298--313 (1983; Zbl 0513.90085)].

0 references

zbMATH Keywords

semi-Markov decision process

0 references

ratio-average cost criterion

0 references

optimality inequality

0 references

optimal stationary policy

0 references

reviewed by

Anna Jaśkiewicz

0 references

MaRDI profile type

MaRDI publication profile

0 references

full work available at URL

https://doi.org/10.1007/s10957-012-9986-8

0 references

cites work

Semi-Markov control models with average costs

0 references

SEMI-MARKOV DECISION PROCESSES AND THEIR APPLICATIONS IN REPLACEMENT MODELS

0 references

Q4315289

0 references

Q4223191

0 references

Uniformization for semi-Markov decision processes under stationary policies

0 references

The optimality equation in average cost denumerable state semi-Markov decision problems, recurrency conditions and algorithms

0 references

Denumerable state semi-Markov decision processes with unbounded costs, average cost criterion

0 references

Denumerable Undiscounted Semi-Markov Decision Processes with Unbounded Rewards

0 references

Constrained Semi-Markov decision processes with average rewards

0 references

Q3313617

0 references

Q5615108

0 references

Iterative solution of the functional equations of undiscounted Markov renewal programming

0 references

Average Cost Semi-Markov Decision Processes and the Control of Queueing Systems

0 references

An approximation approach to ergodic semi-Markov control processes

0 references

On the Equivalence of Two Expected Average Cost Criteria for Semi-Markov Control Processes

0 references

Optimality in Feller semi-Markov control processes

0 references

A Fixed Point Approach to Solve the Average Cost Optimality Equation for Semi-Markov Decision Processes with Feller Transition Probabilities

0 references

Average optimality for continuous-time Markov decision processes in Polish spaces

0 references

Average optimality for Markov decision processes in borel spaces: a new condition and approach

0 references

Q4863593

0 references

Q4255598

0 references

Continuous-time Markov decision processes. Theory and applications

0 references

New sufficient conditions for average optimality in continuous-time Markov decision processes

0 references

First passage models for denumerable semi-Markov decision processes with nonnegative discounted costs

0 references

Computable bounds for geometric convergence rates of Markov chains

0 references

Identifiers

zbMATH Open document ID

1266.90190

0 references

DOI

10.1007/s10957-012-9986-8

0 references

Mathematics Subject Classification ID

0 references

0 references

0 references

0 references

Sitelinks

Mathematics(1 entry)

mardi Publication:438786

@@ Property / review text @@
+This paper deals with semi-Markov decision processes on a Borel space with the so-called ratio-average expected cost criterion.   The objective is to provide a set of conditions under which there exists an optimal stationary policy. The key assumption is that the so-called relative difference of the optimal discounted cost value function is bounded by integrable two functions. Further, the authors make use of the well-known \textit{P. J. Schweitzer's} data transformation [J. Math. Anal. Appl. 34, 495--501 (1971; Zbl 0218.90070)] and analyze two optimality inequalities. The optimal policy is obtained as a selector of the minima in one of the inequalities.   Similar techniques of semi-Markov decision processes with unbounded cost functions were used by different authors, see for example [\textit{F. Luque-Vásquez} and \textit{O. Hernández-Lerma}, Appl. Math. 26, No. 3, 315--331 (1999; Zbl 1050.90566)]; [\textit{A. Jaśkiewicz}, Math. Methods Oper. Res. 54, No. 1, 1--19 (2001; Zbl 1031.90062)]; [\textit{A. Federgruen}, \textit{P. J. Schweitzer} and \textit{H. C. Tijms}, Math. Oper. Res. 8, 298--313 (1983; Zbl 0513.90085)].
+Normal rank
@@ Property / Mathematics Subject Classification ID @@
+C40
@@ Property / Mathematics Subject Classification ID: 90C40 / rank @@
+Normal rank
@@ Property / Mathematics Subject Classification ID @@
+C46
@@ Property / Mathematics Subject Classification ID: 90C46 / rank @@
+Normal rank
@@ Property / zbMATH DE Number @@
+6062513
@@ Property / zbMATH DE Number: 6062513 / rank @@
+Normal rank
@@ Property / zbMATH Keywords @@
+semi-Markov decision process
@@ Property / zbMATH Keywords: semi-Markov decision process / rank @@
+Normal rank
@@ Property / zbMATH Keywords @@
+ratio-average cost criterion
@@ Property / zbMATH Keywords: ratio-average cost criterion / rank @@
+Normal rank
@@ Property / zbMATH Keywords @@
+optimality inequality
@@ Property / zbMATH Keywords: optimality inequality / rank @@
+Normal rank
@@ Property / zbMATH Keywords @@
+optimal stationary policy
@@ Property / zbMATH Keywords: optimal stationary policy / rank @@
+Normal rank
@@ Property / reviewed by @@
+Anna Jaśkiewicz
@@ Property / reviewed by: Anna Jaśkiewicz / rank @@
+Normal rank
@@ Property / MaRDI profile type @@
+MaRDI publication profile
@@ Property / MaRDI profile type: MaRDI publication profile / rank @@
+Normal rank
@@ Property / full work available at URL @@
+https://doi.org/10.1007/s10957-012-9986-8
+Normal rank
@@ Property / OpenAlex ID @@
+W1967276139
@@ Property / OpenAlex ID: W1967276139 / rank @@
+Normal rank
@@ Property / cites work @@
+Semi-Markov control models with average costs
@@ Property / cites work: Semi-Markov control models with average costs / rank @@
+Normal rank
@@ Property / cites work @@
+SEMI-MARKOV DECISION PROCESSES AND THEIR APPLICATIONS IN REPLACEMENT MODELS
+Normal rank
@@ Property / cites work @@
+Q4315289
@@ Property / cites work: Q4315289 / rank @@
+Normal rank
@@ Property / cites work @@
+Q4223191
@@ Property / cites work: Q4223191 / rank @@
+Normal rank
@@ Property / cites work @@
+Uniformization for semi-Markov decision processes under stationary policies
+Normal rank
@@ Property / cites work @@
+The optimality equation in average cost denumerable state semi-Markov decision problems, recurrency conditions and algorithms
+Normal rank
@@ Property / cites work @@
+Denumerable state semi-Markov decision processes with unbounded costs, average cost criterion
+Normal rank
@@ Property / cites work @@
+Denumerable Undiscounted Semi-Markov Decision Processes with Unbounded Rewards
+Normal rank
@@ Property / cites work @@
+Constrained Semi-Markov decision processes with average rewards
+Normal rank
@@ Property / cites work @@
+Q3313617
@@ Property / cites work: Q3313617 / rank @@
+Normal rank
@@ Property / cites work @@
+Q5615108
@@ Property / cites work: Q5615108 / rank @@
+Normal rank
@@ Property / cites work @@
+Iterative solution of the functional equations of undiscounted Markov renewal programming
+Normal rank
@@ Property / cites work @@
+Average Cost Semi-Markov Decision Processes and the Control of Queueing Systems
+Normal rank
@@ Property / cites work @@
+An approximation approach to ergodic semi-Markov control processes
+Normal rank
@@ Property / cites work @@
+On the Equivalence of Two Expected Average Cost Criteria for Semi-Markov Control Processes
+Normal rank
@@ Property / cites work @@
+Optimality in Feller semi-Markov control processes
+Normal rank
@@ Property / cites work @@
+A Fixed Point Approach to Solve the Average Cost Optimality Equation for Semi-Markov Decision Processes with Feller Transition Probabilities
+Normal rank
@@ Property / cites work @@
+Average optimality for continuous-time Markov decision processes in Polish spaces
+Normal rank
@@ Property / cites work @@
+Average optimality for Markov decision processes in borel spaces: a new condition and approach
+Normal rank
@@ Property / cites work @@
+Q4863593
@@ Property / cites work: Q4863593 / rank @@
+Normal rank
@@ Property / cites work @@
+Q4255598
@@ Property / cites work: Q4255598 / rank @@
+Normal rank
@@ Property / cites work @@
+Continuous-time Markov decision processes. Theory and applications
+Normal rank
@@ Property / cites work @@
+New sufficient conditions for average optimality in continuous-time Markov decision processes
+Normal rank
@@ Property / cites work @@
+First passage models for denumerable semi-Markov decision processes with nonnegative discounted costs
+Normal rank
@@ Property / cites work @@
+Computable bounds for geometric convergence rates of Markov chains
+Normal rank
@@ links / mardi / name / links / mardi / name @@
+Publication:438786