Bounds and Transformations for Discounted Finite Markov Decision Chains

From MaRDI portal

Publication:4085522

Jump to:navigation, search

DOI10.1287/opre.23.4.761zbMath0322.90073OpenAlexW1983920151MaRDI QIDQ4085522

Evan L. Porteus

Publication date: 1975

Published in: Operations Research (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1287/opre.23.4.761

Mathematics Subject Classification ID

Markov and semi-Markov decision processes (90C40) Applications of queueing theory (congestion, allocation, storage, traffic, etc.) (60K30) Mathematical programming (90C99) Hamilton-Jacobi theories (49L99)

Related Items

Approximating Markov decision processes using expected state transitions ⋮ Computation techniques for large scale undiscounted markov decision processes ⋮ Computational comparison of policy iteration algorithms for discounted Markov decision processes ⋮ Approximate dynamic programming with state aggregation applied to UAV perimeter patrol ⋮ Block-scaling of value-iteration for discounted Markov renewal programming ⋮ Generic rank-one corrections for value iteration in Markovian decision problems ⋮ On efficiency of linear programming applied to discounted Markovian decision problems ⋮ The numerical exploitation of periodicity in Markov decision processes ⋮ Error bounds for stochastic shortest path problems ⋮ On theory and algorithms for Markov decision problems with the total reward criterion ⋮ Solving linear systems by methods based on a probabilistic interpretation ⋮ Markov renewal decision processes with finite horizon ⋮ Conditions for characterizing the structure of optimal strategies in infinite-horizon dynamic programs ⋮ Action-dependent stopping times and Markov decision process with unbounded rewards ⋮ A multi-period TSP with stochastic regular and urgent demands ⋮ Markov decision processes ⋮ Value function approximation via linear programming for FMS scheduling ⋮ Discounted Markov games; successive approximation and stopping times ⋮ Improved iterative computation of the expected discounted return in Markov and semi-Markov chains ⋮ A \(K\)-step look-ahead analysis of value iteration algorithms for Markov decision processes ⋮ Computational comparison of value iteration algorithms for discounted Markov decision processes ⋮ Generalized polynomial approximations in Markovian decision processes

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:4085522&oldid=17840160"