Linear Programming and Markov Decision Chains
From MaRDI portal
Publication:3854945
DOI10.1287/mnsc.25.4.352zbMath0421.90076OpenAlexW2112492957WikidataQ111675450 ScholiaQ111675450MaRDI QIDQ3854945
Arie Hordijk, Lodewijk C. M. Kallenberg
Publication date: 1979
Published in: Management Science (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1287/mnsc.25.4.352
infinite horizonaverage reward criterionaverage optimal policyfinite Markov decision processsolution by linear programming
Related Items (44)
Variational characterizations in Markov decision processes ⋮ LP based upper and lower bounds for Cesàro and Abel limits of the optimal values in problems of control of stochastic discrete time systems ⋮ Approximate dynamic programming with state aggregation applied to UAV perimeter patrol ⋮ Quadratic programming and the single-controller stochastic game ⋮ Communicating MDPs: Equivalence and LP properties ⋮ On Linear Programming for Constrained and Unconstrained Average-Cost Markov Decision Processes with Countable Action Spaces and Strictly Unbounded Costs ⋮ Survey of linear programming for standard and nonstandard Markovian control problems. Part I: Theory ⋮ Linear programming formulation of MDPs in countable state space: The multichain case ⋮ Derman's book as inspiration: some results on LP for MDPs ⋮ Nonlinear programming and stationary strategies in stochastic games ⋮ Computing semi-stationary optimal policies for multichain semi-Markov decision processes ⋮ Linear programming estimates for Cesàro and Abel limits of optimal values in optimal control problems ⋮ The stochastic shortest path problem: a polyhedral combinatorics perspective ⋮ Value iteration for simple stochastic games: stopping criterion and learning algorithm ⋮ A value-iteration scheme for undiscounted multichain Markov renewal programs ⋮ Linear programming formulation of long-run average optimal control problem ⋮ State partitioning based linear program for stochastic dynamic programs: an invariance property ⋮ The Linear Program approach in multi-chain Markov Decision Processes revisited ⋮ MF-OMO: An Optimization Formulation of Mean-Field Games ⋮ Linear programming and undiscounted stochastic games in which one player controls transitions ⋮ A value iteration method for undiscounted multichain Markov decision processes ⋮ Maximizing the set of recurrent states of an MDP subject to convex constraints ⋮ Linear programming formulation for non-stationary, finite-horizon Markov decision process models ⋮ Nonlinear programming and stationary equilibria in stochastic games ⋮ A structured pattern matrix algorithm for multichain Markov decision processes ⋮ Separable Markovian decision problems. The linear programming method in the multichain case ⋮ Saddle-point calculation for constrained finite Markov chains ⋮ Determining the optimal strategies for discrete control problems on stochastic networks with discounted costs ⋮ The Completely Mixed Single-Controller Stochastic Game ⋮ Transient policies in discrete dynamic programming: Linear programming including suboptimality tests and additional constraints ⋮ PIVOTING ALGORITHMS FOR SOME CLASSES OF STOCHASTIC GAMES: A SURVEY ⋮ Linear Programming and Zero-Sum Two-Person Undiscounted Semi-Markov Games ⋮ Singularly perturbed linear programs and Markov decision processes ⋮ Markov Branching Decision Chains with Interest-Rate-Dependent Rewards ⋮ Admission control strategies for tandem Markovian loss systems ⋮ LP Formulations of Discrete Time Long-Run Average Optimal Control Problems: The NonErgodic Case ⋮ Linear programming in tector criterion markov and semi-Markov decision processes ⋮ On completely mixed stochastic games ⋮ Maximum-Stopping-Value Policies in Finite Markov Population Decision Chains ⋮ Percentiles and Markovian decision processes ⋮ On the block upper-triangularity of undiscounted multi-chain Markov decision problems ⋮ MARKOV DECISION PROCESSES ⋮ Generalized polynomial approximations in Markovian decision processes ⋮ On stationary equilibria of a single-controller stochastic game
This page was built for publication: Linear Programming and Markov Decision Chains