Handbook of Markov decision processes. Methods and applications
Publication:5954951
zbMath: 0979.90001; MaRDI QID: Q5954951
Eugene A. Feinberg, Adam Shwartz
Publication date: 6 February 2002
Published in: International Series in Operations Research & Management Science
Collections of articles of miscellaneous specific interest (00B15)
Proceedings, conferences, collections, etc. pertaining to operations research and mathematical programming (90-06)
General reference works (handbooks, dictionaries, bibliographies, etc.) pertaining to operations research and mathematical programming (90-00)
Related Items (54)
Polynomial Time Algorithms for Branching Markov Decision Processes and Probabilistic Min(Max) Polynomial Bellman Equations
An incremental off-policy search in a model-free Markov decision process using a single sample path
Algorithms for Optimal Control of Stochastic Switching Systems
Continue, quit, restart probability model
A Mixed Value and Policy Iteration Method for Stochastic Control with Universally Measurable Policies
Reachability analysis of uncertain systems using bounded-parameter Markov decision processes
A model for equilibrium in some service-provider user-set interactions
Redundant data transmission in control/estimation over lossy networks
On Linear Programming for Constrained and Unconstrained Average-Cost Markov Decision Processes with Countable Action Spaces and Strictly Unbounded Costs
Optimization of Markov decision processes under the variance criterion
A mean-variance optimization problem for discounted Markov decision processes
Bias optimality of admission control in a non-stationary repairable queue
Markov decision processes with burstiness constraints
Model-based preference quantification
Optimization of a special case of continuous-time Markov decision processes with compact action set
A note on the existence of optimal stationary policies for average Markov decision processes with countable states
A Markovian model for the spread of the SARS-CoV-2 virus
Dynamic optimization over infinite-time horizon: web-building strategy in an orb-weaving spider as a case study
Optimal balanced control for call centers
On the terminal condition for the Bellman equation for dynamic optimization with an infinite horizon
Stochastic control via direct comparison
An axiomatic approach to Markov decision processes
Exact finite approximations of average-cost countable Markov decision processes
Computational bounds for elevator control policies by large scale linear programming
Reachability in recursive Markov decision processes
PageRank optimization by edge selection
A Markovian approach for optimizing highway life-cycle with genetic algorithms by considering maintenance of roadside appurtenances
Computation of weighted sums of rewards for concurrent MDPs
Semi-Infinite Weighted Markov Decision Processes
QUASY: Quantitative Synthesis Tool
Parameterized Markov decision process and its application to service rate control
Stochastic dynamic programming with non-linear discounting
An Overview for Markov Decision Processes in Queues and Networks
A projected primal-dual gradient optimal control method for deep reinforcement learning
Variance minimization of parameterized Markov decision processes
Quantitative model-checking of controlled discrete-time Markov processes
Robust Markov perfect equilibria
Measuring the confinement of probabilistic systems
On average control generating families for singularly perturbed optimal control problems with long run average optimality criteria
Dual-based methods for solving infinite-horizon nonstationary deterministic dynamic programs
Generalised discounting in dynamic programming with unbounded returns
Sensitivity analysis and optimal ultimately stationary deterministic policies in some constrained discounted cost models
Recursive Markov Decision Processes and Recursive Stochastic Games
A survey of recent results on continuous-time Markov decision processes (with comments and rejoinder)
Tax evasion: models with self-audit
Optimality of admission control in a repairable queue
Bias optimality for multichain continuous-time Markov decision processes
Optimality of admission control in an M/M/1/N queue with varying services
Dynamic dispatching and preventive maintenance for parallel machines with dispatching-dependent deterioration
Approximations of Countably Infinite Linear Programs over Bounded Measure Spaces
On the optimality equation for average cost Markov decision processes and its validity for inventory control
Bounds for synchronizing Markov decision processes
On essential information in sequential decision processes
Computational Methods for Risk-Averse Undiscounted Transient Markov Models