Discrete Dynamic Programming

Publication:5342868


DOI: 10.1214/aoms/1177704593, zbMath: 0133.12906, Wikidata: Q110952978, Scholia: Q110952978, MaRDI QID: Q5342868

David Blackwell

Publication date: 1962

Published in: The Annals of Mathematical Statistics

Full work available at URL: https://doi.org/10.1214/aoms/1177704593



Related Items

Semi-Markov strategies in stochastic games, On the solvability of Bellman's functional equations for Markov renewal programming, A note on the vanishing interest rate approach in average Markov decision chains with continuous and bounded costs, A unified approach to Markov decision problems and performance sensitivity analysis with discounted and average criteria: multichain cases, Long-term average cost control problems for continuous time Markov processes: A survey, Decentralized evolutionary mechanisms for intertemporal economies: A possibility result, A fuzzy approach to Markov decision processes with uncertain transition probabilities, Marginal productivity index policies for scheduling a multiclass delay-/loss-sensitive queue, Remarks on sensitive equilibria in stochastic games with additive reward and transition structure, Perfect equilibria in stochastic games, Dynamic priority allocation via restless bandit marginal productivity indices, A survey of recent results on continuous-time Markov decision processes (with comments and rejoinder), Sensitivity analysis in discounted Markovian decision problems, Optimal search with positive switch cost is NP-hard, Bilinear programming and structured stochastic games, On efficiency of linear programming applied to discounted Markovian decision problems, Strong 1-optimal stationary policies in denumerable Markov decision processes, Conditions for existence of average and Blackwell optimal stationary policies in denumerable Markov decision processes, Communicating MDPs: Equivalence and LP properties, A generalized inverse method for asymptotic linear programming, The optimal frequency of information purchases, An orderfield property for stochastic games when one player controls transition probabilities, Ordered field property for stochastic games when the player who controls transitions changes from state to state, On optimality criteria for dynamic programs with long finite horizons, Invariant problems in dynamic programming - average reward criterion, Resolvent expansions of matrices and applications, Capital accumulation and the optimization of renewable resource models, Singularly perturbed Markov control problem: Limiting average cost, Denumerable semi-Markov decision chains with small interest rates, Nonlinear programming and stationary equilibria in stochastic games, Estimation and control in multichain processes, Review of a Markov decision algorithm for optimal inspections and revisions in a maintenance system with partial information, Optimal inspection policies for a manufacturing station, Some remarks on the new optimality criterion of Mine and Tabata, On the convergence of the average expected return in dynamic programming, Continuous versus measurable recourse in N-stage stochastic programming, Continuous time control of Markov processes on an arbitrary state space: average return criterion, An optimality principle for Markovian decision processes, Stochastic convex programming: Kuhn-Tucker conditions, Problemi di ottimizzazione nella teoria delle code, Foolproof convergence in multichain policy iteration, Optimization of stochastic maintenance policies, Bounded variation of \(\{V_n\}\) and its limit, Markov-type fuzzy decision processes with a discounted reward on a closed interval, Optimization models for the first arrival target distribution function in discrete time, Exact formula for sensitivity analysis of Markov chains, Computational aspects in applied stochastic control, Cyclic Markov equilibria in stochastic games, Randomization and simplification in dynamic decision-making, A decomposition algorithm for limiting average Markov decision problems, Optimal threshold probability in undiscounted Markov decision processes with a target set, Herbert Robbins and sequential analysis, Two-player stochastic games. II: The case of recursive games, Fuzzy decision processes with an average reward criterion, Sensitivity of finite Markov chains under perturbation, Stationary \(\varepsilon\)-optimal strategies in stochastic games, Optimal replenishment for a periodic review inventory system with two supply modes, Sequential identification and adaptive control in stochastic systems, Are limits of \(\alpha\)-discounted optimal policies Blackwell optimal? A counterexample, Controlled semi-Markov models under long-run average rewards, Blackwell optimality in Markov decision processes with partial observation, Controlled Markov set-chains under average criteria, Index-based policies for discounted multi-armed bandits on parallel machines, Dynamic diagnostic and decision procedures under uncertainty, A canonical form for pencils of matrices with applications to asymptotic linear programs, On the existence of relative values for undiscounted multichain Markov decision processes, An efficient basis update for asymptotic linear programming, Solvable states in stochastic games, A finite step algorithm via a bimatrix game to a single controller non-zero sum stochastic game, Bias optimality and strong \(n\) \((n = -1, 0)\) discount optimality for Markov decision processes, Markovian sequential control processes. Denumerable state space, Fuzzy optimality relation for perceptive MDPs-the average case, Sample-path optimality and variance-maximization for Markov decision processes, Semi-infinite semi-Markov stochastic games, Blackwell optimality in the class of Markov policies for continuous-time controlled Markov chains, The optimization of K-effect models by linear and dynamic programming, Finite state continuous time Markov decision processes with an infinite planning horizon, Some remarks on a Markovian decision problem with an absorbing state, Linear programming considerations on Markovian decision processes with no discounting, Linear programming algorithms for semi-Markovian decision processes, On direct sums of Markovian decision process, On the set of optimal policies in discrete dynamic programming, On a set of optimal policies in continuous time Markovian decision problem, A new optimality criterion for discrete dynamic programming, Algorithms for discounted stochastic games, On the solvability of Bellman's functional equation for a Markovian decision process, Optimal control of stationary Markov processes, Finite state multi-armed bandit problems: Sensitive-discount, average-reward and average-overtaking optimality, An improved algorithm for solving communicating average reward Markov decision processes, On Markovian decision programming with recursive reward functions, On regularly perturbed fundamental matrices, Strong average optimality for controlled nonhomogeneous Markov chains, Transient policies in discrete dynamic programming: Linear programming including suboptimality tests and additional constraints, A Fixed Point Approach to Undiscounted Markov Renewal Programs, Markov decision processes, Optimality equations and sensitive optimality in bounded Markov decision processes, Finite state dynamic programming with the total reward criterion, Some basic concepts of numerical treatment of Markov decision models, Entscheidungsmodelle über angeordneten Körpern, Sensitivitätsanalysen in Entscheidungsmodellen, Survey of linear programming for standard and nonstandard Markovian control problems. Part I: Theory, Solution procedures for multi-objective Markov decision processes, Blackwell optimal policies in a Markov decision process with a Borel state space, Strong 0-discount optimal policies in a Markov decision process with a Borel state space, An application of Markov potential theory to Markovian decision processes, On the chance to visit a goal set infinitely often, Unnamed Item, Continuous-Time Markov Decision Processes with Unbounded Transition and Discounted-Reward Rates, Solution of a Markovian decision problem by successive overrelaxation, Another Set of Conditions for Strong n (n = −1, 0) Discount Optimality in Markov Decision Processes, Optimality of intuitive checkpointing policies, Unnamed Item, Optimality in transient Markov chains and linear programming, A further anticycling rule in multichain policy iteration for undiscounted Markov renewal programs, Unnamed Item, Continuous time Markov decision processes with interventions, Deterministic discrete dynamic programming with discount factor greater than one: Some further results and algorithms, A decision exclusion algorithm for a class of Markovian Decision Processes, A set of successive approximation methods for discounted Markovian decision problems