Discrete Dynamic Programming

From MaRDI portal

Publication:5342868


DOI: 10.1214/aoms/1177704593, zbMath: 0133.12906, OpenAlex: W1986389067, Wikidata: Q110952978, Scholia: Q110952978, MaRDI QID: Q5342868

David Blackwell

Publication date: 1962

Published in: The Annals of Mathematical Statistics

Full work available at URL: https://doi.org/10.1214/aoms/1177704593

zbMATH Keywords

operations research

Related Items

Late marriage and transition from arranged marriages to love matches: A search-theoretic approach ⋮ Continuous time Markov decision processes with interventions ⋮ OPTIMALITY OF TRUNK RESERVATION FOR AN M/M/K/N QUEUE WITH SEVERAL CUSTOMER TYPES AND HOLDING COSTS ⋮ Optimality equations and sensitive optimality in bounded Markov decision processes ⋮ BLACKWELL OPTIMAL STRATEGIES IN PRIORITY MEAN-PAYOFF GAMES ⋮ An improved algorithm for solving communicating average reward Markov decision processes ⋮ On Markovian decision programming with recursive reward functions ⋮ On regularly perturbed fundamental matrices ⋮ Average Cost Markov Decision Processes with Weakly Continuous Transition Probabilities ⋮ Finite state dynamic programming with the total reward criterion ⋮ Finite-Memory Strategies in POMDPs with Long-Run Average Objectives ⋮ Sporadic overtaking optimality in Markov decision problems ⋮ Some basic concepts of numerical treatment of Markov decision models ⋮ Blackwell Optimality for Controlled Diffusion Processes ⋮ Entscheidungsmodelle über angeordneten körpern ⋮ Sensitivitätsanalysen in entscheidungsmodellen ⋮ Blackwell optimal policies in a Markov decision process with a Borel state space ⋮ Survey of linear programming for standard and nonstandard Markovian control problems. Part I: Theory ⋮ Strong 0-discount optimal policies in a Markov decision process with a Borel state space ⋮ A Markovian decision model of adaptive cancer treatment and quality of life ⋮ Approximations for the distribution of perpetuities with small discount rates ⋮ An epistemic approach to stochastic games ⋮ Four Canadian Contributions to Stochastic Modeling ⋮ Does free information provision crowd out costly information acquisition? 
It's a matter of timing ⋮ Solution of a Markovian decision problem by successive overrelaxation ⋮ Deterministic discrete dynamic programming with discount factor greater than one: Some further results and algorithms ⋮ An axiomatic approach to Markov decision processes ⋮ Unnamed Item ⋮ STRONG AVERAGE OPTIMALITY FOR CONTROLLED NONHOMOGENEOUS MARKOV CHAINS ⋮ Interview with Andrzej Nowak - Laureate of the Rufus Isaacs Award ⋮ Another Set of Conditions for Strong \(n\) \((n = -1, 0)\) Discount Optimality in Markov Decision Processes ⋮ Fuzzy optimality relation for perceptive MDPs-the average case ⋮ Unnamed Item ⋮ BLACKWELL OPTIMALITY IN STOCHASTIC GAMES ⋮ Sample-path optimality and variance-maximization for Markov decision processes ⋮ A Policy Improvement Algorithm for Solving a Mixture Class of Perfect Information and AR-AT Semi-Markov Games ⋮ A decision exclusion algorithm for a class of Markovian Decision Processes ⋮ Semi-infinite semi-Markov stochastic games. ⋮ Optimality of intuitive checkpointing policies ⋮ Solution procedures for multi-objective Markov decision processes ⋮ A set of successive approximation methods for discounted Markovian decision problems ⋮ Transient policies in discrete dynamic programming: Linear programming including suboptimality tests and additional constraints ⋮ An application of Markov potential theory to Markovian decision processes ⋮ A Fixed Point Approach to Undiscounted Markov Renewal Programs ⋮ Optimality in transient markov chains and linear programming ⋮ Blackwell optimality in the class of Markov policies for continuous-time controlled Markov chains ⋮ On the chance to visit a goal set infinitely often ⋮ Finitely Additive Dynamic Programming ⋮ Markov Branching Decision Chains with Interest-Rate-Dependent Rewards ⋮ Continuous-Time Markov Decision Processes with Unbounded Transition and Discounted-Reward Rates ⋮ Credibilistic Markov decision processes: The average case ⋮ Ergodic Control, Bias, and Sensitive Discount Optimality for Markov 
Diffusion Processes ⋮ The optimization of K-effect models by linear and dynamic programming ⋮ Finite state continuous time Markov decision processes with an infinite planning horizon ⋮ Some remarks on a Markovian decision problem with an absorbing state ⋮ Linear programming considerations on Markovian decision processes with no discounting ⋮ Strong Uniform Value in Gambling Houses and Partially Observable Markov Decision Processes ⋮ Linear programming algorithms for semi-Markovian decision processes ⋮ On direct sums of Markovian decision process ⋮ On the set of optimal policies in discrete dynamic programming ⋮ Optimality of intuitive checkpointing policies ⋮ On a set of optimal policies in continuous time Markovian decision problem ⋮ Pure Equilibrium Strategies for Stochastic Games via Potential Functions ⋮ On zero-sum two-person undiscounted semi-Markov games with a multichain structure ⋮ Uniform Tauberian theorem in differential games ⋮ A new optimality criterion for discrete dynamic programming ⋮ Algorithms for discounted stochastic games ⋮ History-dependent Evaluations in Partially Observable Markov Decision Process ⋮ A further anticycling rule in multichain policy iteration for undiscounted Markov renewal programs ⋮ On the solvability of Bellman's functional equation for a Markovian decision process ⋮ Optimal control of stationary Markov processes ⋮ Finite state multi-armed bandit problems: Sensitive-discount, average-reward and average-overtaking optimality ⋮ Maximum-Stopping-Value Policies in Finite Markov Population Decision Chains ⋮ Unnamed Item ⋮ Ordered Field Property for Semi-Markov Games when One Player Controls Transition Probabilities and Transition Times ⋮ Commutative Stochastic Games ⋮ Optimal Inventory Control and Allocation for Sequential Internet Auctions ⋮ MARKOV DECISION PROCESSES ⋮ Semi-supervised learning with regularized Laplacian ⋮ Index-based policies for discounted multi-armed bandits on parallel machines. 
⋮ Bilinear programming and structured stochastic games ⋮ Optimal inventory control with fixed ordering cost for selling by Internet auctions ⋮ An efficient basis update for asymptotic linear programming ⋮ Solvable states in stochastic games ⋮ On undiscounted semi-Markov decision processes with absorbing states ⋮ A finite step algorithm via a bimatrix game to a single controller non- zero sum stochastic game ⋮ Bias optimality and strong \(n\) \((n= -1,0)\) discount optimality for Markov decision processes ⋮ Computational aspects in applied stochastic control ⋮ Unbounded dynamic programming via the Q-transform ⋮ An information-theoretic analysis of return maximization in reinforcement learning ⋮ A fuzzy approach to Markov decision processes with uncertain transition probabilities ⋮ Tauberian theorem for value functions ⋮ Marginal productivity index policies for scheduling a multiclass delay-/loss-sensitive queue ⋮ Acceptable strategy profiles in stochastic games ⋮ Remarks on sensitive equilibria in stochastic games with additive reward and transition structure ⋮ Dynamic diagnostic and decision procedures under uncertainty ⋮ On efficiency of linear programming applied to discounted Markovian decision problems ⋮ Strong 1-optimal stationary policies in denumerable Markov decision processes ⋮ Stability-constrained Markov decision processes using MPC ⋮ Cyclic Markov equilibria in stochastic games ⋮ A canonical form for pencils of matrices with applications to asymptotic linear programs ⋮ Markovian sequential control processes. 
Denumerable state space ⋮ Conditions for existence of average and Blackwell optimal stationary policies in denumerable Markov decision processes ⋮ Communicating MDPs: Equivalence and LP properties ⋮ A pseudometric in supervisory control of probabilistic discrete event systems ⋮ On the solvability of Bellman's functional equations for Markov renewal programming ⋮ On canonical forms for zero-sum stochastic mean payoff games ⋮ A generalized inverse method for asymptotic linear programming ⋮ Discounting axioms imply risk neutrality ⋮ Computing semi-stationary optimal policies for multichain semi-Markov decision processes ⋮ Optimal eviction policies for stochastic address traces ⋮ The optimal frequency of information purchases ⋮ Dynamic competition with consumer inertia ⋮ An orderfield property for stochastic games when one player controls transition probabilities ⋮ Ordered field property for stochastic games when the player who controls transitions changes from state to state ⋮ Reachability and safety objectives in Markov decision processes on long but finite horizons ⋮ On optimality criteria for dynamic programs with long finite horizons ⋮ On Nash equilibria and improvement cycles in pure positional strategies for chess-like and backgammon-like \(n\)-person games ⋮ Invariant problems in dynamic programming - average reward criterion ⋮ Randomization and simplification in dynamic decision-making. ⋮ Resolvent expansions of matrices and applications ⋮ A decomposition algorithm for limiting average Markov decision problems. ⋮ Capital accumulation and the optimization of renewable resource models ⋮ Perfect equilibria in stochastic games ⋮ Control: a perspective ⋮ Optimal threshold probability in undiscounted Markov decision processes with a target set. 
⋮ Dynamic priority allocation via restless bandit marginal productivity indices ⋮ Herbert Robbins and sequential analysis ⋮ Singularly perturbed Markov control problem: Limiting average cost ⋮ Denumerable semi-Markov decision chains with small interest rates ⋮ An elementary approach to discrete models of dividend strategies ⋮ Dynamic programming and Hamilton-Jacobi-Bellman equations on time scales ⋮ Nonlinear programming and stationary equilibria in stochastic games ⋮ Estimation and control in multichain processes ⋮ Policy improvement for perfect information additive reward and additive transition stochastic games with discounted and average payoffs ⋮ General limit value in dynamic programming ⋮ A note on the vanishing interest rate approach in average Markov decision chains with continuous and bounded costs ⋮ Quantum games: a review of the history, current state, and interpretation ⋮ A nested family of \(k\)-total effective rewards for positional games ⋮ The value functions of Markov decision processes ⋮ Review of a Markov decision algorithm for optimal inspections and revisions in a maintenance system with partial information ⋮ Optimal inspection policies for a manufacturing station ⋮ Some remarks on the new optimality criterion of Mine and Tabata ⋮ On the convergence of the average expected return in dynamic programming ⋮ A unified approach to Markov decision problems and performance sensitivity analysis with discounted and average criteria: multichain cases ⋮ Semi-Markov decision processes with limiting ratio average rewards ⋮ On the existence of relative values for undiscounted multichain Markov decision processes ⋮ Continuous versus measurable recourse in N-stage stochastic programming ⋮ Continuous time control of Markov processes on an arbitrary state space: average return criterion ⋮ An optimality principle for Markovian decision processes ⋮ Optimal threshold probability and expectation in semi-Markov decision processes ⋮ Should I remember more than you? 
Best responses to factored strategies ⋮ Stochastic convex programming: Kuhn-Tucker conditions ⋮ Problemi di ottimizzazione nella teoria delle code ⋮ Semi-Markov strategies in stochastic games ⋮ Singularly perturbed linear programs and Markov decision processes ⋮ Foolproof convergence in multichain policy iteration ⋮ A survey of recent results on continuous-time Markov decision processes (with comments and rejoinder) ⋮ Optimization of stochastic maintenance policies ⋮ Sensitivity of finite Markov chains under perturbation ⋮ Stationary \(\varepsilon\)-optimal strategies in stochastic games ⋮ Bounded variation of \(\{V_ n\}\) and its limit ⋮ Pure equilibria in a simple dynamic model of strategic market game ⋮ Admission control in a two-class loss system with periodically varying parameters and abandonments ⋮ Optimal replenishment for a periodic review inventory system with two supply modes. ⋮ Markov-type fuzzy decision processes with a discounted reward on a closed interval ⋮ Sequential identification and adaptive control in stochastic systems ⋮ Optimization models for the first arrival target distribution function in discrete time ⋮ Are limits of \(\alpha\)-discounted optimal policies Blackwell optimal? A counterexample ⋮ Controlled semi-Markov models under long-run average rewards ⋮ Long-term average cost control problems for continuous time Markov processes: A survey ⋮ Blackwell optimality in Markov decision processes with partial observation. ⋮ Sensitivity analysis in discounted Markovian decision problems ⋮ Controlled Markov set-chains under average criteria ⋮ Two-player stochastic games. II: The case of recursive games ⋮ Fuzzy decision processes with an average reward criterion. ⋮ Optimal search with positive switch cost is NP-hard ⋮ Decentralized evolutionary mechanisms for intertemporal economies: A possibility result ⋮ Exact formula for sensitivity analysis of Markov chains
