Contraction Mappings in the Theory Underlying Dynamic Programming

Publication:5535549

DOI: 10.1137/1009030, zbMath: 0154.45101, OpenAlex: W2045374782, Wikidata: Q56040446, Scholia: Q56040446, MaRDI QID: Q5535549

Eric V. Denardo

Publication date: 1967

Published in: SIAM Review

Full work available at URL: https://doi.org/10.1137/1009030



Related Items

Discounted Stochastic Ratio Games, Approximate policy iteration: a survey and some new methods, PARTIALLY OBSERVABLE MARKOV DECISION PROCESSES AND PERIODIC POLICIES WITH APPLICATIONS, Optimal Liquidation in a Level-I Limit Order Book for Large-Tick Stocks, On a nonseparable convex maximization problem with continuous Knapsack constraints, Approximation solution and suboptimality for discounted semi-markov decision problems with countable state space, Survey of linear programming for standard and nonstandard Markovian control problems. Part I: Theory, On the convergence of reinforcement learning with Monte Carlo exploring starts, Regular Policies in Abstract Dynamic Programming, IDENTIFICATION OF DISCRETE CHOICE DYNAMIC PROGRAMMING MODELS WITH NONPARAMETRIC DISTRIBUTION OF UNOBSERVABLES, The Repair VS. Replacement problem: A stochastic control approach, Four Canadian Contributions to Stochastic Modeling, Application of fixed point theory and solitary wave solutions for the time-fractional nonlinear unsteady convection-diffusion system, Value Iteration is Optic Composition, Solution of a Markovian decision problem by successive overrelaxation, Discounted semi-Markov decision processes: linear programming and policy iteration, Turnpikes and computation of piecewise open-loop equilibria in stochastic differential games, Smooth dynamics and computation in models of economic growth, SWITCHING AND SEQUENCING AVAILABLE THERAPIES SO AS TO MAXIMIZE A PATIENT'S EXPECTED TOTAL LIFETIME, Markov decision processes, Heuristics for determining economic processing rates in a flexible manufacturing system, On Markov policies for minimax decision processes, Multigrid methods for two‐player zero‐sum stochastic games, Calculating the variance in Markov-processes with random reward, A set of successive approximation methods for
discounted Markovian decision problems, Transient policies in discrete dynamic programming: Linear programming including suboptimality tests and additional constraints, A Fixed Point Approach to Undiscounted Markov Renewal Programs, The bellman equation for vector-valued semi-markovian dyanmic programiing, Probabilistic models for optimizing patients survival rates, Optimality in transient markov chains and linear programming, A pegging algorithm for the nonlinear resource allocation problem, Improved iterative computation of the expected discounted return in Markov and semi-Markov chains, On the convergence of successive approximations in dynamic programming with non-zero terminal reward, Finite-state approximations to denumerable-state dynamic programs, A method of bisection for discounted Markov decision problems, Stochastic Inventory Models with Limited Production Capacity and Periodically Varying Parameters, Discretizing dynamic programs, Adaptive age replacement, Reducing the number of multiplications in iterative processes, Solvable classes of discrete dynamic programming, An elimination condition to check the validity of the principle of optimality, On constrained Markov decision processes, Iterative Bounds on the Equilibrium Distribution of a Finite Markov Chain, Heuristic Assignments of Redundant Software Versions and Processors in Fault-tolerant Computer Systems for Maximum Reliability, Robust shortest path planning and semicontractive dynamic programming, On the reduction of total‐cost and average‐cost MDPs to discounted MDPs, Optimization of STEOR networks via Markov renewal programming, MARKOV DECISION PROCESSES, Zur Extrapolation in Markoffschen Entscheidungsmodellen mit Diskontierung, Pansystems optimization, generalized principles of optimality, and fundamental equations of dynamic programming, Discrete convexity: Convexity for functions defined on discrete spaces, Dynamic programming and graph optimization problems, The shortest path problem with 
two objective functions, Variational characterizations in Markov decision processes, Asymptotic expansions for dynamic programming recursions with general nonnegative matrices, Boundedly optimal control of piecewise deterministic systems, Scheduling jobs with release times on a machine with finite storage, Fixed point theorems for discounted finite Markov decision processes, Using geometric techniques to improve dynamic programming algorithms for the economic lot-sizing problem and extensions, A generalized theorem of the maximum, A computational theory of decision networks, A model of project evaluation with limited attention, Sequential Stackelberg equilibria in two-person games, Contraction mappings underlying undiscounted Markov decision problems. II, An average polynomial algorithm for solving antagonistic games on graphs, Fuzzy approach to multilevel knapsack problems, On the estimation of the unknown sample size from the number of records, Solving Markovian decision processes by successive elimination of variables, Minimizing the error bound for the dynamic lot size model, Semi-Markov information model for revenue management and dynamic pricing, On efficiency of linear programming applied to discounted Markovian decision problems, A global shooting algorithm for the facility location and capacity acquisition problem on a line with dense demand, On a language for discrete dynamic programming and a microcomputer implementation, On the indeterminacy of capital accumulation paths, Optimal policies in continuous time inventory control models with limited supply, Theory and applications of generalized dynamic programming: An overview, Controlled semi-Markov models - the discounted case, An acquisition policy for a multi-supplier system with a finite-time horizon, The multi-armed bandit, with constraints, (Approximate) iterated successive approximations algorithm for sequential decision processes, Discounting axioms imply risk neutrality, A dynamic game of 
reputation and economic performances in nondemocratic regimes, A priori bounds for approximations of Markov programs, Nonstationary Markov decision problems with converging parameters, Finite-state approximations for denumerable-state infinite-horizon discounted Markov decision processes, Optimality of the fastest available server policy, On theory and algorithms for Markov decision problems with the total reward criterion, Pareto optimal policies for harvesting with multiple objectives, Conditions for characterizing the structure of optimal strategies in infinite-horizon dynamic programs, Isotone optimal policies for structured Markov decision processes, Partially observable Markov decision model for the treatment of early prostate cancer, Generalized dynamic programming for multicriteria optimization, A natural extension of the MacQueen extrapolation, On Bellman's principle with inequality constraints, Capacity expansion for a loss system with exponential demand growth., A multi-objective version of Bellman's inventory problem, Optimal threshold probability in undiscounted Markov decision processes with a target set., On variable discounting in dynamic programming: applications to resource extraction and other economic models, A multi-period TSP with stochastic regular and urgent demands, An abstract topological approach to dynamic programming, Stochastic dynamic programming with non-linear discounting, A structured pattern matrix algorithm for multichain Markov decision processes, Dynamic programming and maximum principle for discrete Goursat systems, Optimal location of dwell points in a single loop AGV system with time restrictions on vehicle availability, Multi-period production control in a centralized fully flexible manufacturing system, Some structured dynamic programs arising in economics, Policy iteration and Newton-Raphson methods for Markov decision processes under average cost criterion, An efficient algorithm for the dynamic economic lot size 
problem, Brouwer's fixed point theorem and finite state space Markovian decision theory, Classes of discrete optimization problems and their decision problems, Block-successive approximation for a discounted Markov decision model, The effect on optimal consumption on increased uncertainty in labor income in the multiperiod case, Bounds on the fixed point of a monotone contraction operator, Turnpike properties for a class of piecewise deterministic systems arising in manufacturing flow control, Markov programming by successive approximations with respect to weighted supremum norms, Approximation of two-person zero-sum continuous-time Markov games with average payoff criterion, Optimizing over pure stationary equilibria in consensus stopping games, A zero-sum stochastic game model of duopoly, Markov decision processes and strongly excessive functions, Contraction mappings underlying undiscounted Markov decision problems, Long-term values in Markov decision processes, (co)algebraically, Composing batches with yield uncertainty, Finite state continuous time Markov decision processes with an infinite planning horizon, Dynamic programming and the Lagrange multipliers, Partial termination rule of Lagrangian relaxation for manufacturing cell formation problems, Piecewise affine approximations for the control of a one-reservoir hydroelectric system, Dynamic programming processes within dynamic programming processes, Engineering applications of discrete time optimal control, Fixed points for extrema of contractions, Recursive utility and the Ramsey problem, Applications of fixed-point methods to discrete variational and quasi- variational inequalities, Data-driven optimal control with a relaxed linear program, On a set of optimal policies in continuous time Markovian decision problem, A polynomial-time algorithm for computing an optimal admission policy in a GI/M/1/N queue, Monotonicity and the principle of optimality, Finite state approximations for denumerable state 
infinite horizon discounted Markov decision processes with unbounded rewards, Transformation of partially observable Markov decision processes into piecewise linear ones, Stochastic control theory and operational research, Representations and characterizations of vertices of bounded-shape partition polytopes, Truncated policy iteration methods, Designing an optimal production system with inspection, Inventory control of service parts in the final phase, The nonlinear knapsack problem - algorithms and applications, System planning and configuration problems for optimal system design, Finite state approximation algorithms for average cost denumerable state Markov decision processes, Optimal pricing of a product with periodic enhancements, Conjugate duality and the curse of dimensionality, Optimal control of a facility with periodic interrupted demand, Contingent planning under uncertainty via stochastic satisfiability, A new characterization for the dynamic lot size problem with bounded inventory