Discounted Dynamic Programming

From MaRDI portal
Publication:5343970


DOI10.1214/aoms/1177700285zbMath0133.42805MaRDI QIDQ5343970

David Blackwell

Publication date: 1965

Published in: The Annals of Mathematical Statistics (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1214/aoms/1177700285


90C39: Dynamic programming


Related Items

Optimal growth with many consumers, Global asymptotic stability results for multisector models of optional growth under uncertainty when future utilities are discounted, Recursive utility and the Ramsey problem, On maximizing the average time at a goal, Existence of equilibrium stationary strategies in discounted noncooperative stochastic games with uncountable state space, Isotone policies for the value iteration method for Markov decision processes, On continuous-time discounted stochastic dynamic programming, Comparative statics in dynamic programming models with an application to job search, A generalized model of commitment, Constructions of Nash equilibria in stochastic games of resource extraction with additive transition structure, On Nikaido-Isoda type theorems for discounted stochastic games, Good news and bad news in two-armed bandits, Transformation of partially observable Markov decision processes into piecewise linear ones, Optimal research and development expenditures under an incremental tax incentive scheme, Conditions for the existence of decision horizons for discounted problems in a stochastic environment: A note, Nonrandomized strategy equilibria in noncooperative stochastic games with additive transition and reward structure, Decomposition in multi-item inventory control, On \(\epsilon\)-optimal continuous selectors and their application in discounted dynamic programming, Matching, search, and bargaining, Utility, probabilistic constraints, mean and variance of discounted rewards in Markov decision processes, The existence of good Markov strategies for decision processes with general payoffs, Lipschitz continuous policy functions for strongly concave optimization problems, Stability estimates for controlled Markov chains with a minorant, Optimale Innovationspolitik bei unvollständiger Information. (Optimal innovation policy under incomplete information), Discount-isotone policies for Markov decision processes, The Bellman's principle of optimality in the discounted dynamic programming, Controlled semi-Markov models - the discounted case, On optimality criteria for dynamic programs with long finite horizons, On theory and algorithms for Markov decision problems with the total reward criterion, Invariant problems in dynamic programming - average reward criterion, Conditions for characterizing the structure of optimal strategies in infinite-horizon dynamic programs, Denumerable semi-Markov decision chains with small interest rates, On the existence of optimal processes in non-stationary environments, Estimation and control in multichain processes, Stochastic dynamic models with stock-dependent rewards, Learning in mis-specified models and the possibility of cycles, Finite automata equilibria with discounting, An abstract topological approach to dynamic programming, Irreversibility and the behavior of aggregate stochastic growth models, Prestable strategies in discounted duopoly games, Optimal pricing against a simple learning rule, A stochastic model for the economic management of a renewable animal resource, Controlled jump processes, On dynamic programming: Compactness of the space of policies, On stopped decision processes with discrete time parameter, Estimates for finite-stage dynamic programs, The effect on optimal consumption on increased uncertainty in labor income in the multiperiod case, Markov programming by successive approximations with respect to weighted supremum norms, Multiple feedback at a single-server station, Discounted Markov games; successive approximation and stopping times, Stochastic evolution and control of an economic activity, Discounted, positive, and noncooperative stochastic games, Optimal systems for equipment maintenance and replacement under Markovian deterioration, Markov decision processes and strongly excessive functions, On some aspects in stochastic dynamic programming with terminal region, Perturbation theory for games in normal form and stochastic games, Bounded variation of \(\{V_ n\}\) and its limit, Markov-type fuzzy decision processes with a discounted reward on a closed interval, Mixed Markov decision processes in a semi-Markov environment with discounted criterion, Markov equilibria in discounted stochastic games, Risk, uncertainty, and complexity, Optimal control of a facility with periodic interrupted demand, Existence of optimal stationary policies in discounted Markov decision processes: Approaches by occupation measures, On the complexity of linear quadratic control, Markovian equilibrium in a class of stochastic games: Existence theorems for discounted and undiscounted models, A general-equilibrium intertemporal model of an open economy, On the generic nonconvergence of Bayesian actions and beliefs, Computational aspects in applied stochastic control, Biconvergent stochastic dynamic programming, asymptotic impatience, and `average' growth, A strategic market game with secured lending, On determining the importance of attributes with a stopping problem, Theory of dynamic portfolio for survival under uncertainty, Sequential process control under capacity constraints., Herbert Robbins and sequential analysis, Optimal strategies for an inventory system with cost functions of general form, A strategic market game with active bankruptcy, Two-player stochastic games. I: A reduction, Rationality and bounded information in repeated games, with application to the iterated prisoner's dilemma, A lattice-theoretic approach to a class of dynamic games, Perfect equilibrium in non-randomized strategies in a class of symmetric dynamic games, Characterization of optimal plans for stochastic dynamic programs, Computational comparison of value iteration algorithms for discounted Markov decision processes, A model of project evaluation with limited attention, Optimal learning with costly adjustment, Markov-achievable payoffs for finite-horizon decision models., Optimal dividend payout under compound Poisson income, Equilibrium learning in simple contests, On stochastic games in economics, A modified dynamic programming method for Markovian decision problems, On stochastic games, On stochastic games. II, Stopped decision processes on complete separable metric spaces, A counterexample in discounted dynamic programming, Instationäre dynamische Optimierung bei schwachen Voraussetzungen über die Gewinnfunktionen, Dynamic programming for non-additive stochastic objectives, A survey of algorithmic methods for partially observed Markov decision processes, Generalized Markovian decision processes, Optimal Advertizing Policy for Selling a Single Asset, On a Continuously Discounted Vector Valued Markov Decision Process, Adaptive Policies in Markov Decision Processes with Uncertain Transition Matrices, On the Existence of Good Markov Strategies, Finite-stage stochastic decision processes with recursive reward structure I: optimality equations and deterministic strategies, Continuous time shock markov decision processes with discounted criterion, Optimality of (σS)-type policies for a stationary multi-product inventory model with a canonical diffusion process as demand process, Analysis for some properties of discrete time Markov decision processes, Über ein stochastisches dynamisches entselieidungsmodell mit allgemeinen ertragsfunktionalen, On the chance to visit a goal set infinitely often, Unnamed Item, Generalized Bandit Problems, On continuous dynamic programming with discrete time-parameter, The optimality of (s,S) inventory policies in the infinite period model*, Measurable Gambling Houses, On the Existence of Stationary Optimal Strategies, Parametric continuity in dynamic programming problems, Markov decision processes, Parametric continuity in dynamic programming problems, On Markov policies for minimax decision processes, Decomposable jump decision processes, Possibilities of solution in stochastic decision models with recursive reward functions, Utility Functions Which Ensure the Adequacy of Stationary Strategies, MARKOV DECISION PROCESSES, Some basic concepts of numerical treatment of Markov decision models, Sensitivitätsanalysen in entscheidungsmodellen, Arbitrary state semi-Markov decision processes, Über ein Vektoroptimierungsproblem für endliche stochastische zellulare Systeme, Discrete type shock semi-markov decision processes with borel state space, A semi-markovian game of economic survival, On the convergence of successive approximations in dynamic programming with non-zero terminal reward, Unnamed Item, Conditions for optimality in dynamic programming and for the limit of n-stage optimal policies to be optimal, A set of successive approximation methods for discounted Markovian decision problems, Solving stochastic dynamic programming problems by linear programming — An annotated bibliography, Unnamed Item, A Markovian Decision Process with hidden states and hidden costs, Solving a general discounted dynamic program by linear programming