Discounted Dynamic Programming

From MaRDI portal
Revision as of 00:36, 9 February 2024 by Import240129110113 (talk | contribs) (Created automatically from import240129110113)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

Publication:5343970

DOI10.1214/aoms/1177700285zbMath0133.42805OpenAlexW2074232820MaRDI QIDQ5343970

David Blackwell

Publication date: 1965

Published in: The Annals of Mathematical Statistics (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1214/aoms/1177700285



Related Items

Existence of optimal stationary policies in discounted Markov decision processes: Approaches by occupation measures, Decomposition in multi-item inventory control, On \(\epsilon\)-optimal continuous selectors and their application in discounted dynamic programming, On the complexity of linear quadratic control, The effect of interest rates on consumption in an income fluctuation problem, Optimal strategies for the multi-task inventory control model, Markovian equilibrium in a class of stochastic games: Existence theorems for discounted and undiscounted models, A general-equilibrium intertemporal model of an open economy, On the generic nonconvergence of Bayesian actions and beliefs, Computational aspects in applied stochastic control, Matching, search, and bargaining, Utility, probabilistic constraints, mean and variance of discounted rewards in Markov decision processes, The existence of good Markov strategies for decision processes with general payoffs, Linear quadratic game of exploitation of common renewable resources with inherent constraints, Biconvergent stochastic dynamic programming, asymptotic impatience, and `average' growth, Lipschitz continuous policy functions for strongly concave optimization problems, Stability estimates for controlled Markov chains with a minorant, Optimale Innovationspolitik bei unvollständiger Information. (Optimal innovation policy under incomplete information), Discount-isotone policies for Markov decision processes, Belief distorted Nash equilibria: introduction of a new kind of equilibrium in dynamic games with distorted information, A generalized model of commitment, Envelope condition method with an application to default risk models, The Bellman's principle of optimality in the discounted dynamic programming, A strategic market game with secured lending, Optimal growth with many consumers, Robust optimal strategies in Markov decision problems, Global asymptotic stability results for multisector models of optional growth under uncertainty when future utilities are discounted, Controlled semi-Markov models - the discounted case, On determining the importance of attributes with a stopping problem, Theory of dynamic portfolio for survival under uncertainty, Discounting axioms imply risk neutrality, Symmetric paths and evolution to equilibrium in the discounted prisoners' dilemma, Constructions of Nash equilibria in stochastic games of resource extraction with additive transition structure, On Nikaido-Isoda type theorems for discounted stochastic games, Robust Markov control processes, Subgame perfect equilibria in discounted stochastic games, Blockbusting: brokers and the dynamics of segregation, On optimality criteria for dynamic programs with long finite horizons, On theory and algorithms for Markov decision problems with the total reward criterion, Invariant problems in dynamic programming - average reward criterion, On the existence and uniqueness of value functions in models of labor market dynamics, Conditions for characterizing the structure of optimal strategies in infinite-horizon dynamic programs, Discounted dynamic programming with unbounded returns: application to economic models, Asset pricing in a Lucas fruit-tree economy with the best and worst in mind, Control: a perspective, Sequential process control under capacity constraints., Herbert Robbins and sequential analysis, Denumerable semi-Markov decision chains with small interest rates, On variable discounting in dynamic programming: applications to resource extraction and other economic models, On the existence of optimal processes in non-stationary environments, Estimation and control in multichain processes, Stochastic dynamic models with stock-dependent rewards, Learning in mis-specified models and the possibility of cycles, Stochastic games of resource extraction, Finite automata equilibria with discounting, An abstract topological approach to dynamic programming, Irreversibility and the behavior of aggregate stochastic growth models, Dynamic mechanism design with interdependent valuations, Prestable strategies in discounted duopoly games, Optimal pricing against a simple learning rule, Dynamics in \textit{Art of war}, A note on negative dynamic programming for risk-sensitive control, Stochastic games with unbounded payoffs: applications to robust control in economics, A stochastic model for the economic management of a renewable animal resource, Controlled jump processes, On dynamic programming: Compactness of the space of policies, On stopped decision processes with discrete time parameter, Estimates for finite-stage dynamic programs, The effect on optimal consumption on increased uncertainty in labor income in the multiperiod case, Markov programming by successive approximations with respect to weighted supremum norms, Multiple feedback at a single-server station, Discounted Markov games; successive approximation and stopping times, Stochastic evolution and control of an economic activity, Discounted, positive, and noncooperative stochastic games, Good news and bad news in two-armed bandits, Optimal systems for equipment maintenance and replacement under Markovian deterioration, Markov decision processes and strongly excessive functions, On some aspects in stochastic dynamic programming with terminal region, Perturbation theory for games in normal form and stochastic games, Bounded variation of \(\{V_ n\}\) and its limit, Markov-type fuzzy decision processes with a discounted reward on a closed interval, Recursive utility and the Ramsey problem, Mixed Markov decision processes in a semi-Markov environment with discounted criterion, Optimal strategies for an inventory system with cost functions of general form, Optimal Markov strategies, Optimality, equilibrium, and curb sets in decision problems without commitment, Markov equilibria in discounted stochastic games, On maximizing the average time at a goal, Transformation of partially observable Markov decision processes into piecewise linear ones, A strategic market game with active bankruptcy, Existence of equilibrium stationary strategies in discounted noncooperative stochastic games with uncountable state space, Optimal research and development expenditures under an incremental tax incentive scheme, Conditions for the existence of decision horizons for discounted problems in a stochastic environment: A note, Two-player stochastic games. I: A reduction, Risk, uncertainty, and complexity, Isotone policies for the value iteration method for Markov decision processes, On continuous-time discounted stochastic dynamic programming, Comparative statics in dynamic programming models with an application to job search, Nonrandomized strategy equilibria in noncooperative stochastic games with additive transition and reward structure, Optimal control of a facility with periodic interrupted demand, Unnamed Item, On continuous dynamic programming with discrete time-parameter, The optimality of (s,S) inventory policies in the infinite period model*, Measurable Gambling Houses, Pareto extrapolation: An analytical framework for studying tail inequality, On the Existence of Stationary Optimal Strategies, Layered networks, equilibrium dynamics, and stable coalitions, Correction to: ``Layered networks, equilibrium dynamics, and stable coalitions, Stability in repeated matching markets, Losing money to make money: the benefits of redistribution in collective bargaining in sports, Parametric continuity in dynamic programming problems, Unnamed Item, Markov decision processes, Parametric continuity in dynamic programming problems, On Markov policies for minimax decision processes, Decomposable jump decision processes, Possibilities of solution in stochastic decision models with recursive reward functions, Solutions of semi-Markov control models with recursive discount rates and approximation by $\epsilon-$optimal policies, Generalized Bandit Problems, Utility Functions Which Ensure the Adequacy of Stationary Strategies, Does backwards induction imply subgame perfection?, Constrained discounted stochastic games, PARTIALLY OBSERVABLE MARKOV DECISION PROCESSES AND PERIODIC POLICIES WITH APPLICATIONS, Equilibrium learning in simple contests, Unbounded dynamic programming via the Q-transform, Stochastic output feedback MPC with intermittent observations, A model of project evaluation with limited attention, A survey of algorithmic methods for partially observed Markov decision processes, On incentive compatibility in dynamic mechanism design with exit option in a Markovian environment, On the Existence of Good Markov Strategies, A Mixed Value and Policy Iteration Method for Stochastic Control with Universally Measurable Policies, Optimal learning with costly adjustment, IT'S ABOUT TIME: IMPLICATIONS OF THE PERIOD LENGTH IN AN EQUILIBRIUM SEARCH MODEL, Stationary Markov perfect equilibria in discounted stochastic games, Some basic concepts of numerical treatment of Markov decision models, Sensitivitätsanalysen in entscheidungsmodellen, The income fluctuation problem and the evolution of wealth, Necessity of the terminal condition in the infinite horizon dynamic optimization problems with unbounded payoff, Generalized Markovian decision processes, Uniqueness of equilibrium in a Bewley-Aiyagari model, On a Continuously Discounted Vector Valued Markov Decision Process, Markov perfect equilibria in a dynamic decision model with quasi-hyperbolic discounting, Subadditive and multiplicative ergodic theorems, Improvement paths in repeated pure coordination games, Dynamic focus programming: a new approach to sequential decision problems under uncertainty, Direct coupling coherent quantum observers with discounted mean square performance criteria and penalized back-action, A Markovian decision model of adaptive cancer treatment and quality of life, A class of linear quadratic dynamic optimization problems with state dependent constraints, Spatial dynamic models with intertemporal optimization: specification and estimation, Dynamic contractual incentives in the face of a Samaritans's dilemma, Arbitrary state semi-Markov decision processes, Optimal epidemic control in equilibrium with imperfect testing and enforcement, A survey of average cost problems in deterministic discrete-time control systems, Finite-stage stochastic decision processes with recursive reward structure I: optimality equations and deterministic strategies, Continuous time shock markov decision processes with discounted criterion, Optimality of (σS)-type policies for a stationary multi-product inventory model with a canonical diffusion process as demand process, On the terminal condition for the Bellman equation for dynamic optimization with an infinite horizon, Limit theorems for monotone Markov processes, On discounted dynamic programming with unbounded returns, Unique solutions for stochastic recursive utilities, Two characterizations of optimality in dynamic programming, Long-Term Partnership for Achieving Efficient Capacity Allocation, Bayesian Social Learning from Consumer Reviews, Markov-achievable payoffs for finite-horizon decision models., Interview with Andrzej Nowak - Laureate of the Rufus Isaacs Award, Inference in dynamic discrete choice problems under local misspecification, Conditions for optimality in dynamic programming and for the limit of n-stage optimal policies to be optimal, Optimal dividend payout under compound Poisson income, Stochastic dynamic programming with non-linear discounting, Über ein Vektoroptimierungsproblem für endliche stochastische zellulare Systeme, On stochastic games in economics, Dynamic programming with state-dependent discounting, Piracy on the internet: accommodate it or fight it? A dynamic approach, Open and closed loop Nash equilibria in games with a continuum of players, Discrete type shock semi-markov decision processes with borel state space, The first arrival model of continuous time Markovian decision programming -- the discounted rate is 0, A set of successive approximation methods for discounted Markovian decision problems, Dynamic mechanism design: dynamic arrivals and changing values, A folk theorem for stochastic games with private almost-perfect monitoring, Rationality and bounded information in repeated games, with application to the iterated prisoner's dilemma, Solving stochastic dynamic programming problems by linear programming — An annotated bibliography, Unnamed Item, Generalised discounting in dynamic programming with unbounded returns, Optimizing over pure stationary equilibria in consensus stopping games, On the chance to visit a goal set infinitely often, Finitely Additive Dynamic Programming, Stackelberg equilibrium in a dynamic stimulation model with complete information, \(K\)-correspondences, USCOs, and fixed point problems arising in discounted stochastic games, Conditional expectation of correspondences and economic applications, Stationary Almost Markov Perfect Equilibria in Discounted Stochastic Games, Sufficient statistics for unobserved heterogeneity in structural dynamic logit models, Minimax representation of nonexpansive functions and application to zero-sum recursive games, A modified dynamic programming method for Markovian decision problems, A Markovian Decision Process with hidden states and hidden costs, A semi-markovian game of economic survival, On stochastic games, Solving a general discounted dynamic program by linear programming, Nash equilibrium in a special case of symmetric resource extraction games, On the convergence of successive approximations in dynamic programming with non-zero terminal reward, Perov's contraction principle and dynamic programming with stochastic discounting, On stochastic games. II, Über ein stochastisches dynamisches entselieidungsmodell mit allgemeinen ertragsfunktionalen, Analysis for some properties of discrete time Markov decision processes, Stopped decision processes on complete separable metric spaces, A lattice-theoretic approach to a class of dynamic games, Perfect equilibrium in non-randomized strategies in a class of symmetric dynamic games, A counterexample in discounted dynamic programming, Instationäre dynamische Optimierung bei schwachen Voraussetzungen über die Gewinnfunktionen, Unnamed Item, Asynchronous games with transfers: uniqueness and optimality, Dynamic programming for non-additive stochastic objectives, Optimal Advertizing Policy for Selling a Single Asset, Characterization of optimal plans for stochastic dynamic programs, Computational comparison of value iteration algorithms for discounted Markov decision processes, Adaptive Policies in Markov Decision Processes with Uncertain Transition Matrices, MARKOV DECISION PROCESSES, Continuous vs. discrete time: some computational insights, A fixed point theorem for measurable selection valued correspondences induced by upper Caratheodory correspondences, On pure stationary almost Markov Nash equilibria in nonzero-sum ARAT stochastic games, Strategies for Dividend Distribution: A Review