Approximations of Dynamic Programs, I

From MaRDI portal

Publication:4175068

Jump to:navigation, search

DOI10.1287/moor.3.3.231zbMath0393.90094OpenAlexW2149148771MaRDI QIDQ4175068

Publication date: 1978

Published in: Mathematics of Operations Research (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1287/moor.3.3.231

zbMATH Keywords

Dynamic Programming Stochastic Games Markov Decision Processes

Mathematics Subject Classification ID

Minimax problems in mathematical programming (90C47) Dynamic programming in optimal control and differential games (49L20) Dynamic programming (90C39) Probabilistic games; gambling (91A60)

Related Items

Some Limit Properties of Markov Chains Induced by Recursive Stochastic Algorithms, A review of stochastic algorithms with continuous value function approximation and some new approximate policy iteration algorithms for multidimensional continuous applications, A convex optimization approach to dynamic programming in continuous state and action spaces, Computable approximations for continuous-time Markov decision processes on Borel spaces based on empirical measures, On hedging in finite security markets, Reward revision and the average reward Markov decision process, Adaptive policy-iteration and policy-value-iteration for discounted Markov decision processes, On the construction of \(\epsilon\)-optimal strategies in partially observed MDPs, On truncations and perturbations of Markov decision problems with an application to queueing network overflow control, Feature-based methods for large scale dynamic programming, Some basic concepts of numerical treatment of Markov decision models, State aggregation in dynamic programming - an application to scheduling of independent jobs on parallel processors, Discretization procedures for adaptive Markov control processes, Markov Teams ? An analytical approach to process migration in distributed computing systems, (Approximate) iterated successive approximations algorithm for sequential decision processes, Robustness inequality for Markov control processes with unbounded costs, Conditions for characterizing the structure of optimal strategies in infinite-horizon dynamic programs, Exponential lower bounds on the complexity of a class of dynamic programs for combinatorial optimization problems, Adaptive-resolution reinforcement learning with polynomial exploration in deterministic domains, Easy Affine Markov Decision Processes, A tutorial on event-based optimization -- a new optimization framework, A multi-period TSP with stochastic regular and urgent demands, Approximation of Markov decision processes with general state space, Error bounds for state space truncation of finite Jackson networks, Discounted Continuous-Time Controlled Markov Chains: Convergence of Control Models, Dynamic coordination of production planning and sales admission control in the presence of a spot market, Markov decision processes, Unnamed Item, A survey of computational complexity results in systems and control, Discrete type shock semi-markov decision processes with borel state space, Unnamed Item, Approximating infinite horizon stochastic optimal control in discrete time with constraints, Bounds for aggregating nodes in network problems, Empirical Dynamic Programming, The complexity of dynamic programming, Concepts and methods for discrete and continuous time control under uncertainty, A Bayesian dynamic programming approach to optimal maintenance combined with burn-in, Explicit solutions for multivariate, discrete-time control problems under uncertainty, Unnamed Item, Optimal control of discrete time population processes, Approximations of inventory models, Approximation of Dynamic Programs, Error bounds for nonnegative dynamic models, Computation of optimal policies in discounted semi-Markov decision chains, Stochastic approximations of constrained discounted Markov decision processes, Suboptimal policy determination for large-scale Markov decision processes. I: Description and bounds, A simulation-based approach to stochastic dynamic programming, Estimating equilibrium probabilities for band diagonal Markov chains using aggregation and disaggregation techniques, A unified view of aggregation and coherency in networks and Markov chains†, Finite state approximation algorithms for average cost denumerable state Markov decision processes, Approximations and bounds for a generalized optimal stopping problem, Algorithmic aspects of mean-variance optimization in Markov decision processes

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:4175068&oldid=18001892"