Generalized polynomial approximations in Markovian decision processes

From MaRDI portal

Publication:1066821


DOI: 10.1016/0022-247X(85)90317-8, zbMath: 0578.90091, MaRDI QID: Q1066821

Abraham Seidmann, Paul J. Schweitzer

Publication date: 1985

Published in: Journal of Mathematical Analysis and Applications

zbMATH Keywords

least squares, value function, policy iteration, Markovian decision process, linear superposition

Mathematics Subject Classification ID

Numerical mathematical programming methods (65K05), Linear programming (90C05), Markov and semi-Markov decision processes (90C40)

Related Items

A review of stochastic algorithms with continuous value function approximation and some new approximate policy iteration algorithms for multidimensional continuous applications, Practical solution techniques for first-order MDPs, Approximate dynamic programming for stochastic linear control problems on compact state spaces, A methodology for computation reduction for specially structured large scale Markov decision problems, Approximate dynamic programming with state aggregation applied to UAV perimeter patrol, A unified DC programming framework and efficient DCA based approaches for large scale batch reinforcement learning, A Polyhedral Approach to Online Bipartite Matching, Approximate linear programming for networks: average cost bounds, Feature-based methods for large scale dynamic programming, Unnamed Item, Symmetric approximate linear programming for factored MDPs with application to constrained problems, Optimal recovery strategies for manufacturing systems, A strong Lagrangian relaxation for general discrete-choice network revenue management, State partitioning based linear program for stochastic dynamic programs: an invariance property, Reductions of non-separable approximate linear programs for network revenue management, Efficient compact linear programs for network revenue management, MF-OMO: An Optimization Formulation of Mean-Field Games, Computational bounds for elevator control policies by large scale linear programming, Hybrid least-squares algorithms for approximate policy evaluation, Using mathematical programming to solve factored Markov decision processes with imprecise probabilities, Approximate dynamic programming for stochastic \(N\)-stage optimization with application to optimal consumption under uncertainty, Optimal toll design: a lower bound framework for the asymmetric traveling salesman problem, Relationship between least squares Monte Carlo and approximate linear programming, Network-Based Approximate Linear Programming for Discrete Optimization, Identifying proactive ICU patient admission, transfer and diversion policies in a public-private hospital network, Relaxation Analysis for the Dynamic Knapsack Problem with Stochastic Item Sizes, A column and constraint generation algorithm for the dynamic knapsack problem with stochastic item sizes, Dynamic multi-appointment patient scheduling for radiation therapy, Value function approximation via linear programming for FMS scheduling, The actor-critic algorithm as multi-time-scale stochastic approximation., Reductions of Approximate Linear Programs for Network Revenue Management, Linear Programming and the Control of Diffusion Processes, Technical Note—A Note on Relaxations of the Choice Network Revenue Management Dynamic Program, A polyhedral approach to online bipartite matching, Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path, Computing Near-Optimal Policies in Generalized Joint Replenishment, Semi-Infinite Relaxations for the Dynamic Knapsack Problem with Stochastic Item Sizes, Data-driven optimal control with a relaxed linear program, Performance bounds and suboptimal policies for linear stochastic control via LMIs, Approximate dynamic programming via iterated Bellman inequalities, Unnamed Item

