Computation of weighted sums of rewards for concurrent MDPs
From MaRDI portal
Publication:1731592
DOI10.1007/s00186-018-0653-1zbMath1411.90353OpenAlexW2898656812WikidataQ129000532 ScholiaQ129000532MaRDI QIDQ1731592
Peter Buchholz, Dimitri Scheftelowitsch
Publication date: 13 March 2019
Published in: Mathematical Methods of Operations Research (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1007/s00186-018-0653-1
Multi-objective and goal programming (90C29) Optimality conditions and duality in mathematical programming (90C46)
Related Items (5)
State-Variable Modeling for a Class of Two-Stage Stochastic Optimization Problems ⋮ Partially observable multistage stochastic programming ⋮ Light robustness in the optimization of Markov decision processes with uncertain parameters ⋮ Policy-based branch-and-bound for infinite-horizon multi-model Markov decision processes ⋮ Concurrent MDPs with Finite Markovian Policies
Cites Work
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Planning and acting in partially observable stochastic domains
- A combinatorial strongly subexponential strategy improvement algorithm for mean payoff games
- Modeling methods and a branch and cut algorithm for pharmaceutical clinical trial planning using stochastic programming
- Bounded-parameter Markov decision processes
- Global optimization of MIQCPs with dynamic piecewise relaxations
- Robust control of the multi-armed bandit problem
- Mixed Integer Linear Programming Formulation Techniques
- A unified approach for different concepts of robustness and stochastic programming via non-linear scalarizing functionals
- Linear Programming Relaxations of Quadratically Constrained Quadratic Programs
- Multiple-environment Markov decision processes
- Technical Note—An Equivalence Between Continuous and Discrete Time Markov Decision Processes
- Scenarios and Policy Aggregation in Optimization Under Uncertainty
- Lectures on Stochastic Programming
- The Complexity of Markov Decision Processes
- Updating the Inverse of a Matrix
- Markov Decision Processes with Imprecise Transition Probabilities
- Markovian Decision Processes with Uncertain Transition Probabilities
- Robust Markov Decision Processes
- Multicriteria Optimization
- Robust Control of Markov Decision Processes with Uncertain Transition Matrices
- Amsaa: A Multistep Anticipatory Algorithm for Online Stochastic Combinatorial Optimization
- Robust Dynamic Programming
- Robust Product Line Design
- Markov decision processes
- Scenarios for multistage stochastic programs
- Handbook of Markov decision processes. Methods and applications
This page was built for publication: Computation of weighted sums of rewards for concurrent MDPs