Variance-Penalized Markov Decision Processes

From MaRDI portal
Publication:3832356

DOI10.1287/moor.14.1.147zbMath0676.90096OpenAlexW2057510529MaRDI QIDQ3832356

Huey-Miin Lee, Lodewijk C. M. Kallenberg, Jerzy A. Filar

Publication date: 1989

Published in: Mathematics of Operations Research (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1287/moor.14.1.147




Related Items (43)

Risk measurement and risk-averse control of partially observable discrete-time Markov systemsMarkov Decision Processes with Variance Minimization: A New Condition and ApproachA price-setting newsvendor problem under mean-variance criteriaAnalyzing operational risk-reward trade-offs for start-upsAugmenting Markov Cohort Analysis to Compute (Co)Variances: Implications for Strength of Cost-EffectivenessTrading performance for stability in Markov decision processesMarkov Decision Problems Where Means Bound VariancesFinite-horizon variance penalised Markov decision processesMulti-objective discounted Markov decision processes with expectation and variance criteriaNon-homogeneous Markov decision processes with a constraintSurvey of linear programming for standard and nonstandard Markovian control problems. Part I: TheoryMean-variance problems for finite horizon semi-Markov decision processesRisk-Sensitive Reinforcement Learning via Policy Gradient SearchVariance-constrained actor-critic algorithms for discounted and average reward MDPsA unified algorithm framework for mean-variance optimization in discounted Markov decision processesApproximate solutions to constrained risk-sensitive Markov decision processesUnnamed ItemVariance-penalized response-adaptive randomization with mismeasurementNotes on variance in randomized reward Markov decision processesComputational approaches to variance-penalised Markov decision processesRisk-Constrained Reinforcement Learning with Percentile Risk CriteriaVariance-penalized Markov decision processes: dynamic programming and reinforcement learning techniquesTime consistent dynamic risk measuresOn the total reward variance for continuous-time Markov reward chainsUnnamed ItemA risk-sensitive approach to total productive maintenanceA Sensitivity‐Based Construction Approach to Variance Minimization of Markov Decision ProcessesMean-Variance Analysis in Infinite Horizon Non-Discounted Markov Decision Processes: Technical NoteEfficient algorithms for risk-sensitive Markov decision processes with limited budgetStochastic optimization of forward recursive functionsOptimal policy for minimizing risk models in Markov decision processesA Convex Analytic Approach to Risk-Aware Markov Decision ProcessesSemi-Markov decision processes with variance minimization criterionMean-variance criteria in an undiscounted Markov decision processA mathematical programming approach to a problem in variance penalised Markov decision processesMean-Semivariance Policy Optimization via Risk-Averse Reinforcement LearningSolution strategies for variance minimization problemsVariance-minimization of Markov control processes with pathwise constraintsOn mean reward variance in semi-Markov processesComputational Methods for Risk-Averse Undiscounted Transient Markov ModelsAlgorithmic aspects of mean-variance optimization in Markov decision processesProcess-based risk measures and risk-averse control of discrete-time systemsNotes on average Markov decision processes with a minimum-variance criterion




This page was built for publication: Variance-Penalized Markov Decision Processes