scientific article; zbMATH DE number 837313
From MaRDI portal
Publication:4863593
zbMath: 0840.93001; MaRDI QID: Q4863593
Onésimo Hernández-Lerma, Jean-Bernard Lasserre
Publication date: 23 January 1996
Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.
dynamic programming; optimal policies; average cost; discounted cost; discrete-time Markov control processes
Optimal stochastic control (93E20); Markov and semi-Markov decision processes (90C40); Introductory exposition (textbooks, tutorial papers, etc.) pertaining to systems and control theory (93-01)
Related Items (91)
Upper and lower values in zero-sum stochastic games with asymmetric information ⋮ An Algorithm to Construct Subsolutions of Convex Optimal Control Problems ⋮ Computable approximations for continuous-time Markov decision processes on Borel spaces based on empirical measures ⋮ Bayesian estimation of the mean holding time in average semi-Markov control processes ⋮ The Lagrange approach to infinite linear programs ⋮ Discrete-time control for systems of interacting objects with unknown random disturbance distributions: a mean field approach ⋮ APPROXIMATE DYNAMIC PROGRAMMING TECHNIQUES FOR THE CONTROL OF TIME-VARYING QUEUING SYSTEMS APPLIED TO CALL CENTERS WITH ABANDONMENTS AND RETRIALS ⋮ Robustness to Incorrect Priors and Controlled Filter Stability in Partially Observed Stochastic Control ⋮ Stationary Markov Nash Equilibria for Nonzero-Sum Constrained ARAT Markov Games ⋮ Constrained continuous-time Markov decision processes on the finite horizon ⋮ Constrained Markov control processes with randomized discounted cost criteria: infinite linear programming approach ⋮ Local Poisson equations associated with discrete-time Markov control processes ⋮ Constrained Markov decision processes with first passage criteria ⋮ Approximations and Optimal Control for State-Dependent Limited Processor Sharing Queues ⋮ Stationary policies for lower bounds on the minimum average cost of discrete-time nonlinear control systems ⋮ Dynamic admission and service rate control of a queue ⋮ Variance minimization for constrained discounted continuous-time MDPs with exponentially distributed stopping times ⋮ The policy iteration algorithm for average continuous control of piecewise deterministic Markov processes ⋮ Risk-sensitive semi-Markov decision problems with discounted cost and general utilities ⋮ Two-person zero-sum stochastic games with varying discount factors ⋮ Markov Decision Processes with Incomplete Information and Semiuniform Feller Transition Probabilities ⋮ On Stochastic Stability of a Class of non-Markovian Processes and Applications in Quantization ⋮
Reduction of total-cost and average-cost MDPs with weakly continuous transition probabilities to discounted MDPs ⋮ The risk probability criterion for discounted continuous-time Markov decision processes ⋮ Partially observed discrete-time risk-sensitive mean field games ⋮ Minimizing Ruin Probabilities by Reinsurance and Investment: A Markovian Decision Approach ⋮ Short Communication: Existence of Markov Equilibrium Control in Discrete Time ⋮ Optimal sensor scheduling for remote state estimation with limited bandwidth: a deep reinforcement learning approach ⋮ First passage models for denumerable semi-Markov decision processes with nonnegative discounted costs ⋮ Bias and Overtaking Optimality for Continuous-Time Jump Markov Decision Processes in Polish Spaces ⋮ New average optimality conditions for semi-Markov decision processes in Borel spaces ⋮ A version of the Euler equation in discounted Markov decision processes ⋮ Average control of Markov decision processes with Feller transition probabilities and general action spaces ⋮ A risk minimization problem for finite horizon semi-Markov decision processes with loss rates ⋮ Semi-Markov control processes with unknown holding times distribution under an average cost criterion ⋮ A Universal Dynamic Program and Refined Existence Results for Decentralized Stochastic Control ⋮ Computing Controlled Invariant Sets from Data Using Convex Optimization ⋮ Average Cost Optimality Inequality for Markov Decision Processes with Borel Spaces and Universally Measurable Policies ⋮ A Convex Programming Approach for Discrete-Time Markov Decision Processes under the Expected Total Reward Criterion ⋮ The role of information in system stability with partially observable servers ⋮ Min-max and robust polynomial optimization ⋮ Discounted continuous-time constrained Markov decision processes in Polish spaces ⋮ Optimal strategies for a fishery model applied to utility functions ⋮
Zero-sum Markov games with random state-actions-dependent discount factors: existence of optimal strategies ⋮ First Passage Exponential Optimality Problem for Semi-Markov Decision Processes ⋮ Constrained Markov decision processes in Borel spaces: from discounted to average optimality ⋮ Unnamed Item ⋮ Markov decision processes associated with two threshold probability criteria ⋮ The discounted method and equivalence of average criteria for risk-sensitive Markov decision processes on Borel spaces ⋮ An excursion-theoretic approach to stability of discrete-time stochastic hybrid systems ⋮ Semi-Markov control models with partially known holding times distribution: discounted and average criteria ⋮ Singular perturbation for the discounted continuous control of piecewise deterministic Markov processes ⋮ Maximizing the probability of attaining a target prior to extinction ⋮ Dual-based methods for solving infinite-horizon nonstationary deterministic dynamic programs ⋮ Unnamed Item ⋮ Convex computation of extremal invariant measures of nonlinear dynamical systems and Markov processes ⋮ Markov decision processes with quasi-hyperbolic discounting ⋮ Finite-horizon optimality for continuous-time Markov decision processes with unbounded transition rates ⋮ Risk-sensitive average equilibria for discrete-time stochastic games ⋮ A survey of recent results on continuous-time Markov decision processes (with comments and rejoinder) ⋮ Nonzero-Sum Expected Average Discrete-Time Stochastic Games: The Case of Uncountable Spaces ⋮ Stackelberg equilibrium in a dynamic stimulation model with complete information ⋮ Rationally Inattentive Control of Markov Processes ⋮ Solutions of semi-Markov control models with recursive discount rates and approximation by $\epsilon$-optimal policies ⋮ Ergodic Control-Coding Capacity of Stochastic Control Systems: Information Signalling and Hierarchical Optimality of Gaussian Systems ⋮ Unnamed Item ⋮ Constrained and Unconstrained Optimal Discounted Control of Piecewise Deterministic Markov Processes ⋮
Computing Near-Optimal Policies in Generalized Joint Replenishment ⋮ Discrete-time average-cost mean-field games on Polish spaces ⋮ On the Minimum Pair Approach for Average Cost Markov Decision Processes with Countable Discrete Action Spaces and Strictly Unbounded Costs ⋮ Unnamed Item ⋮ Two person zero-sum semi-Markov games with unknown holding times distribution on one side: A discounted payoff criterion ⋮ A perturbation approach to approximate value iteration for average cost Markov decision processes with Borel spaces and bounded costs ⋮ Nash ε-equilibria for stochastic games with total reward functions: an approach through Markov decision processes ⋮ Optimal control and performance analysis of an $M^X/M/1$ queue with batches of negative customers ⋮ Performance bounds and suboptimal policies for linear stochastic control via LMIs ⋮ Unnamed Item ⋮ Unnamed Item ⋮ OPTIMAL MIXING OF MARKOV DECISION RULES FOR MDP CONTROL ⋮ Bellman equations for scalar linear convex stochastic control problems ⋮ Average optimality for Markov decision processes in Borel spaces: a new condition and approach ⋮ Nowak's Theorem on Probability Measures Induced by Strategies Revisited ⋮ On the Existence of Optimal Policies for a Class of Static and Sequential Dynamic Teams ⋮ Dynamic Programming Subject to Total Variation Distance Ambiguity ⋮ On the reduction of total‐cost and average‐cost MDPs to discounted MDPs ⋮ Application of average dynamic programming to inventory systems ⋮ On the First Passage $g$-Mean-Variance Optimality for Discounted Continuous-Time Markov Decision Processes ⋮ A Convex Analytic Approach to Risk-Aware Markov Decision Processes ⋮ Zero-sum semi-Markov games with state-action-dependent discount factors ⋮ A Moreau-Yosida regularization for Markov decision processes ⋮ Unnamed Item