scientific article; zbMATH DE number 3320878
From MaRDI portal
Publication:5599448
zbMath0202.18401MaRDI QIDQ5599448
Publication date: 1970
Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.
Related Items (only showing first 100 items - show all)
Bounds for the regret loss in dynamic programming under adaptive control ⋮ Asymptotic properties of constrained Markov Decision Processes ⋮ Discounted Cost Markov Decision Processes with a Constraint ⋮ A stochastic decision model with vector-valued reward ⋮ Structural results for partially observed control models ⋮ Adaptive policy-iteration and policy-value-iteration for discounted Markov decision processes ⋮ Bounds for the approximation of dynamic programs ⋮ A class of procedures to compute the optimal value f unction in a Markovian decision problem ⋮ Increasing Lipschitz continuous maximizers of some dynamic programs ⋮ Recurrence conditions for Markov decision processes with Borel state space: A survey ⋮ Density estimation and adaptive control of Markov processes: Average and discounted criteria ⋮ A Mixed Value and Policy Iteration Method for Stochastic Control with Universally Measurable Policies ⋮ Markov--Nash Equilibria in Mean-Field Games with Discounted Cost ⋮ Dynamic CVAR with multi-period risk problems ⋮ Controlled Markov processes on the infinite planning horizon: Weighted and overtaking cost criteria ⋮ Constrained Markov control processes with randomized discounted cost criteria: infinite linear programming approach ⋮ On continuous dynamic programming with discrete time-parameter ⋮ Entscheidungsmodelle über angeordneten körpern ⋮ Sensitivitätsanalysen in entscheidungsmodellen ⋮ On a problem of optimal search ⋮ Markov decision processes with iterated coherent risk measures ⋮ On a Continuously Discounted Vector Valued Markov Decision Process ⋮ Unnamed Item ⋮ Stochastic scheduling problems I — General strategies ⋮ Estimation and control in discounted stochastic dynamic programming ⋮ Asymptotic optimality and rates of convergence of quantized stationary policies in continuous-time Markov decision processes ⋮ Partially observed discrete-time risk-sensitive mean field games ⋮ Arbitrary state semi-Markov decision processes ⋮ Approximate Nash Equilibria in Partially Observed Stochastic Games with Mean-Field Interactions ⋮ Unnamed Item ⋮ Some advances on constrained Markov decision processes in Borel spaces with random state-dependent discount factors ⋮ Finite-stage stochastic decision processes with recursive reward structure I: optimality equations and deterministic strategies ⋮ STRONG AVERAGE OPTIMALITY FOR CONTROLLED NONHOMOGENEOUS MARKOV CHAINS* ⋮ Conditions for optimality in dynamic programming and for the limit of n-stage optimal policies to be optimal ⋮ Optimal Control of Partially Observable Piecewise Deterministic Markov Processes ⋮ Minimizing spectral risk measures applied to Markov decision processes ⋮ Q-Learning for Distributionally Robust Markov Decision Processes ⋮ Optimal replacement under additive damage in randomly varying environments ⋮ Markov control processes with randomized discounted cost ⋮ Unnamed Item ⋮ On Markov policies for minimax decision processes ⋮ Markov decision processes associated with two threshold probability criteria ⋮ Solution procedures for multi-objective markov decision processes ⋮ Gradient-projection and policy-iteration methods for solving optimization problems in STEOR networks ⋮ On the stability of a dynamic stochastic production and inventory system controlled by an optimal policy ⋮ An analysis of transient Markov decision processes ⋮ The bellman equation for vector-valued semi-markovian dyanmic programiing ⋮ Unnamed Item ⋮ On the optimality of (z, Z)-order-policies in adaptive inventory control ⋮ Unnamed Item ⋮ Recursive adaptive control of Markov decision processes with the average reward criterion ⋮ Average cost optimal policies for Markov control processes with Borel state space and unbounded costs ⋮ Optimal inventory policies when the demand distribution is not known ⋮ Nonstationary stochastic gold-mining: A time-sequential tactical-allocation problem ⋮ Unnamed Item ⋮ Dynamic risk measures under model uncertainty ⋮ Denumerable controlled Markov chains with average reward criterion: Sample path optimality ⋮ Partially Observable Total-Cost Markov Decision Processes with Weakly Continuous Transition Probabilities ⋮ Credibilistic Markov decision processes: The average case ⋮ Solving a general discounted dynamic program by linear programming ⋮ On the convergence of successive approximations in dynamic programming with non-zero terminal reward ⋮ Unnamed Item ⋮ Unnamed Item ⋮ Optimal investment and consumption with stochastic dividends ⋮ Optimality of (s. S)—Policies in Statistical Inventory Control ⋮ Approximations of inventory models ⋮ Markov decision processes under ambiguity ⋮ Instationäre dynamische Optimierung bei schwachen Voraussetzungen über die Gewinnfunktionen ⋮ Unnamed Item ⋮ Learning and self-confirming long-run biases ⋮ Unnamed Item ⋮ Approximation Theorems for Zero-Sum Nonstationary Stochastic Games ⋮ THE TOTAL REPAIR COST IN A DEFECTIVE, COHERENT BINARY SYSTEM ⋮ Optimization of STEOR networks via Markov renewal programming ⋮ Preventive replacement for multi-parts systems ⋮ A mathematical framework for learning and adaption: (Generalized) random systems with complete connections ⋮ On an optimal harvesting problem ⋮ Semi-Markov decision processes with variance minimization criterion ⋮ Unnamed Item ⋮ Stochastic scheduling problems II-set strategies- ⋮ Approximations and bounds for a generalized optimal stopping problem ⋮ CHARACTERIZATIONS OF OPTIMAL POLICIES IN A GENERAL STOPPING PROBLEM AND STABILITY ESTIMATING ⋮ Partially observable Markov decision processes with partially observable random discount factors ⋮ Semicontinuous nonstationary stochastic games ⋮ On \(\epsilon\)-optimal continuous selectors and their application in discounted dynamic programming ⋮ A polynomial time bound for Howard's policy improvement algorithm ⋮ Finite-state approximations for denumerable multidimensional state discounted Markov decision processes ⋮ Risk measurement and risk-averse control of partially observable discrete-time Markov systems ⋮ Fixed point theorems for discounted finite Markov decision processes ⋮ On Nash equilibrium solutions in nonzero-sum stochastic games with complete information ⋮ Utility, probabilistic constraints, mean and variance of discounted rewards in Markov decision processes ⋮ Sufficient conditions for optimality of a \((z,c^ -,c^ +)\)-sampling plan in multistage Bayesian acceptance sampling ⋮ Vector-valued Markov decision processes and the systems of linear inequalities ⋮ A unified approach to adaptive control of average reward Markov decision processes ⋮ Conditions for the solvability of the linear programming formulation for constrained discounted Markov decision processes ⋮ Adaptive policies for discrete-time stochastic control systems with unknown disturbance distribution ⋮ Continuous dependence of stochastic control models on the noise distribution ⋮ A fuzzy approach to Markov decision processes with uncertain transition probabilities ⋮ Necessary and sufficient conditions for a bounded solution to the optimality equation in average reward Markov decision chains ⋮ A consumption and investment problem via a Markov decision processes approach with random horizon
This page was built for publication: