scientific article; zbMATH DE number 3320878
From MaRDI portal
Publication:5599448
zbMath: 0202.18401, MaRDI QID: Q5599448
Publication date: 1970
Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.
Related Items (only showing first 100 items)
Semicontinuous nonstationary stochastic games ⋮ On \(\epsilon\)-optimal continuous selectors and their application in discounted dynamic programming ⋮ A polynomial time bound for Howard's policy improvement algorithm ⋮ Finite-state approximations for denumerable multidimensional state discounted Markov decision processes ⋮ Risk measurement and risk-averse control of partially observable discrete-time Markov systems ⋮ Fixed point theorems for discounted finite Markov decision processes ⋮ On Nash equilibrium solutions in nonzero-sum stochastic games with complete information ⋮ Utility, probabilistic constraints, mean and variance of discounted rewards in Markov decision processes ⋮ Sufficient conditions for optimality of a \((z,c^-,c^+)\)-sampling plan in multistage Bayesian acceptance sampling ⋮ Vector-valued Markov decision processes and the systems of linear inequalities ⋮ A unified approach to adaptive control of average reward Markov decision processes ⋮ Conditions for the solvability of the linear programming formulation for constrained discounted Markov decision processes ⋮ Adaptive policies for discrete-time stochastic control systems with unknown disturbance distribution ⋮ Continuous dependence of stochastic control models on the noise distribution ⋮ A fuzzy approach to Markov decision processes with uncertain transition probabilities ⋮ Necessary and sufficient conditions for a bounded solution to the optimality equation in average reward Markov decision chains ⋮ A consumption and investment problem via a Markov decision processes approach with random horizon ⋮ Approximation of average cost optimal policies for general Markov decision processes with unbounded costs ⋮ The recursive approach to time inconsistency ⋮ Value iteration in average cost Markov control processes on Borel spaces ⋮ The Bellman's principle of optimality in the discounted dynamic programming ⋮ On compactness of the space of policies in stochastic dynamic programming ⋮
Nonparametric adaptive control of discrete-time partially observable stochastic systems ⋮ Discrete-time ergodic mean-field games with average reward on compact spaces ⋮ Robustness inequality for Markov control processes with unbounded costs ⋮ A pause control approach to the value iteration scheme in average Markov decision processes ⋮ A note on the convergence rate of the value iteration scheme in controlled Markov chains ⋮ \(C^3\) modeling with symmetrical rationality ⋮ Markov control models with unknown random state-action-dependent discount factors ⋮ Portfolio optimization under dynamic risk constraints: continuous vs. discrete time trading ⋮ The transformation method for continuous-time Markov decision processes ⋮ Markov decision processes with state-dependent discount factors and unbounded rewards/costs ⋮ Markov renewal decision processes with finite horizon ⋮ A remark on the connections between coding and dynamic programming ⋮ Conditions for characterizing the structure of optimal strategies in infinite-horizon dynamic programs ⋮ On the expected total reward with unbounded returns for Markov decision processes ⋮ Randomization and simplification in dynamic decision-making. ⋮ Optimal policies for constrained average-cost Markov decision processes ⋮ A natural extension of the MacQueen extrapolation ⋮ Some comments on preference order dynamic programming models ⋮ A dual approach to Bayesian inference and adaptive control ⋮ First-order sensitivity of the optimal value in a Markov decision model with respect to deviations in the transition probability function ⋮ Optimal policies in multiproduct inventory models ⋮ On two-state quality control under Markovian deterioration ⋮
Markov-achievable payoffs for finite-horizon decision models. ⋮ Markov decision processes on Borel spaces with total cost and random horizon ⋮ Recent results on conditions for the existence of average optimal stationary policies ⋮ Nonparametric estimation and adaptive control in a class of finite Markov decision chains ⋮ On variable discounting in dynamic programming: applications to resource extraction and other economic models ⋮ Average cost Markov decision processes: Optimality conditions ⋮ Estimation and control in multichain processes ⋮ Structured policies in the sequential design of experiments ⋮ Optimal dynamic load distribution in a class of flow-type flexible manufacturing systems ⋮ Convergence of probability measures and Markov decision models with incomplete information ⋮ Stochastic dynamic programming with non-linear discounting ⋮ Optimal stationary policies in the vector-valued Markov decision process ⋮ Equivalence of Lyapunov stability criteria in a class of Markov decision processes ⋮ Existence of optimal stationary policies in average reward Markov decision processes with a recurrent state ⋮ Evolution and market behavior ⋮ A stochastic interpretation of game logic ⋮ A dynamic multi-item two-activity problem ⋮
Markov-Entscheidungs-Prozesse mit abhängigen Aktionen für optimale Reparaturmaßnahmen bei unvollständiger Information. (Markov decision processes with dependent actions for optimal repair policies under incomplete information) ⋮ A limited order capacity stochastic inventory model with a fixed cost for order: The discounted case ⋮ A note on negative dynamic programming for risk-sensitive control ⋮ Continuous-time Markov decision processes with state-dependent discount factors ⋮ Controlled jump processes ⋮ On dynamic programming: Compactness of the space of policies ⋮ On stopped decision processes with discrete time parameter ⋮ Estimates for finite-stage dynamic programs ⋮ Finite-state, discrete-time optimization with randomly varying observation quality ⋮ Dynamic programming of expectation and variance ⋮ A selection theorem for optimization problems ⋮ Stochastische dynamische Optimierung als Spezialfall linearer Optimierung in halbgeordneten Vektorräumen ⋮ On a representation of measurable automaton transformations by stochastic automata ⋮ Discounted, positive, and noncooperative stochastic games ⋮ Kleisli morphisms and randomized congruences for the Giry monad ⋮ Zero-sum risk-sensitive stochastic games ⋮ Measurable selection theorems for optimization problems ⋮ On some aspects in stochastic dynamic programming with terminal region ⋮ Dynamic programming and principles of optimality ⋮
Semicontinuous nonstationary stochastic games. II ⋮ Constrained denumerable state non-stationary MDPs with expected total reward criterion ⋮ Optimal strategies for an inventory system with cost functions of general form ⋮ Conditional decision processes with recursive function ⋮ Dynamic mean-risk optimization in a binomial model ⋮ Zum Problem des zweiarmigen Bernoulli-Banditen mit einer bekannten Erfolgswahrscheinlichkeit und unendlich vielen Spielen ⋮ Monotonicity and the principle of optimality ⋮ On a stopping rule for a class of sequential decision problems ⋮ Adaptive control of discounted Markov decision chains ⋮ Stochastic control theory and operational research ⋮ Optimal research and development expenditures under an incremental tax incentive scheme ⋮ Controlled Markov set-chains under average criteria ⋮ Minimax control for discrete-time time-varying stochastic systems ⋮ On discounted dynamic programming with constraints ⋮ Bounds for the quality and the number of steps in Bellman's value iteration algorithm ⋮ Nonstationary value-iteration and adaptive control of discounted semi-Markov processes ⋮ Stability estimation of some Markov controlled processes ⋮ Existence of optimal policy for time non-homogeneous discounted Markovian decision programming ⋮ On essential information in sequential decision processes ⋮ Stationary policies and Markov policies in Borel dynamic programming