scientific article
From MaRDI portal
Publication:3999474
zbMath0725.93082MaRDI QIDQ3999474
Publication date: 17 September 1992
Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.
Adaptive control/observation systems (93C40) Markov chains (discrete-time Markov processes on discrete state spaces) (60J10) Research exposition (monographs, survey articles) pertaining to systems and control theory (93-02) Optimal stochastic control (93E20)
Related Items
Reinforcement learning, sequential Monte Carlo and the EM algorithm ⋮ Multiplicative ergodicity and large deviations for an irreducible Markov chain. ⋮ LP based upper and lower bounds for Cesàro and Abel limits of the optimal values in problems of control of stochastic discrete time systems ⋮ Markov Decision Processes with Variance Minimization: A New Condition and Approach ⋮ Optimization problems in chemical reactions using continuous-time Markov chains ⋮ Cumulative weighting optimization ⋮ Controlled Markov processes on the infinite planning horizon: Weighted and overtaking cost criteria ⋮ Constrained Semi-Markov decision processes with average rewards ⋮ Distributed computation of fixed points of \(\infty\)-nonexpansive maps ⋮ The policy iteration algorithm for average continuous control of piecewise deterministic Markov processes ⋮ Sample-Path Optimal Stationary Policies in Stable Markov Decision Chains with the Average Reward Criterion ⋮ The Linear Program approach in multi-chain Markov Decision Processes revisited ⋮ Extreme Occupation Measures in Markov Decision Processes with an Absorbing State ⋮ Approachability in Stackelberg stochastic games with vector costs ⋮ Exact finite approximations of average-cost countable Markov decision processes ⋮ The LP approach in average reward MDPs with multiple cost constraints: The countable state case ⋮ Another Set of Conditions for Strongn(n = −1, 0) Discount Optimality in Markov Decision Processes ⋮ Sample-path optimality and variance-maximization for Markov decision processes ⋮ Martingale limit theorem and its application to an ergodic controlled Markov chain ⋮ A note on the vanishing interest rate approach in average Markov decision chains with continuous and bounded costs ⋮ Markov control processes with randomized discounted cost ⋮ Quantitative model-checking of controlled discrete-time Markov processes ⋮ On strong average optimality of Markov decision processes with unbounded costs ⋮ Correction to: Ergodic and adaptive control of nearest-neighbor motions ⋮ An excursion-theoretic approach to stability of discrete-time stochastic hybrid systems ⋮ Compactness of the space of non-randomized policies in countable-state sequential decision processes ⋮ A further remark on dynamic programming for partially observed Markov processes ⋮ Stopped decision processes in conjunction with general utility ⋮ Maximizing the probability of attaining a target prior to extinction ⋮ Discounted Markov decision processes with utility constraints ⋮ Occupation measures in average cost Markov decision processes ⋮ On the Hamiltonicity Gap and doubly stochastic matrices ⋮ Controlled Markov chains with constraints. ⋮ Algorithms for optimization and stabilization of controlled Markov chains. ⋮ Another set of conditions for Markov decision processes with average sample-path costs ⋮ Ergodic control of reflected diffusions with jumps ⋮ Denumerable controlled Markov chains with average reward criterion: Sample path optimality ⋮ Nonzero-Sum Risk-Sensitive Stochastic Games on a Countable State Space ⋮ Average Reward Markov Decision Processes with Multiple Cost Constraints ⋮ Ergodic control of partially observed Markov chains ⋮ Controlled Sensing for Sequential Multihypothesis Testing with Controlled Markovian Observations and Non-Uniform Control Cost ⋮ On the General Utility of Discounted Markov Decision Processes ⋮ Sample complexity for Markov chain self-tuner ⋮ The Expected Total Cost Criterion for Markov Decision Processes under Constraints ⋮ Blackwell optimality in Markov decision processes with partial observation. ⋮ Empirical Q-Value Iteration ⋮ Whittle indexability in egalitarian processor sharing systems ⋮ “Controlled” Versions of the Collatz–Wielandt and Donsker–Varadhan Formulae ⋮ Constrained Markovian decision processes: The dynamic programming approach ⋮ Concentration of Contractive Stochastic Approximation and Reinforcement Learning ⋮ Whittle index based Q-learning for restless bandits with average reward ⋮ Strategic measures in optimal control problems for stochastic sequences
This page was built for publication: