scientific article; zbMATH DE number 5212058
From MaRDI portal
Publication:5425954
zbMath1130.93057MaRDI QIDQ5425954
Publication date: 15 November 2007
Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.
optimizationadaptive controlMarkov decision processeslearning theoryqueueing theoryoperations researchoptimization theory
Stochastic programming (90C15) Research exposition (monographs, survey articles) pertaining to systems and control theory (93-02) Stochastic learning and adaptive control (93E35) Research exposition (monographs, survey articles) pertaining to operations research and mathematical programming (90-02)
Related Items (max. 100)
Improving routing decisions in parallel non-observable queues ⋮ Approximate policy iteration: a survey and some new methods ⋮ Event-based optimization approach for solving stochastic decision problems with probabilistic constraint ⋮ AN IMPROVEMENT OF MARKOVIAN INTEGRATION BY PARTS FORMULA AND APPLICATION TO SENSITIVITY COMPUTATION ⋮ A Jackson network model and threshold policy for joint optimization of energy and delay in multi-hop wireless networks ⋮ Designing parsimonious scheduling policies for complex resource allocation systems through concurrency theory ⋮ Modeling and optimization control of a demand-driven, conveyor-serviced production station ⋮ New approximate dynamic programming algorithms for large-scale undiscounted Markov decision processes and their application to optimize a production and distribution system ⋮ Sensitivity-based optimization for blockchain selfish mining ⋮ Optimization of stock trading with additional information by limit order book ⋮ Optimization of Markov decision processes under the variance criterion ⋮ Time-optimal control of large-scale systems of systems using compositional optimization ⋮ Optimal energy-efficient policies for data centers through sensitivity-based optimization ⋮ Simulation-based optimization of Markov decision processes: an empirical process theory approach ⋮ Gradient estimation for smooth stopping criteria ⋮ The risk probability criterion for discounted continuous-time Markov decision processes ⋮ A unified algorithm framework for mean-variance optimization in discounted Markov decision processes ⋮ A neural network approach to performance analysis of tandem lines: the value of analytical knowledge ⋮ Mean-variance optimization of discrete time discounted Markov decision processes ⋮ Finding optimal memoryless policies of POMDPs under the expected average reward criterion ⋮ The optimal dynamic rationing policy in the stock-rationing queue ⋮ Optimal balanced control for call centers ⋮ Stochastic control via direct comparison ⋮ A complete algebraic solution to the optimal dynamic rationing policy in the stock-rationing queue with two demand classes ⋮ Performance optimization of queueing systems with perturbation realization ⋮ Completion-of-squares: revisited and extended ⋮ A perturbation analysis approach to phantom estimators for waiting times in the \(G/G/1\) queue ⋮ A tutorial on event-based optimization -- a new optimization framework ⋮ Event-based optimization of admission control in open queueing networks ⋮ Sensitivity-based nested partitions for solving finite-horizon Markov decision processes ⋮ Parameterized Markov decision process and its application to service rate control ⋮ An Overview for Markov Decision Processes in Queues and Networks ⋮ Online scheduling for outpatient services with heterogeneous patients and physicians ⋮ Variance minimization of parameterized Markov decision processes ⋮ Delay-optimal scheduling for two-hop relay networks with randomly varying connectivity: join the shortest queue-longest connected queue policy ⋮ A stochastic minimum principle and an adaptive pathwise algorithm for stochastic optimal control ⋮ Continuous-time Markov decision processes with \(n\)th-bias optimality criteria ⋮ Policy Gradient Approach of Event‐Based Optimization and Its Online Implementation ⋮ Gradient projection-based performance improvement for JLQ problems ⋮ Optimal risk probability for first passage models in semi-Markov decision processes ⋮ What you should know about simulation and derivatives ⋮ Coupling based estimation approaches for the average reward performance potential in Markov chains ⋮ Optimization in curbing risk contagion among financial institutes ⋮ Service rate control of closed Jackson networks from game theoretic perspective ⋮ Perturbation analysis of inhomogeneous finite Markov chains ⋮ Error bounds for augmented truncation approximations of Markov chains via the perturbation method ⋮ A Sensitivity‐Based Construction Approach to Variance Minimization of Markov Decision Processes ⋮ Optimal dynamic mining policy of blockchain selfish mining through sensitivity-based optimization ⋮ Performance optimization for a class of generalized stochastic Petri nets ⋮ Mean-Semivariance Policy Optimization via Risk-Averse Reinforcement Learning ⋮ A unified approach to time-aggregated Markov decision processes
This page was built for publication: