scientific article; zbMATH DE number 5685899
From MaRDI portal
Publication:5305630
zbMath1184.90170MaRDI QIDQ5305630
Publication date: 22 March 2010
Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.
Markov and semi-Markov decision processes (90C40) Research exposition (monographs, survey articles) pertaining to operations research and mathematical programming (90-02)
Related Items
Bayesian adaptive bandit-based designs using the Gittins index for multi-armed trials with normally distributed endpoints ⋮ Unnamed Item ⋮ Markov decision processes for infinite horizon problems solved with the cosine simplex method ⋮ An Incremental Fast Policy Search Using a Single Sample Path ⋮ Unnamed Item ⋮ Unnamed Item ⋮ A forwards induction approach to candidate drug selection ⋮ Experimental Design for Partially Observed Markov Decision Processes ⋮ Unnamed Item ⋮ An Approach for Determining Stationary Equilibria in a Single-Controller Average Stochastic Game ⋮ Unnamed Item ⋮ Average Cost Brownian Drift Control with Proportional Changeover Costs ⋮ Stationary policies for lower bounds on the minimum average cost of discrete-time nonlinear control systems ⋮ Computing Behavioral Relations for Probabilistic Concurrent Systems ⋮ Multi-hop sensor network scheduling for optimal remote estimation ⋮ A Q-Learning Algorithm for Discrete-Time Linear-Quadratic Control with Random Parameters of Unknown Distribution: Convergence and Stabilization ⋮ Effective Scenarios in Multistage Distributionally Robust Optimization with a Focus on Total Variation Distance ⋮ Unnamed Item ⋮ A queueing model for customer rescheduling and no-shows in service systems ⋮ A numerical study of Markov decision process algorithms for multi-component replacement problems ⋮ Unnamed Item ⋮ Another set of verifiable conditions for average Markov decision processes with Borel spaces ⋮ An exponential lower bound for Zadeh's pivot rule ⋮ A scalable anticipatory policy for the dynamic pickup and delivery problem ⋮ Off-line approximate dynamic programming for the vehicle routing problem with a highly variable customer basis and stochastic demands ⋮ OPTIMAL CONTROL OF A TWO-SERVER QUEUEING SYSTEM WITH FAILURES ⋮ Formalization of methods for the development of autonomous artificial intelligence systems ⋮ Optimal policies for stochastic clearing systems with time‐dependent delay penalties ⋮ Block Policy Mirror Descent ⋮ Model checking differentially private properties ⋮ A unified algorithm framework for mean-variance optimization in discounted Markov decision processes ⋮ Smoothing policies and safe policy gradients ⋮ A dynamic analytic method for risk-aware controlled martingale problems ⋮ A specification logic for programs in the probabilistic guarded command language ⋮ Task allocation and on-the-job training ⋮ Unnamed Item ⋮ A framework to measure the robustness of programs in the unpredictable environment ⋮ OPTIMIZATION OF OVERFLOW POLICIES IN CALL CENTERS ⋮ Optimal Routing of Fixed Size Jobs to Two Parallel Servers ⋮ Adaptive constraint satisfaction for Markov decision process congestion games: application to transportation networks ⋮ Average cost minimization in a multi-server retrial queueing system with a controllable reserve group of servers ⋮ Premium control with reinforcement learning ⋮ OPTIMALLY REPLACING MULTIPLE SYSTEMS IN A SHARED ENVIRONMENT ⋮ Distributionally Robust Partially Observable Markov Decision Process with Moment-Based Ambiguity ⋮ On the Value Function of the M/G/1 FCFS and LCFS Queues ⋮ Learning-Based Mean-Payoff Optimization in an Unknown MDP under Omega-Regular Constraints ⋮ Unnamed Item ⋮ Unnamed Item ⋮ Iterative Improvement of Lower and Upper Bounds for Backward SDEs ⋮ Scheduling services in a queuing system with impatience and setup costs ⋮ Dynamic Pricing with a Poisson Bandit Model ⋮ Unnamed Item ⋮ Unnamed Item ⋮ Unnamed Item ⋮ A CTMDP-Based Exact Method for RCPSP with Uncertain Activity Durations and Rework ⋮ Unnamed Item ⋮ Optimal Kullback–Leibler approximation of Markov chains via nuclear norm regularisation ⋮ Dynamic Decision Making in Energy Systems with Storage and Renewable Energy Sources ⋮ Minimising average passenger waiting time in personal rapid transit systems ⋮ Fast value iteration: an application of Legendre-Fenchel duality to a class of deterministic dynamic programming problems in discrete time ⋮ Unnamed Item ⋮ Characterization of the Optimal Risk-Sensitive Average Cost in Denumerable Markov Decision Chains ⋮ Unnamed Item ⋮ A Survey of Bidding Games on Graphs (Invited Paper) ⋮ A perturbation approach to approximate value iteration for average cost Markov decision processes with Borel spaces and bounded costs ⋮ <html> Nash ε-equilibria for stochastic games with total reward functions: an approach through Markov decision processes</html> ⋮ A Continuous-Time Markov Decision Process for Infrastructure Surveillance ⋮ To wait or not to wait: Optimal ordering under lead time uncertainty and forecast updating ⋮ Repeated Sequential Prisoner's Dilemma: The Stackleberg Variant ⋮ Solving the drift control problem ⋮ Synchronization and control in intrinsic and designed computation: An information-theoretic analysis of competing models of stochastic computation ⋮ Empirical Q-Value Iteration ⋮ A Convex Analytic Approach to Risk-Aware Markov Decision Processes ⋮ Concurrent MDPs with Finite Markovian Policies ⋮ Multiply Accelerated Value Iteration for NonSymmetric Affine Fixed Point Problems and Application to Markov Decision Processes ⋮ Unnamed Item ⋮ On Nash Equilibria in Stochastic Positional Games with Average Payoffs ⋮ Unnamed Item ⋮ Multiple stopping time POMDPs: structural results \& application in interactive advertising on social media ⋮ Demand seasonality in retail inventory management ⋮ Robust decomposable Markov decision processes motivated by allocating school budgets ⋮ Solving dynamic public insurance games with endogenous agent distributions: theory and computational approximation ⋮ Revenue management for operations with urgent orders ⋮ Parametric replenishment policies for inventory systems with lost sales and fixed order cost ⋮ Optimal sensor scheduling for multiple linear dynamical systems ⋮ Optimal inventory management using retail prepacks ⋮ Fully probabilistic design of strategies with estimator ⋮ Comparing strategies to prevent stroke and ischemic heart disease in the Tunisian population: Markov modeling approach using a comprehensive sensitivity analysis algorithm ⋮ Solving stochastic resource-constrained project scheduling problems by closed-loop approximate dynamic programming ⋮ A two-state partially observable Markov decision process with three actions ⋮ Continue, quit, restart probability model ⋮ Perspectives of approximate dynamic programming ⋮ Optimal decisions for continuous time Markov decision processes over finite planning horizons ⋮ On transition matrices of Markov chains corresponding to Hamiltonian cycles ⋮ A model for equilibrium in some service-provider user-set interactions ⋮ Heuristic decision rules for short-term trading of renewable energy with co-located energy storage ⋮ Game theoretic interaction and decision: a quantum analysis ⋮ A stochastic game approach to the security issue of networked control systems under jamming attacks ⋮ Scheduling of multi-class multi-server queueing systems with abandonments ⋮ Frameworks and results in distributionally robust optimization ⋮ Robust optimal strategies in Markov decision problems ⋮ A non-penalty recurrent neural network for solving a class of constrained optimization problems ⋮ A multi-objective approach for PH-graphs with applications to stochastic shortest paths ⋮ On the computation of Whittle's index for Markovian restless bandits ⋮ Optimal supervisory control with mean payoff objectives and under partial observation ⋮ When are emptiness and containment decidable for probabilistic automata? ⋮ Bidding mechanisms in graph games ⋮ Stochastic reachability of a target tube: theory and computation ⋮ Efficient incremental planning and learning with multi-valued decision diagrams ⋮ Solving generic nonarchimedean semidefinite programs using stochastic game algorithms ⋮ Lost-sales inventory systems with a service level criterion ⋮ Improved utilization for joint HCCA-EDCA access in IEEE 802.11e WLANs ⋮ Infinite-duration poorman-bidding games ⋮ On budget balance of the dynamic pivot mechanism ⋮ Fuzzy Markovian decision processes: application to queueing systems ⋮ Bayesian optimistic Kullback-Leibler exploration ⋮ Optimal dynamic resource allocation to prevent defaults ⋮ Space-efficient scheduling of stochastically generated tasks ⋮ Renewable resource management with stochastic recharge and environmental threats ⋮ Offline reinforcement learning with task hierarchies ⋮ Integrating stochastic reasoning into Event-B development ⋮ Preference-based reinforcement learning: a formal framework and a policy iteration algorithm ⋮ On discounted dynamic programming with unbounded returns ⋮ Discounted dynamic programming with unbounded returns: application to economic models ⋮ A mean field approach for optimization in discrete time ⋮ The value function of an infinite-horizon single-item lot-sizing problem ⋮ Approximate dynamic programming for capacity allocation in the service industry ⋮ Optimal denial-of-service attack energy management against state estimation over an SINR-based network ⋮ Computational bounds for elevator control policies by large scale linear programming ⋮ Dynamic speed scaling minimizing expected energy consumption for real-time tasks ⋮ A necessary condition for Nash equilibrium in two-person zero-sum constrained stochastic games ⋮ A survey on skill-based routing with applications to service operations management ⋮ A stochastic approach to optimize Maritime pine (\textit{Pinus pinaster} Ait.) stand management scheduling under fire risk. An application in Portugal ⋮ Sensitivity-based nested partitions for solving finite-horizon Markov decision processes ⋮ Distributed adaptive dynamic programming for data-driven optimal control ⋮ Computation of weighted sums of rewards for concurrent MDPs ⋮ On the hardness of analyzing probabilistic programs ⋮ An approximate dynamic programming approach for sequential pig marketing decisions at herd level ⋮ A hybrid simulation-optimization algorithm for the Hamiltonian cycle problem ⋮ Optimal strategies for a fishery model applied to utility functions ⋮ Identifying proactive ICU patient admission, transfer and diversion policies in a public-private hospital network ⋮ Determining the optimal strategies for zero-sum average stochastic positional games ⋮ Attack allocation on remote state estimation in multi-systems: structural results and asymptotic solution ⋮ A policy iteration algorithm for the American put option and free boundary control problems ⋮ Model-based testing of probabilistic systems ⋮ An approximate dynamic programming approach to project scheduling with uncertain resource availabilities ⋮ A nested family of \(k\)-total effective rewards for positional games ⋮ Customizing exponential semi-Markov decision processes under the discounted cost criterion ⋮ Dynamic expediting of an urgent order with uncertain progress ⋮ Cooperation dynamics in repeated games of adverse selection ⋮ Production and availability policies through the Markov decision process and myopic methods for contractual and selective orders ⋮ A general approach for population games with application to vaccination ⋮ Determining the optimal strategies for discrete control problems on stochastic networks with discounted costs ⋮ Providing radiology health care services to stochastic demand of different customer classes ⋮ Light robustness in the optimization of Markov decision processes with uncertain parameters ⋮ Dynamic pricing in a production system with multiple demand classes ⋮ Sell or store? An ADP approach to marketing renewable energy ⋮ Erlang loss bounds for OT-ICU systems ⋮ An intelligent packet loss control heuristic for connectionless real-time voice communication ⋮ On infinite horizon active fault diagnosis for a class of non-linear non-Gaussian systems ⋮ Dual-based methods for solving infinite-horizon nonstationary deterministic dynamic programs ⋮ Markov decision processes with quasi-hyperbolic discounting ⋮ Optimal control in dynamic food supply chains using big data ⋮ What foreclosed homes should a municipality purchase to stabilize vulnerable neighborhoods? ⋮ Applications of stochastic modeling in air traffic management: methods, challenges and opportunities for solving air traffic problems under uncertainty ⋮ A stochastic dynamic pricing model for the multiclass problems in the airline industry ⋮ Stochastic dynamic programming model for optimal resource allocation in vehicular ad hoc networks ⋮ Analysis of customer lifetime value and marketing expenditure decisions through a Markovian-based model ⋮ Inferring expected runtimes of probabilistic integer programs using expected sizes ⋮ Condition-dependent mate choice: a stochastic dynamic programming approach ⋮ Asymptotically optimal index policies for an abandonment queue with convex holding cost ⋮ Equilibrium points and equilibrium sets of some \(GI /M/1\) queues ⋮ A pseudo-linear time algorithm for the optimal discrete speed minimizing energy consumption ⋮ The operator approach to entropy games ⋮ Optimal dynamic mining policy of blockchain selfish mining through sensitivity-based optimization ⋮ From reinforcement learning to optimal control: a unified framework for sequential decisions ⋮ Time and inventory dependent optimal maintenance policies for single machine workstations: an MDP approach ⋮ How adaptive and reliable is your program?