scientific article; zbMATH DE number 5685899

From MaRDI portal

Publication:5305630

zbMath: 1184.90170; MaRDI QID: Q5305630

Martin L. Puterman

Publication date: 22 March 2010


Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.



Related Items

Bayesian adaptive bandit-based designs using the Gittins index for multi-armed trials with normally distributed endpoints
Unnamed Item
Markov decision processes for infinite horizon problems solved with the cosine simplex method
An Incremental Fast Policy Search Using a Single Sample Path
Unnamed Item
Unnamed Item
A forwards induction approach to candidate drug selection
Experimental Design for Partially Observed Markov Decision Processes
Unnamed Item
An Approach for Determining Stationary Equilibria in a Single-Controller Average Stochastic Game
Unnamed Item
Average Cost Brownian Drift Control with Proportional Changeover Costs
Stationary policies for lower bounds on the minimum average cost of discrete-time nonlinear control systems
Computing Behavioral Relations for Probabilistic Concurrent Systems
Multi-hop sensor network scheduling for optimal remote estimation
A Q-Learning Algorithm for Discrete-Time Linear-Quadratic Control with Random Parameters of Unknown Distribution: Convergence and Stabilization
Effective Scenarios in Multistage Distributionally Robust Optimization with a Focus on Total Variation Distance
Unnamed Item
A queueing model for customer rescheduling and no-shows in service systems
A numerical study of Markov decision process algorithms for multi-component replacement problems
Unnamed Item
Another set of verifiable conditions for average Markov decision processes with Borel spaces
An exponential lower bound for Zadeh's pivot rule
A scalable anticipatory policy for the dynamic pickup and delivery problem
Off-line approximate dynamic programming for the vehicle routing problem with a highly variable customer basis and stochastic demands
OPTIMAL CONTROL OF A TWO-SERVER QUEUEING SYSTEM WITH FAILURES
Formalization of methods for the development of autonomous artificial intelligence systems
Optimal policies for stochastic clearing systems with time‐dependent delay penalties
Block Policy Mirror Descent
Model checking differentially private properties
A unified algorithm framework for mean-variance optimization in discounted Markov decision processes
Smoothing policies and safe policy gradients
A dynamic analytic method for risk-aware controlled martingale problems
A specification logic for programs in the probabilistic guarded command language
Task allocation and on-the-job training
Unnamed Item
A framework to measure the robustness of programs in the unpredictable environment
OPTIMIZATION OF OVERFLOW POLICIES IN CALL CENTERS
Optimal Routing of Fixed Size Jobs to Two Parallel Servers
Adaptive constraint satisfaction for Markov decision process congestion games: application to transportation networks
Average cost minimization in a multi-server retrial queueing system with a controllable reserve group of servers
Premium control with reinforcement learning
OPTIMALLY REPLACING MULTIPLE SYSTEMS IN A SHARED ENVIRONMENT
Distributionally Robust Partially Observable Markov Decision Process with Moment-Based Ambiguity
On the Value Function of the M/G/1 FCFS and LCFS Queues
Learning-Based Mean-Payoff Optimization in an Unknown MDP under Omega-Regular Constraints
Unnamed Item
Unnamed Item
Iterative Improvement of Lower and Upper Bounds for Backward SDEs
Scheduling services in a queuing system with impatience and setup costs
Dynamic Pricing with a Poisson Bandit Model
Unnamed Item
Unnamed Item
Unnamed Item
A CTMDP-Based Exact Method for RCPSP with Uncertain Activity Durations and Rework
Unnamed Item
Optimal Kullback–Leibler approximation of Markov chains via nuclear norm regularisation
Dynamic Decision Making in Energy Systems with Storage and Renewable Energy Sources
Minimising average passenger waiting time in personal rapid transit systems
Fast value iteration: an application of Legendre-Fenchel duality to a class of deterministic dynamic programming problems in discrete time
Unnamed Item
Characterization of the Optimal Risk-Sensitive Average Cost in Denumerable Markov Decision Chains
Unnamed Item
A Survey of Bidding Games on Graphs (Invited Paper)
A perturbation approach to approximate value iteration for average cost Markov decision processes with Borel spaces and bounded costs
Nash ε-equilibria for stochastic games with total reward functions: an approach through Markov decision processes
A Continuous-Time Markov Decision Process for Infrastructure Surveillance
To wait or not to wait: Optimal ordering under lead time uncertainty and forecast updating
Repeated Sequential Prisoner's Dilemma: The Stackleberg Variant
Solving the drift control problem
Synchronization and control in intrinsic and designed computation: An information-theoretic analysis of competing models of stochastic computation
Empirical Q-Value Iteration
A Convex Analytic Approach to Risk-Aware Markov Decision Processes
Concurrent MDPs with Finite Markovian Policies
Multiply Accelerated Value Iteration for NonSymmetric Affine Fixed Point Problems and Application to Markov Decision Processes
Unnamed Item
On Nash Equilibria in Stochastic Positional Games with Average Payoffs
Unnamed Item
Multiple stopping time POMDPs: structural results & application in interactive advertising on social media
Demand seasonality in retail inventory management
Robust decomposable Markov decision processes motivated by allocating school budgets
Solving dynamic public insurance games with endogenous agent distributions: theory and computational approximation
Revenue management for operations with urgent orders
Parametric replenishment policies for inventory systems with lost sales and fixed order cost
Optimal sensor scheduling for multiple linear dynamical systems
Optimal inventory management using retail prepacks
Fully probabilistic design of strategies with estimator
Comparing strategies to prevent stroke and ischemic heart disease in the Tunisian population: Markov modeling approach using a comprehensive sensitivity analysis algorithm
Solving stochastic resource-constrained project scheduling problems by closed-loop approximate dynamic programming
A two-state partially observable Markov decision process with three actions
Continue, quit, restart probability model
Perspectives of approximate dynamic programming
Optimal decisions for continuous time Markov decision processes over finite planning horizons
On transition matrices of Markov chains corresponding to Hamiltonian cycles
A model for equilibrium in some service-provider user-set interactions
Heuristic decision rules for short-term trading of renewable energy with co-located energy storage
Game theoretic interaction and decision: a quantum analysis
A stochastic game approach to the security issue of networked control systems under jamming attacks
Scheduling of multi-class multi-server queueing systems with abandonments
Frameworks and results in distributionally robust optimization
Robust optimal strategies in Markov decision problems
A non-penalty recurrent neural network for solving a class of constrained optimization problems
A multi-objective approach for PH-graphs with applications to stochastic shortest paths
On the computation of Whittle's index for Markovian restless bandits
Optimal supervisory control with mean payoff objectives and under partial observation
When are emptiness and containment decidable for probabilistic automata?
Bidding mechanisms in graph games
Stochastic reachability of a target tube: theory and computation
Efficient incremental planning and learning with multi-valued decision diagrams
Solving generic nonarchimedean semidefinite programs using stochastic game algorithms
Lost-sales inventory systems with a service level criterion
Improved utilization for joint HCCA-EDCA access in IEEE 802.11e WLANs
Infinite-duration poorman-bidding games
On budget balance of the dynamic pivot mechanism
Fuzzy Markovian decision processes: application to queueing systems
Bayesian optimistic Kullback-Leibler exploration
Optimal dynamic resource allocation to prevent defaults
Space-efficient scheduling of stochastically generated tasks
Renewable resource management with stochastic recharge and environmental threats
Offline reinforcement learning with task hierarchies
Integrating stochastic reasoning into Event-B development
Preference-based reinforcement learning: a formal framework and a policy iteration algorithm
On discounted dynamic programming with unbounded returns
Discounted dynamic programming with unbounded returns: application to economic models
A mean field approach for optimization in discrete time
The value function of an infinite-horizon single-item lot-sizing problem
Approximate dynamic programming for capacity allocation in the service industry
Optimal denial-of-service attack energy management against state estimation over an SINR-based network
Computational bounds for elevator control policies by large scale linear programming
Dynamic speed scaling minimizing expected energy consumption for real-time tasks
A necessary condition for Nash equilibrium in two-person zero-sum constrained stochastic games
A survey on skill-based routing with applications to service operations management
A stochastic approach to optimize Maritime pine (Pinus pinaster Ait.) stand management scheduling under fire risk. An application in Portugal
Sensitivity-based nested partitions for solving finite-horizon Markov decision processes
Distributed adaptive dynamic programming for data-driven optimal control
Computation of weighted sums of rewards for concurrent MDPs
On the hardness of analyzing probabilistic programs
An approximate dynamic programming approach for sequential pig marketing decisions at herd level
A hybrid simulation-optimization algorithm for the Hamiltonian cycle problem
Optimal strategies for a fishery model applied to utility functions
Identifying proactive ICU patient admission, transfer and diversion policies in a public-private hospital network
Determining the optimal strategies for zero-sum average stochastic positional games
Attack allocation on remote state estimation in multi-systems: structural results and asymptotic solution
A policy iteration algorithm for the American put option and free boundary control problems
Model-based testing of probabilistic systems
An approximate dynamic programming approach to project scheduling with uncertain resource availabilities
A nested family of \(k\)-total effective rewards for positional games
Customizing exponential semi-Markov decision processes under the discounted cost criterion
Dynamic expediting of an urgent order with uncertain progress
Cooperation dynamics in repeated games of adverse selection
Production and availability policies through the Markov decision process and myopic methods for contractual and selective orders
A general approach for population games with application to vaccination
Determining the optimal strategies for discrete control problems on stochastic networks with discounted costs
Providing radiology health care services to stochastic demand of different customer classes
Light robustness in the optimization of Markov decision processes with uncertain parameters
Dynamic pricing in a production system with multiple demand classes
Sell or store? An ADP approach to marketing renewable energy
Erlang loss bounds for OT-ICU systems
An intelligent packet loss control heuristic for connectionless real-time voice communication
On infinite horizon active fault diagnosis for a class of non-linear non-Gaussian systems
Dual-based methods for solving infinite-horizon nonstationary deterministic dynamic programs
Markov decision processes with quasi-hyperbolic discounting
Optimal control in dynamic food supply chains using big data
What foreclosed homes should a municipality purchase to stabilize vulnerable neighborhoods?
Applications of stochastic modeling in air traffic management: methods, challenges and opportunities for solving air traffic problems under uncertainty
A stochastic dynamic pricing model for the multiclass problems in the airline industry
Stochastic dynamic programming model for optimal resource allocation in vehicular ad hoc networks
Analysis of customer lifetime value and marketing expenditure decisions through a Markovian-based model
Inferring expected runtimes of probabilistic integer programs using expected sizes
Condition-dependent mate choice: a stochastic dynamic programming approach
Asymptotically optimal index policies for an abandonment queue with convex holding cost
Equilibrium points and equilibrium sets of some \(GI/M/1\) queues
A pseudo-linear time algorithm for the optimal discrete speed minimizing energy consumption
The operator approach to entropy games
Optimal dynamic mining policy of blockchain selfish mining through sensitivity-based optimization
From reinforcement learning to optimal control: a unified framework for sequential decisions
Time and inventory dependent optimal maintenance policies for single machine workstations: an MDP approach
How adaptive and reliable is your program?