scientific article

From MaRDI portal
Publication:3795523

zbMath0649.93001MaRDI QIDQ3795523

Dimitri P. Bertsekas

Publication date: 1987


Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.



Related Items

Optimal control of single-server queueing networksThe Expected Total Cost Criterion for Markov Decision Processes under Constraints: A Convex Analytic ApproachTemporal logics for the specification of performance and reliabilityUnnamed ItemUnnamed ItemA turnpike improvement algorithm for piecewise deterministic controlA unified DC programming framework and efficient DCA based approaches for large scale batch reinforcement learningStructural results for partially observed control modelsDiscrete reachability of hybrid systemsSolving H-horizon, stationary Markov decision problems in time proportional to log (H)On the average cost optimality equation and the structure of optimal policies for partially observable Markov decision processesOn the computation of the optimal cost function for discrete time Markov models with partial observationsApplication of Stochastic Dynamic Programming in Demand Dispatch-Based Optimal Operation of a MicrogridNear optimization of stochastic dynamic systems by decomposition and aggregationControlled Markov processes on the infinite planning horizon: Weighted and overtaking cost criteriaZero-sum stochastic games with unbounded costs: Discounted and average cost casesWhen control and state variations increase uncertainty: modeling and stochastic control in discrete timeOptimal feedback solution of a constrained stochastic one-storage modelError bounds for stochastic shortest path problemsUnnamed ItemReinforcement learning of non-Markov decision processesRegular Policies in Abstract Dynamic ProgrammingNow decision theorySolving an Infinite-Horizon Discounted Markov Decision Process by DC Programming and DCAA survey of average cost problems in deterministic discrete-time control systemsHeterogeneous expertise and collective decision-makingModel development with Maple in PhD-level management science courses: a personal accountAn approximate dynamic programming approach to resource management in multi-cloud scenariosAssessing Decisions on Multiple Uses of Water and Hydroelectric FacilitiesDynamic distributed clustering in wireless sensor networks via Voronoi tessellation controlOptimality of greedy and sustainable policies in the management of renewable resourcesA MODEL FOR THE OPTIMAL ASSET-LIABILITY MANAGEMENT FOR INSURANCE COMPANIESOptimal empty vehicle repositioning and fleet-sizing for two-depot service systemsAnalysis of supply contracts with commitments and flexibilityTurnpikes and computation of piecewise open-loop equilibria in stochastic differential gamesEquation‐free optimal switching policies for bistable reacting systemsPull-based broadcasting with timing constraintsTurnpikes and computation of piecewise open-loop equilibria in stochastic differential gamesOptimal Portfolio Selection with Transaction CostsAdaptive control of stochastic systems with unknown disturbance distribution: discounted criteriaLocal solutions to the Hamilton-Jacobi-Bellman equation in stochastic problems of optimal controlAn envelope theorem and some applications to discounted Markov decision processesOptimal pension funding dynamics over infinite control horizon when stochastic rates of return are stationaryOptimal asset--liability management with constraints: A dynamic programming approachMultiagent cooperative search for portfolio selectionMonitoring and control of anytime algorithms: A dynamic programming approachOn valuing appreciating human assets in servicesUnnamed ItemBuilding an Optimal Portfolio in Discrete Time in the Presence of Transaction CostsUnnamed ItemManaging the impact of high market growth and learning on knowledge worker productivity and service qualityTD(λ) learning without eligibility traces: a theoretical analysisModel-based learning of interaction strategies in multi-agent systemsRecent developments in single product, discrete-time, capacitated production-inventory systems.Approximating infinite horizon stochastic optimal control in discrete time with constraintsAsymptotically optimal controls of hybrid linear quadratic regulators in discrete time.2guaranteed cost control for uncertain discrete-time linear systemsAuctions for Resource Allocation in Overlay NetworksNonzero-sum stochastic games with unbounded costs: Discounted and average cost casesUndiscounted Markov decision chains with partial information; an algorithm for computing a locally optimal periodic policyDenumerable controlled Markov chains with average reward criterion: Sample path optimalityUnnamed ItemA perturbation approach to approximate value iteration for average cost Markov decision processes with Borel spaces and bounded costsOptimal control and performance analysis of anMX/M/1queue with batches of negative customersOn the Complexity of Value IterationControl of singularly perturbed Markov chains: A numerical studyWhen is a base stock policy optimal in recovering disrupted cyclic schedules?BRIDGE LANE DIRECTION SPECIFICATION FOR SUSTAINABLE TRAFFIC MANAGEMENTUnnamed ItemUnnamed ItemUnnamed ItemThe convergence of value iteration in average cost Markov decision chainsAverage Cost Semi-Markov Decision Processes and the Control of Queueing SystemsConstrained Discounted Markov Decision ChainsA real-time dynamic lot-sizing heuristic for a manufacturing system subject to random setup timesOptimal intensity allocation of single-server queueing networksMinimizing fleet operating costs for a container transportation companyBoundedly optimal control of piecewise deterministic systemsNumerical methods for controlled and uncontrolled multiplexing and queueing systemsOptimal single ordering policy with multiple delivery modes and Bayesian information updatesCompetitive production scheduling: A two-firm, noncooperative finite dynamic gameSerial and parallel value iteration algorithms for discounted Markov decision processesA dynamic software release modelA note on the Ross-Taylor theoremAn empirical study of policy convergence in Markov decision process value iterationOptimal inventory replenishment policy for a queueing system with finite waiting room capacityA duality approach to admission and scheduling controls of queuesDiscrete-time control for systems of interacting objects with unknown random disturbance distributions: a mean field approachNumerical aspects of monotone approximations in convex stochastic control problemsAn effective numerical method for controlled routing in large trunk line networksProduction scheduling in a price competitionInference in credal networks: Branch-and-bound methods and the A/R+ algorithmA perturbation approach to a class of discounted approximate value iteration algorithms with Borel spacesRestricted gradient-descent algorithm for value-function approximation in reinforcement learningValue iteration in average cost Markov control processes on Borel spacesDiscrete-time behavioral portfolio selection under cumulative prospect theoryControl systems engineering educationImmunization and max-min optimal controlDiscretization procedures for adaptive Markov control processesOptimal control of polling models for transportation applicationsNonlinear and dynamic programming for epidemic interventionOptimality of monotonic policies for two-action Markovian decision processes, with applications to control of queues with delayed informationA metaheuristic approach to solving a multiproduct EOQ-based inventory problem with storage space constraintsLinear quadratic dynamic programming for water reservoir managementOptimal purchasing policy in a two-component assembly system with different purchasing contracts for each componentA MDP approach to fault-tolerant routingOptimal maintenance policies in random environmentsMarkov control models with unknown random state-action-dependent discount factorsScalable computational techniques for centrality metrics on temporally detailed social networkAbstraction and approximate decision-theoretic planning.Stochastic observability in network state estimation and controlRegularity properties of constrained set-valued mappingsSimultaneous design of measurement and control strategies for stochastic systems with feedbackA version of the Euler equation in discounted Markov decision processesStochastic vendor managed replenishment with demand dependent shipment.A dynamic programming approach: improving the performance of wireless networksA dynamic programming strategy to balance exploration and exploitation in the bandit problemControl strategies for a stochastic flexible manufacturing and assembly system modelA Q-learning predictive control scheme with guaranteed stabilityApplication of interior-point methods to model predictive controlStructured policies in the sequential design of experimentsIrreversibility and the behavior of aggregate stochastic growth modelsStochastic finite-state systems in control theoryAn optimal inventory management problem with reputation-dependent demandAnother set of conditions for average optimality in Markov control processesControl systems of interacting objects modeled as a game against nature under a mean field approachAverage optimality in dynamic programming on Borel spaces -- unbounded costs and controlsOptimal nonlinear policy: signal extraction with a non-normal priorRandomization for robot tasks: using dynamic programming in the space of knowledge statesAdmission control in UMTS networks based on approximate dynamic programmingOptimal consumption under deterministic incomeMaximization of revenue in fishery model with Cobb-Douglas type of production functionA modified genetic algorithm for optimal control problemsPolicy iteration and Newton-Raphson methods for Markov decision processes under average cost criterionReduced complexity dynamic programming based on policy iterationInventory control as a discrete system control for the fixed-order quantity systemIllustrated review of convergence conditions of the value iteration algorithm and the rolling horizon procedure for average-cost MDPsOptimal control of renewable resources with alternative useConvergence analysis of an inertial accelerated iterative algorithm for solving split variational inequality problemAverage cost optimal policies for Markov control processes with Borel state space and unbounded costsA survey of Markov decision models for control of networks of queuesAccelerating autonomous learning by using heuristic selection of actionsThe complexity of dynamic programmingHedging point policy improvementTransfer of learning by composing solutions of elemental sequential tasksEffect of reconfiguration costs on planning for capacity scalability in reconfigurable manufacturing systemsRemarks on the existence of solutions to the average cost optimality equation in Markov decision processesHierarchical production control for a flow shop with dynamic setup changes and random machine breakdownsAdaptive optimization and the harvest of biological populationsNear optimization of dynamic systems by decomposition and aggregationInventory models with unreliable suppliers in a random environmentOptimal control of a stochastic assembly production lineUtility-based on-line exploration for repeated navigation in an embedded graphMultiple perspective dynamic decision makingImpact of ramp-up on the optimal capacity-related reconfiguration policyAlternate approaches to solving the Holt et al. model and to performing sensitivity analysisOptimal digital product auctions with unlimited supply and rebidding behaviorOptimal filtering of discrete-time hybrid systemsBond management and max-min optimal control.Text segmentation by product partition models and dynamic programmingDeep reinforcement learning for inventory control: a roadmapTheoretical tools for understanding and aiding dynamic decision makingStopping rules for utility functions and the St. Petersburg gambleA maxmin policy for bond managementStochastic dynamic programming with factored representationsBounded-parameter Markov decision processesA production control problem in competitionOptimal myopic policy for a stochastic inventory problem with fixed and proportional backorder costsEstimating equilibrium probabilities for band diagonal Markov chains using aggregation and disaggregation techniquesOn the intertemporal allocation of a natural resourceA survey of solution techniques for the partially observed Markov decision processInformation matrices with random regressors. Application to experimental designOptimal cost and policy for a Markovian replacement problemDynamic programming using radial basis functionsOn the undecidability of probabilistic planning and related stochastic optimization problemsAn application of Lemke's method to a class of Markov decision problems