scientific article

From MaRDI portal
Revision as of 13:59, 5 February 2024 by Import240129110113 (talk | contribs) (Created automatically from import240129110113)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

Publication:3795523

zbMath0649.93001MaRDI QIDQ3795523

Dimitri P. Bertsekas

Publication date: 1987


Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.



Related Items (only showing first 100 items - show all)

Optimal control of single-server queueing networksThe Expected Total Cost Criterion for Markov Decision Processes under Constraints: A Convex Analytic ApproachTemporal logics for the specification of performance and reliabilityUnnamed ItemUnnamed ItemA turnpike improvement algorithm for piecewise deterministic controlA unified DC programming framework and efficient DCA based approaches for large scale batch reinforcement learningStructural results for partially observed control modelsDiscrete reachability of hybrid systemsSolving H-horizon, stationary Markov decision problems in time proportional to log (H)On the average cost optimality equation and the structure of optimal policies for partially observable Markov decision processesOn the computation of the optimal cost function for discrete time Markov models with partial observationsApplication of Stochastic Dynamic Programming in Demand Dispatch-Based Optimal Operation of a MicrogridNear optimization of stochastic dynamic systems by decomposition and aggregationControlled Markov processes on the infinite planning horizon: Weighted and overtaking cost criteriaZero-sum stochastic games with unbounded costs: Discounted and average cost casesWhen control and state variations increase uncertainty: modeling and stochastic control in discrete timeOptimal feedback solution of a constrained stochastic one-storage modelError bounds for stochastic shortest path problemsUnnamed ItemReinforcement learning of non-Markov decision processesRegular Policies in Abstract Dynamic ProgrammingNow decision theorySolving an Infinite-Horizon Discounted Markov Decision Process by DC Programming and DCAA survey of average cost problems in deterministic discrete-time control systemsHeterogeneous expertise and collective decision-makingModel development with Maple in PhD-level management science courses: a personal accountAn approximate dynamic programming approach to resource management in multi-cloud scenariosAssessing Decisions on Multiple Uses of Water and Hydroelectric FacilitiesDynamic distributed clustering in wireless sensor networks via Voronoi tessellation controlOptimality of greedy and sustainable policies in the management of renewable resourcesA MODEL FOR THE OPTIMAL ASSET-LIABILITY MANAGEMENT FOR INSURANCE COMPANIESOptimal empty vehicle repositioning and fleet-sizing for two-depot service systemsAnalysis of supply contracts with commitments and flexibilityTurnpikes and computation of piecewise open-loop equilibria in stochastic differential gamesEquation‐free optimal switching policies for bistable reacting systemsPull-based broadcasting with timing constraintsTurnpikes and computation of piecewise open-loop equilibria in stochastic differential gamesOptimal Portfolio Selection with Transaction CostsAdaptive control of stochastic systems with unknown disturbance distribution: discounted criteriaLocal solutions to the Hamilton-Jacobi-Bellman equation in stochastic problems of optimal controlAn envelope theorem and some applications to discounted Markov decision processesOptimal pension funding dynamics over infinite control horizon when stochastic rates of return are stationaryOptimal asset--liability management with constraints: A dynamic programming approachMultiagent cooperative search for portfolio selectionMonitoring and control of anytime algorithms: A dynamic programming approachOn valuing appreciating human assets in servicesUnnamed ItemBuilding an Optimal Portfolio in Discrete Time in the Presence of Transaction CostsUnnamed ItemManaging the impact of high market growth and learning on knowledge worker productivity and service qualityTD(λ) learning without eligibility traces: a theoretical analysisModel-based learning of interaction strategies in multi-agent systemsRecent developments in single product, discrete-time, capacitated production-inventory systems.Approximating infinite horizon stochastic optimal control in discrete time with constraintsAsymptotically optimal controls of hybrid linear quadratic regulators in discrete time.2guaranteed cost control for uncertain discrete-time linear systemsAuctions for Resource Allocation in Overlay NetworksNonzero-sum stochastic games with unbounded costs: Discounted and average cost casesUndiscounted Markov decision chains with partial information; an algorithm for computing a locally optimal periodic policyDenumerable controlled Markov chains with average reward criterion: Sample path optimalityUnnamed ItemA perturbation approach to approximate value iteration for average cost Markov decision processes with Borel spaces and bounded costsOptimal control and performance analysis of anMX/M/1queue with batches of negative customersOn the Complexity of Value IterationControl of singularly perturbed Markov chains: A numerical studyWhen is a base stock policy optimal in recovering disrupted cyclic schedules?BRIDGE LANE DIRECTION SPECIFICATION FOR SUSTAINABLE TRAFFIC MANAGEMENTUnnamed ItemUnnamed ItemUnnamed ItemThe convergence of value iteration in average cost Markov decision chainsAverage Cost Semi-Markov Decision Processes and the Control of Queueing SystemsConstrained Discounted Markov Decision ChainsA real-time dynamic lot-sizing heuristic for a manufacturing system subject to random setup timesOptimal intensity allocation of single-server queueing networksMinimizing fleet operating costs for a container transportation companyBoundedly optimal control of piecewise deterministic systemsNumerical methods for controlled and uncontrolled multiplexing and queueing systemsOptimal single ordering policy with multiple delivery modes and Bayesian information updatesCompetitive production scheduling: A two-firm, noncooperative finite dynamic gameSerial and parallel value iteration algorithms for discounted Markov decision processesA dynamic software release modelA note on the Ross-Taylor theoremAn empirical study of policy convergence in Markov decision process value iterationOptimal inventory replenishment policy for a queueing system with finite waiting room capacityA duality approach to admission and scheduling controls of queuesDiscrete-time control for systems of interacting objects with unknown random disturbance distributions: a mean field approachNumerical aspects of monotone approximations in convex stochastic control problemsAn effective numerical method for controlled routing in large trunk line networksProduction scheduling in a price competitionInference in credal networks: Branch-and-bound methods and the A/R+ algorithmA perturbation approach to a class of discounted approximate value iteration algorithms with Borel spacesRestricted gradient-descent algorithm for value-function approximation in reinforcement learningValue iteration in average cost Markov control processes on Borel spacesDiscrete-time behavioral portfolio selection under cumulative prospect theoryControl systems engineering educationImmunization and max-min optimal controlDiscretization procedures for adaptive Markov control processesOptimal control of polling models for transportation applications






This page was built for publication: