scientific article

From MaRDI portal

Publication:3795523


zbMath: 0649.93001; MaRDI QID: Q3795523

Dimitri P. Bertsekas

Publication date: 1987

Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.

zbMATH Keywords

dynamic programming; discrete stochastic systems; imperfect state information; infinite horizon problems; suboptimal and adaptive control

Mathematics Subject Classification ID

Dynamic programming in optimal control and differential games (49L20); Search theory (90B40); Management decision making, including multiple objectives (90B50); Deterministic scheduling theory in operations research (90B35); Queues and service in operations research (90B22); Adaptive control/observation systems (93C40); Discrete-time control/observation systems (93C55); Dynamic programming (90C39); Optimal stochastic control (93E20); Markov and semi-Markov decision processes (90C40); Introductory exposition (textbooks, tutorial papers, etc.) pertaining to operations research and mathematical programming (90-01); Stochastic systems in control theory (general) (93E03); Introductory exposition (textbooks, tutorial papers, etc.) pertaining to systems and control theory (93-01)

Related Items

Optimal control of single-server queueing networks ⋮ The Expected Total Cost Criterion for Markov Decision Processes under Constraints: A Convex Analytic Approach ⋮ Temporal logics for the specification of performance and reliability ⋮ Unnamed Item ⋮ Unnamed Item ⋮ A turnpike improvement algorithm for piecewise deterministic control ⋮ A unified DC programming framework and efficient DCA based approaches for large scale batch reinforcement learning ⋮ Structural results for partially observed control models ⋮ Discrete reachability of hybrid systems ⋮ Solving H-horizon, stationary Markov decision problems in time proportional to log (H) ⋮ On the average cost optimality equation and the structure of optimal policies for partially observable Markov decision processes ⋮ On the computation of the optimal cost function for discrete time Markov models with partial observations ⋮ Application of Stochastic Dynamic Programming in Demand Dispatch-Based Optimal Operation of a Microgrid ⋮ Near optimization of stochastic dynamic systems by decomposition and aggregation ⋮ Controlled Markov processes on the infinite planning horizon: Weighted and overtaking cost criteria ⋮ Zero-sum stochastic games with unbounded costs: Discounted and average cost cases ⋮ When control and state variations increase uncertainty: modeling and stochastic control in discrete time ⋮ Optimal feedback solution of a constrained stochastic one-storage model ⋮ Error bounds for stochastic shortest path problems ⋮ Unnamed Item ⋮ Reinforcement learning of non-Markov decision processes ⋮ Regular Policies in Abstract Dynamic Programming ⋮ Now decision theory ⋮ Solving an Infinite-Horizon Discounted Markov Decision Process by DC Programming and DCA ⋮ A survey of average cost problems in deterministic discrete-time control systems ⋮ Heterogeneous expertise and collective decision-making ⋮ Model development with Maple in PhD-level management science courses: a personal account ⋮ An approximate dynamic programming 
approach to resource management in multi-cloud scenarios ⋮ Assessing Decisions on Multiple Uses of Water and Hydroelectric Facilities ⋮ Dynamic distributed clustering in wireless sensor networks via Voronoi tessellation control ⋮ Optimality of greedy and sustainable policies in the management of renewable resources ⋮ A MODEL FOR THE OPTIMAL ASSET-LIABILITY MANAGEMENT FOR INSURANCE COMPANIES ⋮ Optimal empty vehicle repositioning and fleet-sizing for two-depot service systems ⋮ Analysis of supply contracts with commitments and flexibility ⋮ Turnpikes and computation of piecewise open-loop equilibria in stochastic differential games ⋮ Equation‐free optimal switching policies for bistable reacting systems ⋮ Pull-based broadcasting with timing constraints ⋮ Turnpikes and computation of piecewise open-loop equilibria in stochastic differential games ⋮ Optimal Portfolio Selection with Transaction Costs ⋮ Adaptive control of stochastic systems with unknown disturbance distribution: discounted criteria ⋮ Local solutions to the Hamilton-Jacobi-Bellman equation in stochastic problems of optimal control ⋮ An envelope theorem and some applications to discounted Markov decision processes ⋮ Optimal pension funding dynamics over infinite control horizon when stochastic rates of return are stationary ⋮ Optimal asset--liability management with constraints: A dynamic programming approach ⋮ Multiagent cooperative search for portfolio selection ⋮ Monitoring and control of anytime algorithms: A dynamic programming approach ⋮ On valuing appreciating human assets in services ⋮ Unnamed Item ⋮ Building an Optimal Portfolio in Discrete Time in the Presence of Transaction Costs ⋮ Unnamed Item ⋮ Managing the impact of high market growth and learning on knowledge worker productivity and service quality ⋮ TD(λ) learning without eligibility traces: a theoretical analysis ⋮ Model-based learning of interaction strategies in multi-agent systems ⋮ Recent developments in single product, discrete-time, 
capacitated production-inventory systems. ⋮ Approximating infinite horizon stochastic optimal control in discrete time with constraints ⋮ Asymptotically optimal controls of hybrid linear quadratic regulators in discrete time. ⋮ ℋ₂ guaranteed cost control for uncertain discrete-time linear systems ⋮ Auctions for Resource Allocation in Overlay Networks ⋮ Nonzero-sum stochastic games with unbounded costs: Discounted and average cost cases ⋮ Undiscounted Markov decision chains with partial information; an algorithm for computing a locally optimal periodic policy ⋮ Denumerable controlled Markov chains with average reward criterion: Sample path optimality ⋮ Unnamed Item ⋮ A perturbation approach to approximate value iteration for average cost Markov decision processes with Borel spaces and bounded costs ⋮ Optimal control and performance analysis of an M^X/M/1 queue with batches of negative customers ⋮ On the Complexity of Value Iteration ⋮ Control of singularly perturbed Markov chains: A numerical study ⋮ When is a base stock policy optimal in recovering disrupted cyclic schedules?
⋮ BRIDGE LANE DIRECTION SPECIFICATION FOR SUSTAINABLE TRAFFIC MANAGEMENT ⋮ Unnamed Item ⋮ Unnamed Item ⋮ Unnamed Item ⋮ The convergence of value iteration in average cost Markov decision chains ⋮ Average Cost Semi-Markov Decision Processes and the Control of Queueing Systems ⋮ Constrained Discounted Markov Decision Chains ⋮ A real-time dynamic lot-sizing heuristic for a manufacturing system subject to random setup times ⋮ Optimal intensity allocation of single-server queueing networks ⋮ Minimizing fleet operating costs for a container transportation company ⋮ Boundedly optimal control of piecewise deterministic systems ⋮ Numerical methods for controlled and uncontrolled multiplexing and queueing systems ⋮ Optimal single ordering policy with multiple delivery modes and Bayesian information updates ⋮ Competitive production scheduling: A two-firm, noncooperative finite dynamic game ⋮ Serial and parallel value iteration algorithms for discounted Markov decision processes ⋮ A dynamic software release model ⋮ A note on the Ross-Taylor theorem ⋮ An empirical study of policy convergence in Markov decision process value iteration ⋮ Optimal inventory replenishment policy for a queueing system with finite waiting room capacity ⋮ A duality approach to admission and scheduling controls of queues ⋮ Discrete-time control for systems of interacting objects with unknown random disturbance distributions: a mean field approach ⋮ Numerical aspects of monotone approximations in convex stochastic control problems ⋮ An effective numerical method for controlled routing in large trunk line networks ⋮ Production scheduling in a price competition ⋮ Inference in credal networks: Branch-and-bound methods and the A/R+ algorithm ⋮ A perturbation approach to a class of discounted approximate value iteration algorithms with Borel spaces ⋮ Restricted gradient-descent algorithm for value-function approximation in reinforcement learning ⋮ Value iteration in average cost Markov control processes on 
Borel spaces ⋮ Discrete-time behavioral portfolio selection under cumulative prospect theory ⋮ Control systems engineering education ⋮ Immunization and max-min optimal control ⋮ Discretization procedures for adaptive Markov control processes ⋮ Optimal control of polling models for transportation applications ⋮ Nonlinear and dynamic programming for epidemic intervention ⋮ Optimality of monotonic policies for two-action Markovian decision processes, with applications to control of queues with delayed information ⋮ A metaheuristic approach to solving a multiproduct EOQ-based inventory problem with storage space constraints ⋮ Linear quadratic dynamic programming for water reservoir management ⋮ Optimal purchasing policy in a two-component assembly system with different purchasing contracts for each component ⋮ A MDP approach to fault-tolerant routing ⋮ Optimal maintenance policies in random environments ⋮ Markov control models with unknown random state-action-dependent discount factors ⋮ Scalable computational techniques for centrality metrics on temporally detailed social network ⋮ Abstraction and approximate decision-theoretic planning. ⋮ Stochastic observability in network state estimation and control ⋮ Regularity properties of constrained set-valued mappings ⋮ Simultaneous design of measurement and control strategies for stochastic systems with feedback ⋮ A version of the Euler equation in discounted Markov decision processes ⋮ Stochastic vendor managed replenishment with demand dependent shipment. 
⋮ A dynamic programming approach: improving the performance of wireless networks ⋮ A dynamic programming strategy to balance exploration and exploitation in the bandit problem ⋮ Control strategies for a stochastic flexible manufacturing and assembly system model ⋮ A Q-learning predictive control scheme with guaranteed stability ⋮ Application of interior-point methods to model predictive control ⋮ Structured policies in the sequential design of experiments ⋮ Irreversibility and the behavior of aggregate stochastic growth models ⋮ Stochastic finite-state systems in control theory ⋮ An optimal inventory management problem with reputation-dependent demand ⋮ Another set of conditions for average optimality in Markov control processes ⋮ Control systems of interacting objects modeled as a game against nature under a mean field approach ⋮ Average optimality in dynamic programming on Borel spaces -- unbounded costs and controls ⋮ Optimal nonlinear policy: signal extraction with a non-normal prior ⋮ Randomization for robot tasks: using dynamic programming in the space of knowledge states ⋮ Admission control in UMTS networks based on approximate dynamic programming ⋮ Optimal consumption under deterministic income ⋮ Maximization of revenue in fishery model with Cobb-Douglas type of production function ⋮ A modified genetic algorithm for optimal control problems ⋮ Policy iteration and Newton-Raphson methods for Markov decision processes under average cost criterion ⋮ Reduced complexity dynamic programming based on policy iteration ⋮ Inventory control as a discrete system control for the fixed-order quantity system ⋮ Illustrated review of convergence conditions of the value iteration algorithm and the rolling horizon procedure for average-cost MDPs ⋮ Optimal control of renewable resources with alternative use ⋮ Convergence analysis of an inertial accelerated iterative algorithm for solving split variational inequality problem ⋮ Average cost optimal policies for Markov control 
processes with Borel state space and unbounded costs ⋮ A survey of Markov decision models for control of networks of queues ⋮ Accelerating autonomous learning by using heuristic selection of actions ⋮ The complexity of dynamic programming ⋮ Hedging point policy improvement ⋮ Transfer of learning by composing solutions of elemental sequential tasks ⋮ Effect of reconfiguration costs on planning for capacity scalability in reconfigurable manufacturing systems ⋮ Remarks on the existence of solutions to the average cost optimality equation in Markov decision processes ⋮ Hierarchical production control for a flow shop with dynamic setup changes and random machine breakdowns ⋮ Adaptive optimization and the harvest of biological populations ⋮ Near optimization of dynamic systems by decomposition and aggregation ⋮ Inventory models with unreliable suppliers in a random environment ⋮ Optimal control of a stochastic assembly production line ⋮ Utility-based on-line exploration for repeated navigation in an embedded graph ⋮ Multiple perspective dynamic decision making ⋮ Impact of ramp-up on the optimal capacity-related reconfiguration policy ⋮ Alternate approaches to solving the Holt et al. model and to performing sensitivity analysis ⋮ Optimal digital product auctions with unlimited supply and rebidding behavior ⋮ Optimal filtering of discrete-time hybrid systems ⋮ Bond management and max-min optimal control. ⋮ Text segmentation by product partition models and dynamic programming ⋮ Deep reinforcement learning for inventory control: a roadmap ⋮ Theoretical tools for understanding and aiding dynamic decision making ⋮ Stopping rules for utility functions and the St. 
Petersburg gamble ⋮ A maxmin policy for bond management ⋮ Stochastic dynamic programming with factored representations ⋮ Bounded-parameter Markov decision processes ⋮ A production control problem in competition ⋮ Optimal myopic policy for a stochastic inventory problem with fixed and proportional backorder costs ⋮ Estimating equilibrium probabilities for band diagonal Markov chains using aggregation and disaggregation techniques ⋮ On the intertemporal allocation of a natural resource ⋮ A survey of solution techniques for the partially observed Markov decision process ⋮ Information matrices with random regressors. Application to experimental design ⋮ Optimal cost and policy for a Markovian replacement problem ⋮ Dynamic programming using radial basis functions ⋮ On the undecidability of probabilistic planning and related stochastic optimization problems ⋮ An application of Lemke's method to a class of Markov decision problems
