scientific article

Publication date: 1987

Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.

dynamic programming; discrete stochastic systems; imperfect state information; infinite horizon problems; suboptimal and adaptive control

Mathematics Subject Classification ID

Dynamic programming in optimal control and differential games (49L20); Search theory (90B40); Management decision making, including multiple objectives (90B50); Deterministic scheduling theory in operations research (90B35); Queues and service in operations research (90B22); Adaptive control/observation systems (93C40); Discrete-time control/observation systems (93C55); Dynamic programming (90C39); Optimal stochastic control (93E20); Markov and semi-Markov decision processes (90C40); Introductory exposition (textbooks, tutorial papers, etc.) pertaining to operations research and mathematical programming (90-01); Stochastic systems in control theory (general) (93E03); Introductory exposition (textbooks, tutorial papers, etc.) pertaining to systems and control theory (93-01)

Related Items (only showing first 100 items)

Optimal control of single-server queueing networks ⋮ The Expected Total Cost Criterion for Markov Decision Processes under Constraints: A Convex Analytic Approach ⋮ Temporal logics for the specification of performance and reliability ⋮ Unnamed Item ⋮ Unnamed Item ⋮ A turnpike improvement algorithm for piecewise deterministic control ⋮ A unified DC programming framework and efficient DCA based approaches for large scale batch reinforcement learning ⋮ Structural results for partially observed control models ⋮ Discrete reachability of hybrid systems ⋮ Solving H-horizon, stationary Markov decision problems in time proportional to log(H) ⋮ On the average cost optimality equation and the structure of optimal policies for partially observable Markov decision processes ⋮ On the computation of the optimal cost function for discrete time Markov models with partial observations ⋮ Application of Stochastic Dynamic Programming in Demand Dispatch-Based Optimal Operation of a Microgrid ⋮ Near optimization of stochastic dynamic systems by decomposition and aggregation ⋮ Controlled Markov processes on the infinite planning horizon: Weighted and overtaking cost criteria ⋮ Zero-sum stochastic games with unbounded costs: Discounted and average cost cases ⋮ When control and state variations increase uncertainty: modeling and stochastic control in discrete time ⋮ Optimal feedback solution of a constrained stochastic one-storage model ⋮ Error bounds for stochastic shortest path problems ⋮ Unnamed Item ⋮ Reinforcement learning of non-Markov decision processes ⋮ Regular Policies in Abstract Dynamic Programming ⋮ Now decision theory ⋮ Solving an Infinite-Horizon Discounted Markov Decision Process by DC Programming and DCA ⋮ A survey of average cost problems in deterministic discrete-time control systems ⋮ Heterogeneous expertise and collective decision-making ⋮ Model development with Maple in PhD-level management science courses: a personal account ⋮ An approximate dynamic programming approach to resource management in multi-cloud scenarios ⋮ Assessing Decisions on Multiple Uses of Water and Hydroelectric Facilities ⋮ Dynamic distributed clustering in wireless sensor networks via Voronoi tessellation control ⋮ Optimality of greedy and sustainable policies in the management of renewable resources ⋮ A MODEL FOR THE OPTIMAL ASSET-LIABILITY MANAGEMENT FOR INSURANCE COMPANIES ⋮ Optimal empty vehicle repositioning and fleet-sizing for two-depot service systems ⋮ Analysis of supply contracts with commitments and flexibility ⋮ Turnpikes and computation of piecewise open-loop equilibria in stochastic differential games ⋮ Equation‐free optimal switching policies for bistable reacting systems ⋮ Pull-based broadcasting with timing constraints ⋮ Turnpikes and computation of piecewise open-loop equilibria in stochastic differential games ⋮ Optimal Portfolio Selection with Transaction Costs ⋮ Adaptive control of stochastic systems with unknown disturbance distribution: discounted criteria ⋮ Local solutions to the Hamilton-Jacobi-Bellman equation in stochastic problems of optimal control ⋮ An envelope theorem and some applications to discounted Markov decision processes ⋮ Optimal pension funding dynamics over infinite control horizon when stochastic rates of return are stationary ⋮ Optimal asset-liability management with constraints: A dynamic programming approach ⋮ Multiagent cooperative search for portfolio selection ⋮ Monitoring and control of anytime algorithms: A dynamic programming approach ⋮ On valuing appreciating human assets in services ⋮ Unnamed Item ⋮ Building an Optimal Portfolio in Discrete Time in the Presence of Transaction Costs ⋮ Unnamed Item ⋮ Managing the impact of high market growth and learning on knowledge worker productivity and service quality ⋮ TD(λ) learning without eligibility traces: a theoretical analysis ⋮ Model-based learning of interaction strategies in multi-agent systems ⋮ Recent developments in single product, discrete-time, capacitated production-inventory systems. ⋮ Approximating infinite horizon stochastic optimal control in discrete time with constraints ⋮ Asymptotically optimal controls of hybrid linear quadratic regulators in discrete time. ⋮ ℋ₂ guaranteed cost control for uncertain discrete-time linear systems ⋮ Auctions for Resource Allocation in Overlay Networks ⋮ Nonzero-sum stochastic games with unbounded costs: Discounted and average cost cases ⋮ Undiscounted Markov decision chains with partial information; an algorithm for computing a locally optimal periodic policy ⋮ Denumerable controlled Markov chains with average reward criterion: Sample path optimality ⋮ Unnamed Item ⋮ A perturbation approach to approximate value iteration for average cost Markov decision processes with Borel spaces and bounded costs ⋮ Optimal control and performance analysis of an M^X/M/1 queue with batches of negative customers ⋮ On the Complexity of Value Iteration ⋮ Control of singularly perturbed Markov chains: A numerical study ⋮ When is a base stock policy optimal in recovering disrupted cyclic schedules? ⋮ BRIDGE LANE DIRECTION SPECIFICATION FOR SUSTAINABLE TRAFFIC MANAGEMENT ⋮ Unnamed Item ⋮ Unnamed Item ⋮ Unnamed Item ⋮ The convergence of value iteration in average cost Markov decision chains ⋮ Average Cost Semi-Markov Decision Processes and the Control of Queueing Systems ⋮ Constrained Discounted Markov Decision Chains ⋮ A real-time dynamic lot-sizing heuristic for a manufacturing system subject to random setup times ⋮ Optimal intensity allocation of single-server queueing networks ⋮ Minimizing fleet operating costs for a container transportation company ⋮ Boundedly optimal control of piecewise deterministic systems ⋮ Numerical methods for controlled and uncontrolled multiplexing and queueing systems ⋮ Optimal single ordering policy with multiple delivery modes and Bayesian information updates ⋮ Competitive production scheduling: A two-firm, noncooperative finite dynamic game ⋮ Serial and parallel value iteration algorithms for discounted Markov decision processes ⋮ A dynamic software release model ⋮ A note on the Ross-Taylor theorem ⋮ An empirical study of policy convergence in Markov decision process value iteration ⋮ Optimal inventory replenishment policy for a queueing system with finite waiting room capacity ⋮ A duality approach to admission and scheduling controls of queues ⋮ Discrete-time control for systems of interacting objects with unknown random disturbance distributions: a mean field approach ⋮ Numerical aspects of monotone approximations in convex stochastic control problems ⋮ An effective numerical method for controlled routing in large trunk line networks ⋮ Production scheduling in a price competition ⋮ Inference in credal networks: Branch-and-bound methods and the A/R+ algorithm ⋮ A perturbation approach to a class of discounted approximate value iteration algorithms with Borel spaces ⋮ Restricted gradient-descent algorithm for value-function approximation in reinforcement learning ⋮ Value iteration in average cost Markov control processes on Borel spaces ⋮ Discrete-time behavioral portfolio selection under cumulative prospect theory ⋮ Control systems engineering education ⋮ Immunization and max-min optimal control ⋮ Discretization procedures for adaptive Markov control processes ⋮ Optimal control of polling models for transportation applications
