The theory of dynamic programming

DOI10.1090/S0002-9904-1954-09848-8zbMath0057.12503OpenAlexW1980516134WikidataQ56115923 ScholiaQ56115923MaRDI QIDQ5830128

Richard Bellman

Publication date: 1954

Published in: Bulletin of the American Mathematical Society (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1090/s0002-9904-1954-09848-8

Mathematics Subject Classification ID

Dynamic programming (90C39)

Related Items (93)

Time Consistency of the Mean-Risk Problem ⋮ On optimal segmentation and parameter tuning for multiple change-point detection and inference ⋮ Multiscale Quantile Segmentation ⋮ Canonical forms for stochastic nonlinear systems ⋮ Towards sustainable timber harvesting of homogeneous stands: dynamic programming in synergy with forest growth simulation ⋮ Undiscounted control policy generation for continuous-valued optimal control by approximate dynamic programming ⋮ Temporal concatenation for Markov decision processes ⋮ Computational approaches for mixed integer optimal control problems with indicator constraints ⋮ Dynamic optimization and its relation to classical and quantum constrained systems ⋮ Second class constraints and the consistency of optimal control theory in phase space ⋮ An adaptive multi-spline refinement algorithm in simulation based sailboat trajectory optimization using onboard multi-core computer systems ⋮ New solution procedures for the order picker routing problem in U-shaped Pick areas with a movable depot ⋮ The quantum dark side of the optimal control theory ⋮ Model predictive control of cash balance in a cash concentration and disbursements system ⋮ Approximate dynamic programming with post-decision states as a solution method for dynamic economic models ⋮ ON THE INTERFACE BETWEEN OPTIMAL PERIODIC AND CONTINUOUS DIVIDEND STRATEGIES IN THE PRESENCE OF TRANSACTION COSTS ⋮ Stochastic decision diagrams ⋮ Initialization of the shooting method via the Hamilton-Jacobi-Bellman approach ⋮ On continuous-time infinite horizon optimal control -- dissipativity, stability, and transversality ⋮ A dual subspace parsimonious mixture of matrix normal distributions ⋮ What Next? ⋮ Multibody dynamics and control using machine learning ⋮ Cluster-based lateral transshipments for the Zambian health supply chain ⋮ The policy graph decomposition of multistage stochastic programming problems ⋮ Risk-averse optimization of reward-based coherent risk measures ⋮ Solving nonlinear and dynamic programming equations on extended \(b\)-metric spaces with the fixed-point technique ⋮ Hiring Secretaries over Time: The Benefit of Concurrent Employment ⋮ Dissipativity in infinite horizon optimal control and dynamic programming ⋮ Synthesis of distributed optimal control in the tracking problem for the optimization of thermal processes described by integro-differential equations ⋮ A duality-based proof of the triangle inequality for the Wasserstein distances ⋮ On the sample complexity of actor-critic method for reinforcement learning with function approximation ⋮ A survey of average cost problems in deterministic discrete-time control systems ⋮ Optimization of an economic ordering quantity model for non-instantaneous deteriorating items with ordering time constraint using dynamic programming ⋮ Cost-aware sequential diagnostics ⋮ From Reinforcement Learning to Deep Reinforcement Learning: An Overview ⋮ Dominance rules in combinatorial optimization problems ⋮ Maximizing reachability in a temporal graph obtained by assigning starting times to a collection of walks ⋮ Analysis and Numerical Approximation of Stationary Second-Order Mean Field Game Partial Differential Inclusions ⋮ On using dynamic programming for time warping in pattern recognition ⋮ Numerical solutions to continuous linear programming problems ⋮ Strengthening of the Kneser theorem on zeros of the solutions of the equation \(y+ p(x)y=0\) ⋮ A unified framework for stochastic optimization ⋮ Reinforcement learning for long-run average cost. ⋮ Some problems of the theory of dynamic programming ⋮ Dynamic Programming for an Optimal and Equitable Public Load Shedding Schedule ⋮ Unnamed Item ⋮ Cooperative and non-cooperative behaviour in the exploitation of a common renewable resource with environmental stochasticity ⋮ Quantitative model-checking of controlled discrete-time Markov processes ⋮ Branch-and-bound algorithms: a survey of recent advances in searching, branching, and pruning ⋮ A population-based fast algorithm for a billion-dimensional resource allocation problem with integer variables ⋮ A linear-quadratic Gaussian approach to dynamic information acquisition ⋮ Existence of solutions to the Cauchy problem and stability of kink- solutions of the nonlinear Schrödinger equation ⋮ Application of dynamical systems to the study of asymptotic properties of solutions to nonlinear higher-order differential equations ⋮ Time to the MRCA of a sample in a Wright-Fisher model with variable population size ⋮ Segmentation of choroidal boundary in enhanced depth imaging octs using a multiresolution texture based modeling in graph cuts ⋮ A boundary value problem for differential equations with a retarded argument ⋮ Variational optimisation by the solution of a series of Hamilton-Jacobi equations ⋮ Oscillatory properties of second order nonlinear differential equations ⋮ Suzdal conference--2. Proceedings of the international conference on dynamical systems and differential equations, Suzdal, Russia, July 1--6, 2002. Part 2. Transl. from the Russian ⋮ A survey for deep reinforcement learning in Markovian cyber-physical systems: common problems and solutions ⋮ Optimisation model for multi-item multi-echelon supply chains with nested multi-level products ⋮ A rotating-grid upwind fast sweeping scheme for a class of Hamilton-Jacobi equations ⋮ Stochastic dynamic programming illuminates the link between environment, physiology, and evolution ⋮ Relationship between solutions of families of two-point boundary value problems and Cauchy problems ⋮ Feedback control problem of an SIR epidemic model based on the Hamilton-Jacobi-Bellman equation ⋮ Functional data clustering by projection into latent generalized hyperbolic subspaces ⋮ High-performance simulation-based algorithms for an alpine ski racer's trajectory optimization in heterogeneous computer systems ⋮ Loading tow trains ergonomically for just-in-time part supply ⋮ A breakpoint detection in the mean model with heterogeneous variance on fixed time intervals ⋮ Computational aspects of optimal strategic network diffusion ⋮ Some Functional Equations in the Theory of Dynamic Programming. I. Functions of Points and Point Transformations ⋮ Stability in a neighborhood of a certain state ⋮ On the Bogolyubov-Mitropol'skij averaging principle for a class of second order hyperbolic equations ⋮ A theorem on averaging for hyperbolic systems of first order ⋮ The reduction principle in the Banach space ⋮ Variational problems with constraints ⋮ Functional equations in the theory of dynamic programming. III ⋮ The theory of dynamic programming ⋮ Eigenvalues and Functional Equations ⋮ The truncation of a countable system of partial differential equations ⋮ Deep reinforcement learning for inventory control: a roadmap ⋮ Ratcheting with a bliss level of consumption ⋮ The difference and unity of irregular LQ control and standard LQ control and its solution ⋮ Asymptotic series for the solution to the Cauchy problem ⋮ A scheduling problem in the baking industry ⋮ Political behavior of a deputy and voting prediction in a legislative body ⋮ A Linear Programming Approach to Sequential Hypothesis Testing ⋮ A new look at Bellman's principle of optimality ⋮ Qualitative analysis of families of bounded solutions of the nonlinear three-dimensional Schrödinger equation ⋮ Model-based Reinforcement Learning: A Survey ⋮ The maximum principle, Bellman's equation, and Carathéodory's work ⋮ High-order fully actuated system approaches: Part I. Models and basic procedure ⋮ Improving the filtering of branch-and-bound MDD solver

Cites Work

This page was built for publication: The theory of dynamic programming