The theory of dynamic programming


Publication:5830128

DOI: 10.1090/S0002-9904-1954-09848-8, zbMath: 0057.12503, OpenAlex: W1980516134, Wikidata: Q56115923, Scholia: Q56115923, MaRDI QID: Q5830128

Richard Bellman

Publication date: 1954

Published in: Bulletin of the American Mathematical Society

Full work available at URL: https://doi.org/10.1090/s0002-9904-1954-09848-8




Related Items (93)

Time Consistency of the Mean-Risk Problem
On optimal segmentation and parameter tuning for multiple change-point detection and inference
Multiscale Quantile Segmentation
Canonical forms for stochastic nonlinear systems
Towards sustainable timber harvesting of homogeneous stands: dynamic programming in synergy with forest growth simulation
Undiscounted control policy generation for continuous-valued optimal control by approximate dynamic programming
Temporal concatenation for Markov decision processes
Computational approaches for mixed integer optimal control problems with indicator constraints
Dynamic optimization and its relation to classical and quantum constrained systems
Second class constraints and the consistency of optimal control theory in phase space
An adaptive multi-spline refinement algorithm in simulation based sailboat trajectory optimization using onboard multi-core computer systems
New solution procedures for the order picker routing problem in U-shaped Pick areas with a movable depot
The quantum dark side of the optimal control theory
Model predictive control of cash balance in a cash concentration and disbursements system
Approximate dynamic programming with post-decision states as a solution method for dynamic economic models
ON THE INTERFACE BETWEEN OPTIMAL PERIODIC AND CONTINUOUS DIVIDEND STRATEGIES IN THE PRESENCE OF TRANSACTION COSTS
Stochastic decision diagrams
Initialization of the shooting method via the Hamilton-Jacobi-Bellman approach
On continuous-time infinite horizon optimal control -- dissipativity, stability, and transversality
A dual subspace parsimonious mixture of matrix normal distributions
What Next?
Multibody dynamics and control using machine learning
Cluster-based lateral transshipments for the Zambian health supply chain
The policy graph decomposition of multistage stochastic programming problems
Risk-averse optimization of reward-based coherent risk measures
Solving nonlinear and dynamic programming equations on extended \(b\)-metric spaces with the fixed-point technique
Hiring Secretaries over Time: The Benefit of Concurrent Employment
Dissipativity in infinite horizon optimal control and dynamic programming
Synthesis of distributed optimal control in the tracking problem for the optimization of thermal processes described by integro-differential equations
A duality-based proof of the triangle inequality for the Wasserstein distances
On the sample complexity of actor-critic method for reinforcement learning with function approximation
A survey of average cost problems in deterministic discrete-time control systems
Optimization of an economic ordering quantity model for non-instantaneous deteriorating items with ordering time constraint using dynamic programming
Cost-aware sequential diagnostics
From Reinforcement Learning to Deep Reinforcement Learning: An Overview
Dominance rules in combinatorial optimization problems
Maximizing reachability in a temporal graph obtained by assigning starting times to a collection of walks
Analysis and Numerical Approximation of Stationary Second-Order Mean Field Game Partial Differential Inclusions
On using dynamic programming for time warping in pattern recognition
Numerical solutions to continuous linear programming problems
Strengthening of the Kneser theorem on zeros of the solutions of the equation \(y+ p(x)y=0\)
A unified framework for stochastic optimization
Reinforcement learning for long-run average cost.
Some problems of the theory of dynamic programming
Dynamic Programming for an Optimal and Equitable Public Load Shedding Schedule
Unnamed Item
Cooperative and non-cooperative behaviour in the exploitation of a common renewable resource with environmental stochasticity
Quantitative model-checking of controlled discrete-time Markov processes
Branch-and-bound algorithms: a survey of recent advances in searching, branching, and pruning
A population-based fast algorithm for a billion-dimensional resource allocation problem with integer variables
A linear-quadratic Gaussian approach to dynamic information acquisition
Existence of solutions to the Cauchy problem and stability of kink- solutions of the nonlinear Schrödinger equation
Application of dynamical systems to the study of asymptotic properties of solutions to nonlinear higher-order differential equations
Time to the MRCA of a sample in a Wright-Fisher model with variable population size
Segmentation of choroidal boundary in enhanced depth imaging octs using a multiresolution texture based modeling in graph cuts
A boundary value problem for differential equations with a retarded argument
Variational optimisation by the solution of a series of Hamilton-Jacobi equations
Oscillatory properties of second order nonlinear differential equations
Suzdal conference--2. Proceedings of the international conference on dynamical systems and differential equations, Suzdal, Russia, July 1--6, 2002. Part 2. Transl. from the Russian
A survey for deep reinforcement learning in Markovian cyber-physical systems: common problems and solutions
Optimisation model for multi-item multi-echelon supply chains with nested multi-level products
A rotating-grid upwind fast sweeping scheme for a class of Hamilton-Jacobi equations
Stochastic dynamic programming illuminates the link between environment, physiology, and evolution
Relationship between solutions of families of two-point boundary value problems and Cauchy problems
Feedback control problem of an SIR epidemic model based on the Hamilton-Jacobi-Bellman equation
Functional data clustering by projection into latent generalized hyperbolic subspaces
High-performance simulation-based algorithms for an alpine ski racer's trajectory optimization in heterogeneous computer systems
Loading tow trains ergonomically for just-in-time part supply
A breakpoint detection in the mean model with heterogeneous variance on fixed time intervals
Computational aspects of optimal strategic network diffusion
Some Functional Equations in the Theory of Dynamic Programming. I. Functions of Points and Point Transformations
Stability in a neighborhood of a certain state
On the Bogolyubov-Mitropol'skij averaging principle for a class of second order hyperbolic equations
A theorem on averaging for hyperbolic systems of first order
The reduction principle in the Banach space
Variational problems with constraints
Functional equations in the theory of dynamic programming. III
The theory of dynamic programming
Eigenvalues and Functional Equations
The truncation of a countable system of partial differential equations
Deep reinforcement learning for inventory control: a roadmap
Ratcheting with a bliss level of consumption
The difference and unity of irregular LQ control and standard LQ control and its solution
Asymptotic series for the solution to the Cauchy problem
A scheduling problem in the baking industry
Political behavior of a deputy and voting prediction in a legislative body
A Linear Programming Approach to Sequential Hypothesis Testing
A new look at Bellman's principle of optimality
Qualitative analysis of families of bounded solutions of the nonlinear three-dimensional Schrödinger equation
Model-based Reinforcement Learning: A Survey
The maximum principle, Bellman's equation, and Carathéodory's work
High-order fully actuated system approaches: Part I. Models and basic procedure
Improving the filtering of branch-and-bound MDD solver







