scientific article; zbMATH DE number 1095138
zbMATH Open: 0904.90170; MaRDI QID: Q4368722; FDO: Q4368722
Authors: Dimitri P. Bertsekas
Publication date: 7 December 1997
Title of this publication is not available
Recommendations
- Dynamic programming and optimal control. Vol. 2
- Dynamic programming and optimal control. Vol. 1.
- Dynamic programming and optimal control. Vol. 2.
- scientific article; zbMATH DE number 3889341
- scientific article; zbMATH DE number 3924501
- An Introduction to Optimal Control Theory
- Publication:4939624
- scientific article; zbMATH DE number 3892056
- Dynamic programming and stochastic control
Keywords: uncertainty; combinatorial optimization; dynamic programming; optimal control; sequential decision making; stochastic control; Markovian decision
- Dynamic programming (90C39)
- Dynamic programming in optimal control and differential games (49L20)
- Markov and semi-Markov decision processes (90C40)
- Optimal stochastic control (93E20)
- Introductory exposition (textbooks, tutorial papers, etc.) pertaining to operations research and mathematical programming (90-01)
- Introductory exposition (textbooks, tutorial papers, etc.) pertaining to calculus of variations and optimal control (49-01)
Cited In (first 100 items shown)
- Dynamic programming and optimal control. Vol. 1.
- The Impact of Noise and Sampling Frequency on the Control of Peak-to-Peak Dynamics
- Infinite horizon optimal policy for an inventory system with two types of product sharing common hardware platforms
- Batch repair actions for automated troubleshooting
- Delay-optimal scheduling for two-hop relay networks with randomly varying connectivity: join the shortest queue-longest connected queue policy
- Dynamic marketing policies with rating-sensitive consumers: a mean-field games approach
- Title not available
- Optimal battery purchasing and charging strategy at electric vehicle battery swap stations
- Optimal synchronization control of multiple Euler-Lagrange systems via event-triggered reinforcement learning
- Dynamic games with strategic complements and large number of players
- Multi-agent natural actor-critic reinforcement learning algorithms
- An incremental off-policy search in a model-free Markov decision process using a single sample path
- Pareto efficiency of finite horizon switched linear quadratic differential games
- A survey of numerical solutions for stochastic control problems: some recent progress
- On nondeterministic dynamic programming
- Finite-horizon LQR controller for partially-observed Boolean dynamical systems
- On infinite horizon active fault diagnosis for a class of non-linear non-Gaussian systems
- Assortment planning with nested preferences: dynamic programming with distributions as states?
- Steering social activity: a stochastic optimal control point of view
- Toward Breaking the Curse of Dimensionality: An FPTAS for Stochastic Dynamic Programs with Multidimensional Actions and Scalar States
- Accelerating Benders decomposition for short-term hydropower maintenance scheduling
- On the convergence of reinforcement learning with Monte Carlo exploring starts
- LQG online learning
- Automatic Generation of FPTASes for Stochastic Monotone Dynamic Programs Made Easier
- Inference via low-dimensional couplings
- Quickest detection of deception attacks on cyber-physical systems with a parsimonious watermarking policy
- Stochastic event-based LQG control: an analysis on strict consistency
- Robust optimizers for nonlinear programming in approximate dynamic programming
- General value iteration based single network approach for constrained optimal controller design of partially-unknown continuous-time nonlinear systems
- Optimal search from multiple distributions with infinite horizon
- Coupling based estimation approaches for the average reward performance potential in Markov chains
- Stochastic output-feedback model predictive control
- An iterative approach to the optimal co-design of linear control systems
- Dynamic coordination games with activation costs
- Optimizing image quality
- Applied Dynamic Programming for Optimization of Dynamical Systems
- Primal-dual method for solving a linear-quadratic multi-input optimal control problem
- Analysis of the optimization landscape of Linear Quadratic Gaussian (LQG) control
- Bias-policy iteration based adaptive dynamic programming for unknown continuous-time linear systems
- Model-free \(H_\infty\) tracking control for de-oiling hydrocyclone systems via off-policy reinforcement learning
- Adaptive low-nonnegative-rank approximation for state aggregation of Markov chains
- Stochastic control liaisons. Richard Sinkhorn meets Gaspard Monge on a Schrödinger bridge
- Amplitude mean of functional data on \(\mathbb{S}^2\) and its accurate computation
- Off-policy learning for adaptive optimal output synchronization of heterogeneous multi-agent systems
- Risk-constrained reinforcement learning with percentile risk criteria
- Optimal operation of a grid-connected battery energy storage system over its lifetime
- Joint routing and scheduling control in a two-class network with a flexible server
- Reducing the Bullwhip effect in a supply chain network by application of optimal control theory
- Multiscale analysis and control of networks with fractal traffic
- Online inverse optimal control for control-constrained discrete-time systems on finite and infinite horizons
- Strongly polynomial FPTASes for monotone dynamic programs
- A finite time analysis of temporal difference learning with linear function approximation
- A generalization of Bellman's equation with application to path planning, obstacle avoidance and invariant set estimation
- Simplified risk-aware decision making with belief-dependent rewards in partially observable domains
- Reinforcement learning: an industrial perspective
- The joint transshipment and production control policies for multi-location production/inventory systems
- Riemannian fast-marching on cartesian grids, using Voronoi's first reduction of quadratic forms
- Optimizing DoS attack energy with imperfect acknowledgments and energy harvesting constraints in cyber-physical systems
- A network flow approach in finding maximum likelihood estimate of high concentration regions
- A variable projection method for large-scale inverse problems with \(\ell^1\) regularization
- Neural circuits for learning context-dependent associations of stimuli
- Shrinking-horizon dynamic programming
- Optimization Based Stabilization of Nonlinear Control Systems
- Algorithmic aspects of mean-variance optimization in Markov decision processes
- Multi-objective evolutionary optimization of biological pest control with impulsive dynamics in soybean crops
- Symmetry and antisymmetry properties of optimal solutions to regression problems
- A semi-Lagrangian scheme for a modified version of the Hughes' model for Pedestrian flow
- Discrete-review policies for scheduling stochastic networks: trajectory tracking and fluid-scale asymptotic optimality.
- Discovering hidden structure in factored MDPs
- Policy iteration type algorithms for recurrent state Markov decision processes
- Finite time identification in unstable linear systems
- Learning Markov models via low-rank optimization
- Parameter uncertainty and policy intensity: some extensions and suggestions for further work
- Revisiting dynamic programming for finding optimal subtrees in trees
- Optimal capture trajectories using multiple gravity assists
- Exponentially convergent receding horizon strategy for constrained optimal control
- Low earth orbit satellite based communication systems -- research opportunities
- Title not available
- Multi-sensor transmission power control for remote estimation through a SINR-based communication channel
- Maximum principle based algorithms for deep learning
- Optimization of stock trading with additional information by limit order book
- Dynamic journeying under uncertainty
- Optimizing Bernoulli routing policies for balancing loads on call centers and minimizing transmission costs
- Optimal control of chaotic systems via peak-to-peak maps
- Finding a simple polytope from its graph in polynomial time
- The linear quadratic regulator for periodic hybrid systems
- Age-based maintenance under population heterogeneity: optimal exploration and exploitation
- Randomized algorithms for the synthesis of cautious adaptive controllers
- Optimal placement of UV-based communications relay nodes
- Meta-control of an interacting-particle algorithm for global optimization
- An Approximation Approach for Response-Adaptive Clinical Trial Design
- Heuristics for planning with penalties and rewards formulated in logic and computed through circuits
- Immediate return preference emerged from a synaptic learning rule for return maximization
- Optimal control of a two-server flow-shop network
- Variance-penalized Markov decision processes: dynamic programming and reinforcement learning techniques
- Some operations research methods for analyzing protein sequences and structures
- A complete characterization of optimal dictionaries for least squares representation
- Connected cruise control with delayed feedback and disturbance: an adaptive dynamic programming approach
- Real-time dynamic programming for Markov decision processes with imprecise probabilities
- An overview for Markov decision processes in queues and networks