scientific article; zbMATH DE number 1095138

From MaRDI portal
Publication:4368722

zbMath0904.90170MaRDI QIDQ4368722

Dimitri P. Bertsekas

Publication date: 7 December 1997


Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.



Related Items

Generalized maximum entropy estimationStabilising quasi-time-optimal nonlinear model predictive control with variable discretisationSTOCHASTIC MODEL PREDICTIVE CONTROL AND PORTFOLIO OPTIMIZATIONA MEAN FIELD GAME ANALYSIS OF SIR DYNAMICS WITH VACCINATIONComputational Benefits of Intermediate Rewards for Goal-Reaching Policy LearningWhat is the value of the cross-sectional approach to deep reinforcement learning?State-Variable Modeling for a Class of Two-Stage Stochastic Optimization ProblemsQuantile Markov Decision ProcessesDual Control and Online Optimal Experimental DesignAnalysis of the optimization landscape of Linear Quadratic Gaussian (LQG) controlOptimal operation of a grid‐connected battery energy storage system over its lifetimeA Lyapunov characterization of robust policy optimizationA variable projection method for large-scale inverse problems with \(\ell^1\) regularizationENDOGENOUS SOCIAL NETWORKS AND INEQUALITY IN AN INTERGENERATIONAL SETTINGLong-term dynamic asset allocation under asymmetric risk preferencesMulti-agent off-policy actor-critic algorithm for distributed multi-task reinforcement learningLearning Markov Models Via Low-Rank OptimizationGUBS criterion: arbitrary trade-offs between cost and probability-to-goal in stochastic planning based on expected utility theoryOn the optimization of pit stop strategies via dynamic programmingSolving nonlinear and dynamic programming equations on extended \(b\)-metric spaces with the fixed-point techniqueAn optimal control approach to particle filteringDistributed output data-driven optimal robust synchronization of heterogeneous multi-agent systemsMulti-sourcing under supply uncertainty and buyer's risk aversionOn the sample complexity of actor-critic method for reinforcement learning with function approximationRobust regulation of discrete-time systems subject to parameter uncertainties and state delayAdaptive event-triggered actor-critic algorithm for optimal 3D formation circumnavigation with relative measurement and an unknown moving targetMetalearning of time series: an approximate dynamic programming approachMulti-agent natural actor-critic reinforcement learning algorithmsOptimal transmission scheduling for remote state estimation in CPSs with energy harvesting two-hop relay networksA Stochastic Composite Augmented Lagrangian Method for Reinforcement LearningQuickest detection of deception attacks on cyber-physical systems with a parsimonious watermarking policyUnnamed ItemModel‐free optimal tracking over finite horizon using adaptive dynamic programmingStealthy switching attacks on sensors against state estimation in cyber‐physical systemsReinforcement learning based optimal synchronization control for multi-agent systems with input constraints using vanishing viscosity methodRandomized Linear Programming Solves the Markov Decision Problem in Nearly Linear (Sometimes Sublinear) TimeEasy Affine Markov Decision ProcessesDeadlines, Offer Timing, and the Search for AlternativesAn average optimal control approach to the set stabilization problem for boolean control networksLQG Online LearningA New Approach to Real-Time Bidding in Online Advertisements: Auto Pricing StrategyAn Approximation Approach for Response-Adaptive Clinical Trial DesignSome operations research methods for analyzing protein sequences and structuresDeterministic mean-variance-optimal consumption and investmentVariance-penalized Markov decision processes: dynamic programming and reinforcement learning techniquesLAO*: A heuristic search algorithm that finds solutions with loopsMultiscale analysis and control of networks with fractal trafficDerivatives of Logarithmic Stationary Distributions for Policy Gradient Reinforcement LearningImprovements and Generalizations of Stochastic Knapsack and Markovian Bandits Approximation AlgorithmsRobust Dynamic Pricing with Strategic CustomersAdaptive Low-Nonnegative-Rank Approximation for State Aggregation of Markov ChainsConnected cruise control with delayed feedback and disturbance: An adaptive dynamic programming approachUnnamed ItemRiemannian Fast-Marching on Cartesian Grids, Using Voronoi's First Reduction of Quadratic FormsUnnamed ItemOnline optimal and adaptive integral tracking control for varying discrete‐time systems using reinforcement learningUnnamed ItemA Finite Time Analysis of Temporal Difference Learning with Linear Function ApproximationAutomatic Generation of FPTASes for Stochastic Monotone Dynamic Programs Made EasierOn the correctness of monadic backward inductionImpedance adaptation for optimal robot–environment interactionOptimal Energy Shaping via Neural ApproximatorsModel-based Reinforcement Learning: A SurveyDiscrete-review policies for scheduling stochastic networks: trajectory tracking and fluid-scale asymptotic optimality.Approximation properties of receding horizon optimal controlSome applications of polynomial optimization in operations research and real-time decision makingRiemannian optimization for registration of curves in elastic shape analysisAn incremental off-policy search in a model-free Markov decision process using a single sample pathFinite-horizon LQR controller for partially-observed Boolean dynamical systemsPolicy iteration type algorithms for recurrent state Markov decision processesMulti-objective evolutionary optimization of biological pest control with impulsive dynamics in soybean cropsContinuous lunches are free plus the design of optimal optimization algorithmsResponse-adaptive designs for clinical trials: simultaneous learning from multiple patientsParameter uncertainty and policy intensity: some extensions and suggestions for further workDecentralized stochastic controlRevisiting dynamic programming for finding optimal subtrees in treesA quantity flexibility contract model for a system with heterogeneous suppliersNumerical methods for the pricing of swing options: a stochastic control approachMulti-period mean-variance portfolio optimization based on Monte-Carlo simulationStrategy improvement for concurrent reachability and turn-based stochastic safety gamesGeneral value iteration based single network approach for constrained optimal controller design of partially-unknown continuous-time nonlinear systemsThe \((S,s)\) policy is an optimal trading strategy in a class of commodity price speculation problemsEfficient output solution for nonlinear stochastic optimal control problem with model-reality differencesNonlinear protocols for optimal distributed consensus in networks of dynamic agentsApproximate robust dynamic programming and robustly stable MPCOptimal placement of UV-based communications relay nodesSymmetry and antisymmetry properties of optimal solutions to regression problemsIntegrated topology optimization and optimal control for vibration suppression in structural designMeta-control of an interacting-particle algorithm for global optimizationSafety verification for probabilistic hybrid systemsImmediate return preference emerged from a synaptic learning rule for return maximizationOptimal control of a two-server flow-shop networkReputation in the long-run with imperfect monitoringVariance-constrained actor-critic algorithms for discounted and average reward MDPsReal-time dynamic programming for Markov decision processes with imprecise probabilitiesA semi-Lagrangian scheme for a modified version of the Hughes' model for Pedestrian flowJoint routing and scheduling control in a two-class network with a flexible serverThe single-server scheduling problem with convex costsDynamic programming and viscosity solutions for the optimal control of quantum spin systemsFinite time identification in unstable linear systemsReinforcement learning for a class of continuous-time input constrained optimal control problemsDepth-based short-sighted stochastic shortest path problemsThe joint transshipment and production control policies for multi-location production/inventory systemsRisk pooling strategy in a multi-echelon supply chain with price-sensitive demandInfinite horizon optimal policy for an inventory system with two types of product sharing common hardware platformsA unified approach to Markov decision problems and performance sensitivity analysisMulti-sensor transmission power control for remote estimation through a SINR-based communication channelDynamic mechanism design with interdependent valuationsAnalyzing anonymity attacks through noisy channelsMulti-period risk sharing under financial fairnessPareto efficiency of finite horizon switched linear quadratic differential gamesOptimal energy allocation for linear control with packet loss under energy harvesting constraintsStochastic scheduling in an in-forestFinite-horizon inverse optimal control for discrete-time nonlinear systemsBatch repair actions for automated troubleshootingSymbolic optimal expected time reachability computation and controller synthesis for probabilistic timed automataDelay-optimal scheduling for two-hop relay networks with randomly varying connectivity: join the shortest queue-longest connected queue policyA network flow approach in finding maximum likelihood estimate of high concentration regionsA linear-quadratic Gaussian approach to dynamic information acquisitionAssortment planning with nested preferences: dynamic programming with distributions as states?Set-membership estimations for the evolution of infectious diseases in heterogeneous populationsDiscovering hidden structure in factored MDPsPlanning and acting in partially observable stochastic domainsGenerative models for functional data using phase and amplitude separationConformant plans and beyond: principles and complexityBeam-ACO--hybridizing ant colony optimization with beam search: an application to open shop schedulingSampled fictitious play for approximate dynamic programmingA numerical method for hybrid optimal control based on dynamic programmingError estimation and adaptive discretization for the discrete stochastic Hamilton-Jacobi-Bellman equationOn pricing of multiple bundles of products and servicesBasic ideas for event-based optimization of Markov systemsOn infinite horizon active fault diagnosis for a class of non-linear non-Gaussian systemsOptimal capture trajectories using multiple gravity assistsAccelerating Benders decomposition for short-term hydropower maintenance schedulingSensitivity analysis and optimal ultimately stationary deterministic policies in some constrained discounted cost modelsOptimal search from multiple distributions with infinite horizonCoupling based estimation approaches for the average reward performance potential in Markov chainsStabilization of strictly dissipative discrete time systems with discounted optimal controlStochastic output-feedback model predictive controlFinding a simple polytope from its graph in polynomial timeEfficient blind search: optimal power of detection under computational cost constraintsOptimal synchronization control of multiple Euler-Lagrange systems via event-triggered reinforcement learningA survey on metaheuristics for stochastic combinatorial optimizationDynamic coordination games with activation costsA benders squared \((B^2)\) framework for infinite-horizon stochastic linear programsBias-policy iteration based adaptive dynamic programming for unknown continuous-time linear systemsModel-free \(H_\infty\) tracking control for de-oiling hydrocyclone systems via off-policy reinforcement learningThe interacting-particle algorithm with dynamic heating and coolingLow earth orbit satellite based communication systems -- research opportunitiesSingle sample path-based optimization of Markov chainsOptimizing Bernoulli routing policies for balancing loads on call centers and minimizing transmission costsStrongly polynomial FPTASes for monotone dynamic programsA dynamic game formulation for control of opinion dynamics over social networksOptimal control of chaotic systems via peak-to-peak mapsSimplified risk-aware decision making with belief-dependent rewards in partially observable domainsReinforcement learning: an industrial perspectiveAdaptive optimal output tracking of continuous-time systems via output-feedback-based reinforcement learningAmplitude mean of functional data on \(\mathbb{S}^2\) and its accurate computationOptimal cost almost-sure reachability in POMDPsRobustness of performance and stability for multistep and updated multistep MPC schemesTool path optimization of selective laser sintering processes using deep learningPeril, prudence and planning as risk, avoidance and worryStriped parameterized tube model predictive controlOptimal inventory control with fixed ordering cost for selling by Internet auctionsRobust and reliable portfolio optimization formulation of a chance constrained problemHomotopic policy iteration-based learning design for unknown linear continuous-time systemsStochastic event-based LQG control: an analysis on strict consistencyLevenberg-Marquardt method for identifying Young's modulus of the elasticity imaging inverse problemA partial history of the early development of continuous-time nonlinear stochastic systems theoryRevenue management for operations with urgent ordersStochastic output feedback MPC with intermittent observationsAge-based maintenance under population heterogeneity: optimal exploration and exploitationLearning classifier systems: a surveyHeuristics for planning with penalties and rewards formulated in logic and computed through circuitsOptimizing Image QualityOptimal allocation of heterogeneous resources in cooperative control scenariosSelf-triggered control of probabilistic Boolean control networks: a reinforcement learning approachOptimal control of a queue under a quality-of-service constraint with bounded and unbounded ratesRemote state estimation with usage-dependent Markovian packet lossesThe linear quadratic regulator for periodic hybrid systemsOptimizing DoS attack energy with imperfect acknowledgments and energy harvesting constraints in cyber-physical systemsMarkov Reward Models and Markov Decision Processes in Discrete and Continuous Time: Performance Evaluation and OptimizationPrimal-dual method for solving a linear-quadratic multi-input optimal control problemOptimization of stock trading with additional information by limit order bookA generalization of Bellman's equation with application to path planning, obstacle avoidance and invariant set estimationOn the convergence of reinforcement learning with Monte Carlo exploring startsOn the usefulness of set-membership estimation in the epidemiology of infectious diseasesDynamic marketing policies with rating-sensitive consumers: a mean-field games approachSymbolic Minimum Expected Time Controller Synthesis for Probabilistic Timed AutomataNeural circuits for learning context-dependent associations of stimuliBias optimality of admission control in a non-stationary repairable queueA survey of numerical solutions for stochastic control problems: some recent progressInput perturbations for adaptive control and learningOn adaptive linear-quadratic regulatorsDifferential-game for resource aware approximate optimal control of large-scale nonlinear systems with multiple playersImproved value iteration for neural-network-based stochastic optimal control designRobust min-max optimal control design for systems with uncertain models: a neural dynamic programming approachDifferential stability of discrete optimal control problems with possibly nondifferentiable costsA complete characterization of optimal dictionaries for least squares representationDesigning higher value roads to preserve species at risk by optimally controlling traffic flowAction selection in growing state spaces: control of network structure growthThe Joint Stock and Capacity Rationings of a Make-To-Stock System with Flexible DemandOnline inverse optimal control for control-constrained discrete-time systems on finite and infinite horizonsDynamic games with strategic complements and large number of playersOff-policy learning for adaptive optimal output synchronization of heterogeneous multi-agent systemsA penalty function-based greedy diffusion search algorithm for the optimization of constrained nonlinear dynamical processes with discrete-valued inputDetection-averse optimal and receding-horizon control for Markov decision processesInvestment Decisions Under Uncertainty Using Stochastic Dynamic Programming: A Case Study of Wind PowerPolicy iteration based feedback controlOptimal stopping in infinite horizon: an eigenfunction expansion approachControl and Systems Theory for Advanced ManufacturingToward Breaking the Curse of Dimensionality: An FPTAS for Stochastic Dynamic Programs with Multidimensional Actions and Scalar StatesReducing the Bullwhip effect in a supply chain network by application of optimal control theoryDiscrete time dynamic multi-leader-follower games with stage-depending leaders under feedback informationEllipsoidal methods for dynamics and control. IJoint source-channel coding via model predictive controlAn Overview for Markov Decision Processes in Queues and NetworksINTEGRATED DECISION ON PRICING, PROMOTION AND INVENTORY MANAGEMENTMarkov control processes with randomized discounted costRobust Optimizers for Nonlinear Programming in Approximate Dynamic ProgrammingStatistical Modeling of Curves Using Shapes and Related FeaturesRisk-Constrained Reinforcement Learning with Percentile Risk CriteriaDynamic journeying under uncertaintyA tutorial on the cross-entropy methodBasis function adaptation in temporal difference reinforcement learningNonlinear optimal control of population systems: applications in ecosystemsThe Impact of Noise and Sampling Frequency on the Control of Peak-to-Peak DynamicsTrajectory Generation for Relative Guidance of Merging AircraftOn the introduction of an agile, temporary workforce into a tandem queueing systemUnnamed ItemUnnamed ItemUnnamed ItemRandomized algorithms for the synthesis of cautious adaptive controllersA set oriented approach to optimal feedback stabilizationExponentially Accurate Temporal Decomposition for Long-Horizon Linear-Quadratic Dynamic OptimizationConvergence of the standard RLS method andUDUTfactorisation of covariance matrix for solving the algebraic Riccati equation of the DLQR via heuristic approximate dynamic programmingOutput regulation of unknown linear systems using average cost reinforcement learningScheduling networked state estimators based on value of informationAn optimal stopping approach for the end-of-life inventory problemThe How and Why of Interactive Markov ChainsA moment and sum-of-squares extension of dual dynamic programming with application to nonlinear energy storage problemsPortfolio optimization under Solvency IIDynamic procurement management by reverse auctions with fixed setup costs and sales leversOptimization Based Stabilization of Nonlinear Control SystemsAn iterative approach to the optimal co-design of linear control systemsVariance Reduced Value Iteration and Faster Algorithms for Solving Markov Decision ProcessesExponentially convergent receding horizon strategy for constrained optimal controlA Riccati-based primal interior point solver for multistage stochastic programming ‐ extensionsComputational aspects of optimal strategic network diffusionPhase-Amplitude Separation and Modeling of Spherical TrajectoriesOptimal battery purchasing and charging strategy at electric vehicle battery swap stationsOptimal dictionary for least squares representationUnnamed ItemStochastic Control Liaisons: Richard Sinkhorn Meets Gaspard Monge on a Schrödinger BridgeCONTROL OF COMPLEX PEAK-TO-PEAK DYNAMICSOptimality of admission control in an MM∕1∕N queue with varying servicesPerishable inventory management and dynamic pricing using RFID technologyEfficient computation of time-bounded reachability probabilities in uniform continuous-time Markov decision processesModel checking discounted temporal propertiesAccelerating the convergence of value iteration by using partial transition functionsAlgorithmic aspects of mean-variance optimization in Markov decision processesOn the computational efficiency of catalyst accelerated coordinate descentOptimal sensor scheduling for hidden Markov model state estimation