scientific article

From MaRDI portal
Revision as of 20:14, 3 February 2024 by Import240129110113 (talk | contribs) (Created automatically from import240129110113)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

Publication:2925454

zbMath1298.90001MaRDI QIDQ2925454

Dimitri P. Bertsekas

Publication date: 22 October 2014


Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.



Related Items (only showing first 100 items - show all)

Distillation optimization: Parameterized relationship between feed flow rate of a steady‐state distillation column and heat duties of reboiler and condenserStability analysis of optimal control problems with time-dependent costsMarkov decision processes with burstiness constraintsA flow based formulation and a reinforcement learning based strategic oscillation for cross-dock door assignmentNonparametric learning for impulse control problems -- exploration vs. exploitationA Lyapunov-based version of the value iteration algorithm formulated as a discrete-time switched affine systemLinear quadratic optimal regulation for multiplicative noise systems with special terminal penaltyDissipativity in infinite horizon optimal control and dynamic programmingAn improved method for approximating the infinite-horizon value function of the discrete-time switched LQR problemOptimal periodic sensor scheduling for minimizing communication rate under LQG constraintCost-aware defense for parallel server systems against reliability and security failuresA note on the existence of optimal stationary policies for average Markov decision processes with countable statesA dynamic encryption-decryption scheme for replay attack detection in cyber-physical systemsCompromise policy for multi-stage stochastic linear programming: variance and bias reductionOptimal decision-making of mutual fund temporary borrowing problem via approximate dynamic programmingDDQN-based optimal targeted therapy with reversible inhibitors to combat the Warburg effectBellman filtering and smoothing for state-space modelsA Two-Time-Scale Stochastic Optimization Framework with Applications in Control and Reinforcement LearningOptimal guidance algorithms for parking search with reservationsContinuous‐time stochastic gradient descent for optimizing over the stationary distribution of stochastic differential equationsUnnamed ItemDynamic Programming Deconstructed: Transformations of the Bellman Equation and Computational EfficiencyLeast squares policy iteration with instrumental variables vs. direct policy search: comparison against optimal benchmarks using energy storageDistributional Robustness in Minimax Linear Quadratic Control with Wasserstein DistancePredictive stochastic programmingAdaptive park-and-ride choice on time-dependent stochastic multimodal transportation networkUndiscounted control policy generation for continuous-valued optimal control by approximate dynamic programmingMean-field Markov decision processes with common noise and open-loop controlsSolving average cost Markov decision processes by means of a two-phase time aggregation algorithmComputational approaches for mixed integer optimal control problems with indicator constraintsDesigning parsimonious scheduling policies for complex resource allocation systems through concurrency theoryContinuous-action planning for discounted infinite-horizon nonlinear optimal control with Lipschitz valuesEfficiently Breaking the Curse of Horizon in Off-Policy Evaluation with Double Reinforcement LearningLearning Manipulation Through Information DisseminationAn approximate dynamic programming approach for improving accuracy of lossy data compression by Bloom filtersHeuristics for dynamic and stochastic inventory-routingLeast squares approximate policy iteration for learning bid prices in choice-based revenue managementA model for equilibrium in some service-provider user-set interactionsTail Optimality and Preferences Consistency for Intertemporal Optimization ProblemsA general endogenous grid method for multi-dimensional models with non-convexities and constraintsTechnical Note—Static Pricing: Universal Guarantees for Reusable ResourcesDynamic portfolio choice: a simulation-and-regression approachStochastic decision diagramsFrom Infinite to Finite Programs: Explicit Error Bounds with Applications to Approximate Dynamic ProgrammingApproximate dynamic programming for the military inventory routing problemOptimal supervisory control with mean payoff objectives and under partial observationDeep reinforcement learning for wireless sensor scheduling in cyber-physical systemsStatistical learning for analysis of networked control systems over unknown channelsLeast-violating symbolic controller synthesis for safety, reachability and attractivity specificationsError bounds for stochastic shortest path problemsThroughput maximization of complex resource allocation systems through timed-continuous-Petri-net modelingAn optimal control approach to day-to-day congestion pricing for stochastic transportation networksFinding multiple Nash equilibria via machine learning-supported Gröbner basesDeep differentiable reinforcement learning and optimal tradingDynamic Learning and Decision Making via Basis Weight VectorsApproximate dynamic programming for lateral transshipment problems in multi-location inventory systemsThe stochastic shortest path problem: a polyhedral combinatorics perspectiveRegular Policies in Abstract Dynamic ProgrammingRisk-Sensitive Reinforcement Learning via Policy Gradient SearchMarkov models of policy support for technology transitionsOptimal multi-stage allocation of weapons to targets using adaptive dynamic programmingSelf-Reflective Model Predictive ControlUnnamed ItemAn approach to solving optimal control problems of nonlinear systems by introducing detail-reward mechanism in deep reinforcement learningRecursively feasible stochastic model predictive control using indirect feedbackSymmetry reduction for dynamic programmingBenchmarking a Scalable Approximate Dynamic Programming Algorithm for Stochastic Control of Grid-Level Energy StorageA Q-learning predictive control scheme with guaranteed stabilityMoving target search optimization -- a literature reviewMYOPIC POLICIES FOR NON-PREEMPTIVE SCHEDULING OF JOBS WITH DECAYING VALUELinear programming formulation for non-stationary, finite-horizon Markov decision process modelsLarge-scale unit commitment under uncertainty: an updated literature surveyPrecautionary replenishment in financially-constrained inventory systems subject to credit rollover risk and supply disruptionShape constraints in economics and operations researchNetwork-Based Approximate Linear Programming for Discrete OptimizationRisk-averse model predictive controlOptimal control of Boolean control networks with average cost: a policy iteration approachOn a multistage discrete stochastic optimization problem with stochastic constraints and nested samplingVariance minimization of parameterized Markov decision processesDynamical systems on weighted lattices: general theoryIs Temporal Difference Learning Optimal? An Instance-Dependent AnalysisUnnamed ItemOn finding the optimal BDD relaxationExit time risk-sensitive control for systems of cooperative agentsDecentralized and distributed active fault diagnosis: multiple model estimation algorithmsProximal algorithms and temporal difference methods for solving fixed point problemsAn intrinsic material tailoring approach for functionally graded axisymmetric hollow bodies under plane elasticityTutorial on risk neutral, distributionally robust and risk averse multistage stochastic programmingStochastic decomposition applied to large-scale hydro valleys managementDiscrete time optimal control with frequency constraints for non-smooth systemsA dynamic programming framework for optimal delivery time slot pricingChoosing a good toolkit. II: Bayes-rule based heuristicsExperiments with Tractable Feedback in Robotic Planning Under Uncertainty: Insights over a Wide Range of Noise RegimesConcentration bounds for temporal difference learning with linear function approximation: the case of batch data and uniform samplingPerformance Guarantees and Optimal Purification Decisions for Engineered ProteinsTechnical Note—Analysis of Scrip Systems: On an Open Question in Johnson et al. (2014)Data Uncertainty in Markov Chains: Application to Cost-Effectiveness Analyses of Medical InnovationsBayesian Exploration for Approximate Dynamic ProgrammingRecomposable restricted finite state machines: definition and solution approachesIntelligent Human–Robot Interaction Systems Using Reinforcement Learning and Neural Networks






This page was built for publication: