scientific article

From MaRDI portal
Revision as of 20:14, 3 February 2024 by Import240129110113 (talk | contribs) (Created automatically from import240129110113)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

Publication:2925454

zbMath1298.90001MaRDI QIDQ2925454

Dimitri P. Bertsekas

Publication date: 22 October 2014


Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.



Related Items (only showing first 100 items - show all)

Dynamic Programming Deconstructed: Transformations of the Bellman Equation and Computational EfficiencyLeast squares policy iteration with instrumental variables vs. direct policy search: comparison against optimal benchmarks using energy storageDistributional Robustness in Minimax Linear Quadratic Control with Wasserstein DistancePredictive stochastic programmingAdaptive park-and-ride choice on time-dependent stochastic multimodal transportation networkUndiscounted control policy generation for continuous-valued optimal control by approximate dynamic programmingMean-field Markov decision processes with common noise and open-loop controlsSolving average cost Markov decision processes by means of a two-phase time aggregation algorithmComputational approaches for mixed integer optimal control problems with indicator constraintsDesigning parsimonious scheduling policies for complex resource allocation systems through concurrency theoryContinuous-action planning for discounted infinite-horizon nonlinear optimal control with Lipschitz valuesEfficiently Breaking the Curse of Horizon in Off-Policy Evaluation with Double Reinforcement LearningLearning Manipulation Through Information DisseminationAn approximate dynamic programming approach for improving accuracy of lossy data compression by Bloom filtersHeuristics for dynamic and stochastic inventory-routingLeast squares approximate policy iteration for learning bid prices in choice-based revenue managementA model for equilibrium in some service-provider user-set interactionsTail Optimality and Preferences Consistency for Intertemporal Optimization ProblemsA general endogenous grid method for multi-dimensional models with non-convexities and constraintsTechnical Note—Static Pricing: Universal Guarantees for Reusable ResourcesDynamic portfolio choice: a simulation-and-regression approachStochastic decision diagramsFrom Infinite to Finite Programs: Explicit Error Bounds with Applications to Approximate Dynamic ProgrammingApproximate dynamic programming for the military inventory routing problemOptimal supervisory control with mean payoff objectives and under partial observationDeep reinforcement learning for wireless sensor scheduling in cyber-physical systemsStatistical learning for analysis of networked control systems over unknown channelsLeast-violating symbolic controller synthesis for safety, reachability and attractivity specificationsError bounds for stochastic shortest path problemsThroughput maximization of complex resource allocation systems through timed-continuous-Petri-net modelingAn optimal control approach to day-to-day congestion pricing for stochastic transportation networksFinding multiple Nash equilibria via machine learning-supported Gröbner basesDeep differentiable reinforcement learning and optimal tradingDynamic Learning and Decision Making via Basis Weight VectorsApproximate dynamic programming for lateral transshipment problems in multi-location inventory systemsThe stochastic shortest path problem: a polyhedral combinatorics perspectiveRegular Policies in Abstract Dynamic ProgrammingRisk-Sensitive Reinforcement Learning via Policy Gradient SearchMarkov models of policy support for technology transitionsOptimal multi-stage allocation of weapons to targets using adaptive dynamic programmingSelf-Reflective Model Predictive ControlUnnamed ItemAn approach to solving optimal control problems of nonlinear systems by introducing detail-reward mechanism in deep reinforcement learningRecursively feasible stochastic model predictive control using indirect feedbackSymmetry reduction for dynamic programmingBenchmarking a Scalable Approximate Dynamic Programming Algorithm for Stochastic Control of Grid-Level Energy StorageA Q-learning predictive control scheme with guaranteed stabilityMoving target search optimization -- a literature reviewMYOPIC POLICIES FOR NON-PREEMPTIVE SCHEDULING OF JOBS WITH DECAYING VALUELinear programming formulation for non-stationary, finite-horizon Markov decision process modelsLarge-scale unit commitment under uncertainty: an updated literature surveyPrecautionary replenishment in financially-constrained inventory systems subject to credit rollover risk and supply disruptionShape constraints in economics and operations researchNetwork-Based Approximate Linear Programming for Discrete OptimizationRisk-averse model predictive controlOptimal control of Boolean control networks with average cost: a policy iteration approachOn a multistage discrete stochastic optimization problem with stochastic constraints and nested samplingVariance minimization of parameterized Markov decision processesDynamical systems on weighted lattices: general theoryIs Temporal Difference Learning Optimal? An Instance-Dependent AnalysisUnnamed ItemOn finding the optimal BDD relaxationExit time risk-sensitive control for systems of cooperative agentsDecentralized and distributed active fault diagnosis: multiple model estimation algorithmsProximal algorithms and temporal difference methods for solving fixed point problemsAn intrinsic material tailoring approach for functionally graded axisymmetric hollow bodies under plane elasticityTutorial on risk neutral, distributionally robust and risk averse multistage stochastic programmingStochastic decomposition applied to large-scale hydro valleys managementDiscrete time optimal control with frequency constraints for non-smooth systemsA dynamic programming framework for optimal delivery time slot pricingChoosing a good toolkit. II: Bayes-rule based heuristicsExperiments with Tractable Feedback in Robotic Planning Under Uncertainty: Insights over a Wide Range of Noise RegimesConcentration bounds for temporal difference learning with linear function approximation: the case of batch data and uniform samplingPerformance Guarantees and Optimal Purification Decisions for Engineered ProteinsTechnical Note—Analysis of Scrip Systems: On an Open Question in Johnson et al. (2014)Data Uncertainty in Markov Chains: Application to Cost-Effectiveness Analyses of Medical InnovationsBayesian Exploration for Approximate Dynamic ProgrammingRecomposable restricted finite state machines: definition and solution approachesIntelligent Human–Robot Interaction Systems Using Reinforcement Learning and Neural NetworksGradient-bounded dynamic programming for submodular and concave extensible value functions with probabilistic performance guaranteesTransmission scheduling for multi-process multi-sensor remote estimation via approximate dynamic programmingUnnamed ItemUnnamed ItemDecision programming for mixed-integer multi-stage optimization under uncertaintyOptimal stopping for response-guided dosingAn application of approximate dynamic programming in multi-period multi-product advertising budgetingRobust shortest path planning and semicontractive dynamic programmingData-driven control of hydraulic servo actuator based on adaptive dynamic programmingFundamental design principles for reinforcement learning algorithmsBounded rationality in learning, perception, decision-making, and stochastic gamesIncremental constraint projection methods for variational inequalitiesStable Optimal Control and Semicontractive Dynamic ProgrammingLinear controller design for chance constrained systemsAllocating resources via price management systems: a dynamic programming-based approachScenario-based, closed-loop model predictive control with application to emergency vehicle schedulingUnnamed ItemLarge-scale unit commitment under uncertaintyStrategic stiffening/cooling in the Ising gameInertial-type incremental constraint projection method for solving variational inequalities without Lipschitz continuityDistillation optimization: Parameterized relationship between feed flow rate of a steady‐state distillation column and heat duties of reboiler and condenser




This page was built for publication: