scientific article

Publication:3245701

zbMATH: 0078.34101, MaRDI QID: Q3245701

Richard Bellman

Publication date: 1957


Title: unavailable (zbMATH Open Web Interface contents withheld due to conflicting licenses).



Related Items (max. 100)

Deep reinforcement learning in finite-horizon to explore the most probable transition pathway
Certified reinforcement learning with logic guidance
Mixed nondeterministic-probabilistic automata: blending graphical probabilistic models with nondeterminism
On how to exploit a population given by a difference equation with random parameters
A denotational semantics for low-level probabilistic programs with nondeterminism
Computational aspects in applied stochastic control
Preference change
A novel state-transition forest: pricing corporate securities with intertemporal exercise policies and corresponding capital structure changes
A methodology for computation reduction for specially structured large scale Markov decision problems
Unnamed Item
A unified DC programming framework and efficient DCA based approaches for large scale batch reinforcement learning
Design and evaluation of norm-aware agents based on normative Markov decision processes
On the average cost optimality equation and the structure of optimal policies for partially observable Markov decision processes
Restricted gradient-descent algorithm for value-function approximation in reinforcement learning
Dynamic diagnostic and decision procedures under uncertainty
Model Checking Linear-Time Properties of Probabilistic Systems
Stable sequential control rules and Markov chains
Markovian sequential control processes. Denumerable state space
Finite-Memory Strategies in POMDPs with Long-Run Average Objectives
The browser war -- analysis of Markov perfect equilibrium in markets with dynamic demand effects
Computing Behavioral Relations for Probabilistic Concurrent Systems
Computing semi-stationary optimal policies for multichain semi-Markov decision processes
Reinforcement learning for combinatorial optimization: a survey
Unnamed Item
SHORTFALL RISK MINIMIZATION UNDER FIXED TRANSACTION COSTS
Optimal strategies in the fighting fantasy gaming system: influencing stochastic dynamics by gambling with limited resource
Dynamic dispatching and repositioning policies for fast-response service networks
A human-robot collaborative reinforcement learning algorithm
Value-Gradient Based Formulation of Optimal Control Problem and Machine Learning Algorithm
Optimal management of stochastic invasion in a metapopulation with Allee effects
A Markovian decision model of adaptive cancer treatment and quality of life
Algebraic optimization of sequential decision problems
Pricing tenure payment reverse mortgages with optimal exercised prepayment options by accounting for house prices, interest rates, and mortality risk
Approximate Newton Policy Gradient Algorithms
Quantitative controller synthesis for consumption Markov decision processes
Human-cyber-physical automata and their synthesis
The method of value oriented successive approximations for the average reward Markov decision process
Unnamed Item
A survey of average cost problems in deterministic discrete-time control systems
Unnamed Item
Closed-loop supply chain inventory management with recovery information of reusable containers
Unnamed Item
Polynomial Approximation of High-Dimensional Hamilton--Jacobi--Bellman Equations and Applications to Feedback Control of Semilinear Parabolic PDEs
An intelligent choice of witnesses in the Miller-Rabin primality test. Reinforcement learning approach
Reinforcement learning for optimal error correction of toric codes
Value set iteration for Markov decision processes
Unnamed Item
OL-DEC-MDP model for multiagent online scheduling with a time-dependent probability of success
SLAP: specification logic of actions with probability
Control: a perspective
Unnamed Item
Lexicographic refinements in stationary possibilistic Markov decision processes
OPTIMALLY REPLACING MULTIPLE SYSTEMS IN A SHARED ENVIRONMENT
A review of operations research models in invasive species management: state of the art, challenges, and future directions
Unnamed Item
Dynamic lookahead policies for stochastic-dynamic inventory routing in bike sharing systems
Stochastic finite-state systems in control theory
Unnamed Item
Meta-modeling game for deriving theory-consistent, microstructure-based traction-separation laws via deep reinforcement learning
Structures and methods of dynamical decision-making
Solutions of the average cost optimality equation for Markov decision processes with weakly continuous kernel: the fixed-point approach revisited
A review on deep reinforcement learning for fluid mechanics
Quantitative model-checking of controlled discrete-time Markov processes
Control of chaotic systems by deep reinforcement learning
Probabilistic timed graph transformation systems
Dynamic journeying under uncertainty
Engineering constraint solvers for automatic analysis of probabilistic hybrid automata
An optimality principle for Markovian decision processes
Solving stochastic dynamic programming problems by linear programming — An annotated bibliography
Pursuit of food versus pursuit of information in a Markovian perception-action loop model of foraging
Unnamed Item
Contraction mappings underlying undiscounted Markov decision problems
Stochastic revision opportunities in Markov decision problems
Elaboration Tolerant Representation of Markov Decision Process via Decision-Theoretic Extension of Probabilistic Action Language +
Discrete Dividend Payments in Continuous Time
Conditional Probabilities over Probabilistic and Nondeterministic Systems
Unnamed Item
The optimization of K-effect models by linear and dynamic programming
Linear programming considerations on Markovian decision processes with no discounting
Strong Uniform Value in Gambling Houses and Partially Observable Markov Decision Processes
Recomposable restricted finite state machines: definition and solution approaches
Dynamic programming and optimal control of variable multichannel stochastic service systems with applications
Linear programming algorithms for semi-Markovian decision processes
MAXIMIZING THE GROWTH RATE UNDER RISK CONSTRAINTS
Unnamed Item
On a set of optimal policies in continuous time Markovian decision problem
New classes of stochastic control processes
Stage-\(t\) scenario dominance for risk-averse multi-stage stochastic mixed-integer programs
History-dependent Evaluations in Partially Observable Markov Decision Process
Ultimate precision of joint parameter estimation under noisy Gaussian environment
Belief base contraction by belief accrual
Functional equations in the theory of dynamic programming. XI: Limit theorems
On the solvability of Bellman's functional equation for a Markovian decision process
Learning with policy prediction in continuous state-action multi-agent decision processes
Solving sequential collective decision problems under qualitative uncertainty
Constrained Multiagent Markov Decision Processes: a Taxonomy of Problems and Algorithms
VPint: value propagation-based spatial interpolation
Optimal and near-optimal incentive strategies in the hierarchical control of Markov chains
Quantifying quantum correlations in noisy Gaussian channels
Explainable dynamic programming



