scientific article; zbMATH DE number 1315585

From MaRDI portal
Publication:4256521

zbMath0918.68110MaRDI QIDQ4256521

T. Dean, Craig Boutilier, Steve Hanks

Publication date: 18 July 1999


Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.



Related Items (48)

Scheduling with timed automataHow to decide what to do?Anytime heuristic search for partial satisfaction planningTask decomposition on abstract states, for planning under nondeterminismPractical solution techniques for first-order MDPsSolving average cost Markov decision processes by means of a two-phase time aggregation algorithmStrong planning under uncertainty in domains with numerous but identical elements (a generic approach)Optimal speech motor control and token-to-token variability: a Bayesian modeling approachDesign and evaluation of norm-aware agents based on normative Markov decision processesGraphical models for imprecise probabilitiesQuantum physical symbol systemsComplexity results and algorithms for possibilistic influence diagramsComputational Benefits of Intermediate Rewards for Goal-Reaching Policy LearningOpen problems in universal induction \& intelligenceEfficient incremental planning and learning with multi-valued decision diagramsQuantitative controller synthesis for consumption Markov decision processesSolving an Infinite-Horizon Discounted Markov Decision Process by DC Programming and DCARanking policies in discrete Markov decision processesAbstraction and approximate decision-theoretic planning.Knows what it knows: a framework for self-aware learningAffect control processes: intelligent affective interaction using a partially observable Markov decision processReal-time dynamic programming for Markov decision processes with imprecise probabilitiesReasoning about discrete and continuous noisy sensors and effectors in dynamical systemsDetecting and repairing anomalous evolutions in noisy environments. Logic programming formalization and complexity resultsEfficient solutions to factored MDPs with imprecise transition probabilitiesComputing rank dependent utility in graphical models for sequential decision problemsAdaptive-resolution reinforcement learning with polynomial exploration in deterministic domainsUsing mathematical programming to solve factored Markov decision processes with imprecise probabilitiesRisk-sensitive multiagent decision-theoretic planning based on MDP and one-switch utility functionsUsing Machine Learning for Decreasing State Uncertainty in PlanningStrong planning under partial observabilityBridging the Gap between Reinforcement Learning and Knowledge Representation: A Logical Off- and On-Policy FrameworkCausal learning with Occam's razorComputer science and decision theoryRecursively modeling other agents for decision making: a research perspectiveWhat you should know about approximate dynamic programmingProbabilistic Reasoning by SAT SolversPartially observable Markov decision processes with imprecise parametersInfluence of modeling structure in probabilistic sequential decision problemsCompact and efficient encodings for planning in factored state and action spaces with learned binarized neural network transition modelsStochastic dynamic programming with factored representationsBounded-parameter Markov decision processesA Sufficient Statistic for Influence in Structured Multiagent EnvironmentsOn the undecidability of probabilistic planning and related stochastic optimization problemsWeak, strong, and strong cyclic planning via symbolic model checkingContingent planning under uncertainty via stochastic satisfiabilityEquivalence notions and model minimization in Markov decision processesSolving factored MDPs using non-homogeneous partitions


Uses Software



This page was built for publication: