Planning and acting in partially observable stochastic domains
From MaRDI portal
(Redirected from Publication:72343)
Recommendations
- Planning in partially-observable switching-mode continuous domains
- scientific article; zbMATH DE number 5547912
- Strong planning under partial observability
- Learning and planning in partially observable environments without prior domain knowledge
- Replanning in domains with partial information and sensing actions
- Randomized belief-space replanning in partially-observable continuous spaces
Cites work
- scientific article; zbMATH DE number 3148886 (Why is no real title available?)
- scientific article; zbMATH DE number 4089320 (Why is no real title available?)
- scientific article; zbMATH DE number 700091 (Why is no real title available?)
- scientific article; zbMATH DE number 1095138 (Why is no real title available?)
- scientific article; zbMATH DE number 795587 (Why is no real title available?)
- A survey of algorithmic methods for partially observed Markov decision processes
- A survey of solution techniques for the partially observed Markov decision process
- Application of Jensen's inequality to adaptive suboptimal design
- Fast planning through planning graph analysis
- OPTIMAL CONTROL FOR PARTIALLY OBSERVABLE MARKOV DECISION PROCESSES OVER AN INFINITE HORIZON
- Optimal control of Markov processes with incomplete state information
- Solution Procedures for Partially Observed Markov Decision Processes
- Solving H-horizon, stationary Markov decision problems in time proportional to log (H)
- State of the Art—A Survey of Partially Observable Markov Decision Processes: Theory, Models, and Algorithms
- The Optimal Control of Partially Observable Markov Processes over a Finite Horizon
- The Optimal Control of Partially Observable Markov Processes over the Infinite Horizon: Discounted Costs
- The Optimal Search for a Moving Target When the Search Path Is Constrained
- The complexity of mean payoff games on graphs
- The complexity of stochastic games
Cited in
(only showing first 100 items - show all)- Gradient-based mixed planning with symbolic and numeric action parameters
- Simplified risk-aware decision making with belief-dependent rewards in partially observable domains
- A conflict-directed approach to chance-constrained mixed logical linear programming
- Analyzing generalized planning under nondeterminism
- Permissive planning: Extending classical planning to uncertain task domains.
- Computation of weighted sums of rewards for concurrent MDPs
- A survey of inverse reinforcement learning: challenges, methods and progress
- POMDP planning for robust robot control
- Partially observable multistage stochastic programming
- Knowledge-based programs as succinct policies for partially observable domains
- Soft rumor control in mobile instant messengers
- Enforcing almost-sure reachability in POMDPs
- A dynamic epistemic framework for reasoning about conformant probabilistic plans
- Probabilistic reasoning about epistemic action narratives
- Finite-horizon LQR controller for partially-observed Boolean dynamical systems
- Optimizing active surveillance for prostate cancer using partially observable Markov decision processes
- Markov decision processes with sequential sensor measurements
- A sufficient statistic for influence in structured multiagent environments
- A logic for specifying stochastic actions and observations
- Deliberative acting, planning and learning with hierarchical operational models
- Algorithms and conditional lower bounds for planning problems
- Representation and Timing in Theories of the Dopamine System
- Minimax real-time heuristic search
- Planning and control in artificial intelligence: A unifying perspective
- Induction and exploitation of subgoal automata for reinforcement learning
- Geometric backtracking for combined task and motion planning in robotic systems
- Affect control processes: intelligent affective interaction using a partially observable Markov decision process
- scientific article; zbMATH DE number 4166888 (Why is no real title available?)
- Counterexample-guided inductive synthesis for probabilistic systems
- scientific article; zbMATH DE number 7370547 (Why is no real title available?)
- Large-scale financial planning via a partially observable stochastic dual dynamic programming framework
- Gradient-descent for randomized controllers under partial observability
- A Markovian model for the spread of the SARS-CoV-2 virus
- Computer Vision - ECCV 2004
- Dynamic optimization over infinite-time horizon: web-building strategy in an orb-weaving spider as a case study
- Integration of reinforcement learning and optimal decision-making theories of the basal ganglia
- A Fenchel-Moreau-Rockafellar type theorem on the Kantorovich-Wasserstein space with applications in partially observable Markov decision processes
- Integration of AI and OR Techniques in Constraint Programming for Combinatorial Optimization Problems
- Tutorial series on brain-inspired computing. IV: Reinforcement learning: machine learning and natural learning
- A semi-Markov decision model for recognizing the destination of a maneuvering agent in real time strategy games
- Reinforcement learning with limited reinforcement: using Bayes risk for active learning in POMDPs
- scientific article; zbMATH DE number 7566077 (Why is no real title available?)
- Simultaneous learning and planning in a hierarchical control system for a cognitive agent
- Learning to steer nonlinear interior-point methods
- Partially observable environment estimation with uplift inference for reinforcement learning based recommendation
- Approximability and efficient algorithms for constrained fixed-horizon POMDPs with durative actions
- Reasoning and predicting POMDP planning complexity via covering numbers
- An integrated approach to solving influence diagrams and finite-horizon partially observable decision processes
- Multi-goal motion planning using traveling salesman problem in belief space
- Evidential Markov decision processes
- Privacy stochastic games in distributed constraint reasoning
- A reinforcement learning scheme for a partially-observable multi-agent game
- Quantitative controller synthesis for consumption Markov decision processes
- A reinforcement learning scheme for a partially-observable multi-agent game
- Meeting a deadline: shortest paths on stochastic directed acyclic graphs with information gathering
- Myopic bounds for optimal policy of POMDPs: an extension of lovejoy's structural results
- Robust almost-sure reachability in multi-environment MDPs
- Partially observable game-theoretic agent programming in Golog
- Computing rank dependent utility in graphical models for sequential decision problems
- An evidential approach to SLAM, path planning, and active exploration
- A tutorial on partially observable Markov decision processes
- Probabilistic Reasoning by SAT Solvers
- Recognizing and learning models of social exchange strategies for the regulation of social interactions in open agent societies
- scientific article; zbMATH DE number 1509479 (Why is no real title available?)
- scientific article; zbMATH DE number 1728768 (Why is no real title available?)
- Group sparse optimization for learning predictive state representations
- scientific article; zbMATH DE number 2243394 (Why is no real title available?)
- Performance prediction of an unmanned airborne vehicle multi-agent system
- From knowledge-based programs to graded belief-based programs. I: On-line reasoning
- State observation accuracy and finite-memory policy performance
- Goal-directed learning of features and forward models
- Probabilistic may/must testing: retaining probabilities by restricted schedulers
- Bounded-parameter partially observable Markov decision processes: framework and algorithm
- Computer science and decision theory
- scientific article; zbMATH DE number 5547961 (Why is no real title available?)
- scientific article; zbMATH DE number 2243378 (Why is no real title available?)
- scientific article; zbMATH DE number 2243398 (Why is no real title available?)
- Integrated common sense learning and planning in POMDPs
- Randomized belief-space replanning in partially-observable continuous spaces
- The Concept of Opposition and Its Use in Q-Learning and Q(λ) Techniques
- Partial-order planning with concurrent interacting actions
- Conformant plans and beyond: principles and complexity
- Solving for Best Responses and Equilibria in Extensive-Form Games with Reinforcement Learning Methods
- DESPOT: online POMDP planning with regularization
- Multi-stage classifier design
- Planning for multiple measurement channels in a continuous-state POMDP
- A Bayesian approach for learning and planning in partially observable Markov decision processes
- Bottom-up learning of hierarchical models in a class of deterministic pomdp environments
- Testing probabilistic equivalence through reinforcement learning
- Policy iteration for bounded-parameter POMDPs
- Open problems in universal induction \& intelligence
- Exact decomposition approaches for Markov decision processes: a survey
- Optimal cost almost-sure reachability in POMDPs
- Autonomous agents modelling other agents: a comprehensive survey and open problems
- Using machine learning for decreasing state uncertainty in planning
- Planning in partially-observable switching-mode continuous domains
- Optimal speech motor control and token-to-token variability: a Bayesian modeling approach
- Learning and planning in partially observable environments without prior domain knowledge
- An affective mobile robot educator with a full-time job
- Optimal decision rules in repeated games where players infer an opponent's mind via simplified belief calculation
This page was built for publication: Planning and acting in partially observable stochastic domains
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q72343)