Cited in (45)
- Simplified risk-aware decision making with belief-dependent rewards in partially observable domains
- Pomp++
- Optimal online learning for nonlinear belief models using discrete priors
- Qualitative analysis of partially-observable Markov decision processes
- An online multi-agent co-operative learning algorithm in POMDPs
- Recurrent policy gradients
- A tutorial on partially observable Markov decision processes
- Planning in uncertain multiagent settings for the healthcare management of Parkinson's disease
- Adaptive submodularity: theory and applications in active learning and stochastic optimization
- An educational management problem with continuous signal space
- A POMDP framework for coordinated guidance of autonomous UAVs for multitarget tracking
- Probabilistic reasoning about epistemic action narratives
- A Monte-Carlo AIXI approximation
- An Approximation Approach for Response-Adaptive Clinical Trial Design
- Planning for multiple measurement channels in a continuous-state POMDP
- Efficient planning under uncertainty with macro-actions
- Optimal and approximate Q-value functions for decentralized POMDPs
- Open problems in universal induction & intelligence
- scientific article; zbMATH DE number 5609256
- R-MAX
- POMDP
- Approxrl
- An optimal educational policy with uncertain information
- An investigation into mathematical programming for finite horizon decentralized POMDPs
- A unified framework for stochastic optimization
- POMDPs.jl
- DESPOT
- iSAM2
- The cross-entropy method for policy search in decentralized POMDPs
- Planning in partially-observable switching-mode continuous domains
- Monte Carlo value iteration for continuous-state POMDPs
- Learning and planning in partially observable environments without prior domain knowledge
- Partially observed Markov decision process multiarmed bandits-structural results
- Reasoning and predicting POMDP planning complexity via covering numbers
- Stationary policies with Markov partition property
- scientific article; zbMATH DE number 5495175
- Compatible natural gradient policy search
- Partially observable Markov decision process approximations for adaptive sensing
- The post-disaster debris clearance problem under incomplete information
- Optimal Threshold Policies for Multivariate Stopping-Time POMDPs
- Monte Carlo sampling methods for approximating interactive POMDPs
- Robotic manipulation of multiple objects as a POMDP
- Planning and acting in partially observable stochastic domains
- scientific article; zbMATH DE number 5547972
- Planning to chronicle
This page was built for software: POMDPS