POMDPS
From MaRDI portal
Cited in
(45)- Probabilistic reasoning about epistemic action narratives
- Learning and planning in partially observable environments without prior domain knowledge
- scientific article; zbMATH DE number 5609256 (Why is no real title available?)
- Adaptive submodularity: theory and applications in active learning and stochastic optimization
- Partially observed Markov decision process multiarmed bandits-structural results
- Open problems in universal induction \& intelligence
- A Monte-Carlo AIXI approximation
- An investigation into mathematical programming for finite horizon decentralized POMDPS
- Optimal and approximate Q-value functions for decentralized POMDPS
- A POMDP framework for coordinated guidance of autonomous UAVs for multitarget tracking
- Planning to chronicle
- scientific article; zbMATH DE number 5547972 (Why is no real title available?)
- Monte Carlo sampling methods for approximating interactive POMDPS
- Qualitative analysis of partially-observable Markov decision processes
- An optimal educational policy with uncertain information
- An Approximation Approach for Response-Adaptive Clinical Trial Design
- R-MAX
- POMDP
- Approxrl
- POMDPs.jl
- DESPOT
- iSAM2
- The cross-entropy method for policy search in decentralized POMDPs
- An online multi-agent co-operative learning algorithm in POMDPs
- scientific article; zbMATH DE number 5495175 (Why is no real title available?)
- Planning and acting in partially observable stochastic domains
- Pomp++
- Recurrent policy gradients
- Stationary policies with Markov partition property
- Planning for multiple measurement channels in a continuous-state POMDP
- Optimal Threshold Policies for Multivariate Stopping-Time POMDPs
- Planning in partially-observable switching-mode continuous domains
- Planning in uncertain multiagent settings for the healthcare management of Parkinson's disease
- An educational management problem with continuous signal space
- Optimal online learning for nonlinear belief models using discrete priors
- Robotic manipulation of multiple objects as a POMDP
- Monte Carlo value iteration for continuous-state POMDPS
- Efficient planning under uncertainty with macro-actions
- Compatible natural gradient policy search
- Partially observable Markov decision process approximations for adaptive sensing
- Reasoning and predicting POMDP planning complexity via covering numbers
- A tutorial on partially observable Markov decision processes
- Simplified risk-aware decision making with belief-dependent rewards in partially observable domains
- The post-disaster debris clearance problem under incomplete information
- A unified framework for stochastic optimization
This page was built for software: POMDPS