| Publication | Venue | Date of Publication | Type |
|---|---|---|---|
| Low-Rank Representation of Reinforcement Learning Policies | Journal of Artificial Intelligence Research | 2023-01-09 | Paper |
| Estimating causal effects with optimization-based methods: a review and empirical comparison | European Journal of Operational Research | 2022-09-09 | Paper |
| Estimating causal effects with optimization-based methods: A review and empirical comparison (available as arXiv preprint) | | 2022-02-28 | Paper |
| scientific article; zbMATH DE number 7306927 (available as arXiv preprint) | | 2021-02-05 | Paper |
| scientific article; zbMATH DE number 7306927 | | 2021-02-05 | Paper |
| The Bottleneck Simulator: A Model-Based Deep Reinforcement Learning Approach | Journal of Artificial Intelligence Research | 2020-11-03 | Paper |
| On Overfitting and Asymptotic Bias in Batch Reinforcement Learning with Partial Observability | Journal of Artificial Intelligence Research | 2019-05-17 | Paper |
| An introduction to deep reinforcement learning | Foundations and Trends® in Machine Learning | 2019-03-20 | Paper |
| Streaming kernel regression with provably adaptive mean, variance, and regularization | | 2018-11-21 | Paper |
| Streaming kernel regression with provably adaptive mean, variance, and regularization (available as arXiv preprint) | | 2018-11-21 | Paper |
| Imputing missing data from sequential multiple assignment randomized trials | | 2016-10-27 | Paper |
| Practical reinforcement learning in dynamic treatment regimes | | 2016-10-27 | Paper |
| Practical kernel-based reinforcement learning | Journal of Machine Learning Research (JMLR) | 2016-06-06 | Paper |
| Bayesian reinforcement learning: a survey | Foundations and Trends in Machine Learning | 2016-05-30 | Paper |
| Efficient learning and planning with compressed predictive states | | 2015-05-06 | Paper |
| Efficient learning and planning with compressed predictive states (available as arXiv preprint) | | 2015-05-06 | Paper |
| Policy iteration based on stochastic factorization | The Journal of Artificial Intelligence Research (JAIR) | 2014-09-05 | Paper |
| A Bayesian approach for learning and planning in partially observable Markov decision processes | | 2014-02-03 | Paper |
| The duality of state and observation in probabilistic transition systems | Logic, Language, and Computation | 2013-04-12 | Paper |
| Reinforcement learning with limited reinforcement: using Bayes risk for active learning in POMDPs | Artificial Intelligence | 2012-11-15 | Paper |
| Non-deterministic policies in Markovian decision processes | Journal of Artificial Intelligence Research | 2011-01-21 | Paper |
| POMDP planning for robust robot control | Springer Tracts in Advanced Robotics | 2010-06-02 | Paper |
| Online planning algorithms for POMDPs (available as arXiv preprint) | | 2009-04-28 | Paper |
| Towards robotic assistants in nursing homes: Challenges and results | Robotics and Autonomous Systems | 2003-04-03 | Paper |