scientific article; zbMATH DE number 7008325
From MaRDI portal
Publication:4614110
zbMath: 1475.68295; arXiv: 1803.00444; MaRDI QID: Q4614110
Heinz Koeppl, Abdelhak M. Zoubir, Jan Peters, Adrian Šošić, Elmar Rueckert
Publication date: 30 January 2019
Full work available at URL: https://arxiv.org/abs/1803.00444
Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.
Keywords: Gibbs sampling; graphical models; inverse reinforcement learning; learning from demonstration; Bayesian nonparametric modeling; subgoal inference
Nonparametric estimation (62G05); Bayesian inference (62F15); Learning and adaptive systems in artificial intelligence (68T05)
Cites Work
- Optimization by Simulated Annealing
- Probabilistic inference for determining options in reinforcement learning
- Natural actor-critic algorithms
- Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
- Autonomous agents modelling other agents: a comprehensive survey and open problems
- Convex Optimization: Algorithms and Complexity
- Active Learning
- DOI: 10.1162/jmlr.2003.3.4-5.993
- Dynamic graph connectivity in polylogarithmic worst case time