PILCO
From MaRDI portal
Cited in (52)
- High-dimensional Bayesian optimization with projections using quantile Gaussian processes
- A model for system uncertainty in reinforcement learning
- Reasoning about uncertain parameters and agent behaviors through encoded experiences and belief planning
- An incremental off-policy search in a model-free Markov decision process using a single sample path
- On the universal transformation of data-driven models to control systems
- scientific article; zbMATH DE number 7306857
- SAMBA: safe model-based & active reinforcement learning
- A new algorithm for the LQR problem with partially unknown dynamics
- Fault tolerant control using Gaussian processes and model predictive control
- Stochastic embeddings of dynamical phenomena through variational autoencoders
- Deep Reinforcement Learning: A State-of-the-Art Walkthrough
- Hybrid control for learning motor skills
- Optimized look-ahead tree policies: a bridge between look-ahead tree policies and direct policy search
- An active exploration method for data efficient reinforcement learning
- scientific article; zbMATH DE number 7370622
- Non-parametric policy search with limited information loss
- Model-based policy gradients with parameter-based exploration by least-squares conditional density estimation
- RobOptim
- RL-Glue
- TEXPLORE
- DESPOT
- gfoRmula
- TAMER
- Pybullet
- SafeOpt
- Dart
- ViZDoom
- BaRC
- Adv-BNN
- A survey of preference-based reinforcement learning methods
- scientific article; zbMATH DE number 6982305
- Policy space identification in configurable environments
- DMPC: a data-and model-driven approach to predictive control
- Implicit Contact Dynamics Modeling With Explicit Inertia Matrix Representation for Real-Time, Model-Based Control in Physical Environment
- Efficient model-based reinforcement learning for approximate online optimal control
- scientific article; zbMATH DE number 7370553
- Numerical trajectory optimization for stochastic mechanical systems
- CURL
- MOGPTK
- Safety Gym
- VIREL
- VIME
- SafePILCO
- Model-based reinforcement learning for approximate optimal regulation
- Model-based Reinforcement Learning: A Survey
- Grounded action transformation for sim-to-real reinforcement learning
- Online reinforcement learning using a probability density estimation
- Model-based contextual policy search for data-efficient generalization of robot skills
- scientific article; zbMATH DE number 7370547
- Deep active inference as variational policy gradients
- DARLA
- Mixed density methods for approximate dynamic programming
This page was built for software: PILCO