PILCO - MaRDI portal

Cited in (52)

High-dimensional Bayesian optimization with projections using quantile Gaussian processes
A model for system uncertainty in reinforcement learning
Reasoning about uncertain parameters and agent behaviors through encoded experiences and belief planning
An incremental off-policy search in a model-free Markov decision process using a single sample path
On the universal transformation of data-driven models to control systems
scientific article; zbMATH DE number 7306857
SAMBA: safe model-based & active reinforcement learning
A new algorithm for the LQR problem with partially unknown dynamics
Fault tolerant control using Gaussian processes and model predictive control
Stochastic embeddings of dynamical phenomena through variational autoencoders
Deep Reinforcement Learning: A State-of-the-Art Walkthrough
Hybrid control for learning motor skills
Optimized look-ahead tree policies: a bridge between look-ahead tree policies and direct policy search
An active exploration method for data efficient reinforcement learning
scientific article; zbMATH DE number 7370622
Non-parametric policy search with limited information loss
Model-based policy gradients with parameter-based exploration by least-squares conditional density estimation
RobOptim
RL-Glue
TEXPLORE
DESPOT
gfoRmula
TAMER
Pybullet
SafeOpt
Dart
ViZDoom
BaRC
Adv-BNN
A survey of preference-based reinforcement learning methods
scientific article; zbMATH DE number 6982305
Policy space identification in configurable environments
DMPC: a data-and model-driven approach to predictive control
Implicit Contact Dynamics Modeling With Explicit Inertia Matrix Representation for Real-Time, Model-Based Control in Physical Environment
Efficient model-based reinforcement learning for approximate online optimal control
scientific article; zbMATH DE number 7370553
Numerical trajectory optimization for stochastic mechanical systems
CURL
MOGPTK
Safety Gym
VIREL
VIME
SafePILCO
Model-based reinforcement learning for approximate optimal regulation
Model-based Reinforcement Learning: A Survey
Grounded action transformation for sim-to-real reinforcement learning
Online reinforcement learning using a probability density estimation
Model-based contextual policy search for data-efficient generalization of robot skills
scientific article; zbMATH DE number 7370547
Deep active inference as variational policy gradients
DARLA
Mixed density methods for approximate dynamic programming

This page was built for software: PILCO