scientific article
From MaRDI portal
Publication:2810828
zbMath1360.68687arXiv1504.00702MaRDI QIDQ2810828
Pieter Abbeel, Chelsea Finn, Trevor Darrell, Sergey Levine
Publication date: 6 June 2016
Full work available at URL: https://arxiv.org/abs/1504.00702
Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.
Learning and adaptive systems in artificial intelligence (68T05) Automated systems (robots, etc.) in control theory (93C85) Artificial intelligence for robotics (68T40)
Related Items (38)
Reinforcement learning for robotic manipulation using simulated locomotion demonstrations ⋮ On Efficient Reinforcement Learning for Full-length Game of StarCraft II ⋮ Derivative-Free Methods for Policy Optimization: Guarantees for Linear Quadratic Systems ⋮ End-to-end learning for off-road terrain navigation using the chrono open-source simulation platform ⋮ A predictive safety filter for learning-based control of constrained nonlinear dynamical systems ⋮ Distributed inverse optimal control ⋮ Neural circuits for learning context-dependent associations of stimuli ⋮ Skill-based curiosity for intrinsically motivated reinforcement learning ⋮ Variational policy search using sparse Gaussian process priors for learning multimodal optimal actions ⋮ Deep reinforcement trading with predictable returns ⋮ \textsc{GoSafeOpt}: scalable safe exploration for global optimization of dynamical systems ⋮ Unnamed Item ⋮ Unnamed Item ⋮ On the sample complexity of the linear quadratic regulator ⋮ Uncovering instabilities in variational-quantum deep Q-networks ⋮ Tutorial on Amortized Optimization ⋮ Conditionally Elicitable Dynamic Risk Measures for Deep Reinforcement Learning ⋮ Learning key steps to attack deep reinforcement learning agents ⋮ From Reinforcement Learning to Deep Reinforcement Learning: An Overview ⋮ A thermodynamics-informed active learning approach to perception and reasoning about fluids ⋮ Specification-guided reinforcement learning ⋮ Deep Convolutional Neural Networks for Image Classification: A Comprehensive Review ⋮ A Method to Effectively Detect Vulnerabilities on Path Planning of VIN ⋮ Training of deep neural networks for the generation of dynamic movement primitives ⋮ Unnamed Item ⋮ An active exploration method for data efficient reinforcement learning ⋮ A normative supervisor for reinforcement learning agents ⋮ Challenges of real-world reinforcement learning: definitions, benchmarks and analysis ⋮ Dealing with multiple experts and non-stationarity in inverse reinforcement learning: an application to real-life problems ⋮ Risk-averse policy optimization via risk-neutral policy optimization ⋮ Unnamed Item ⋮ Unnamed Item ⋮ Unnamed Item ⋮ Optimal adaptive control of partially uncertain linear continuous-time systems with state delay ⋮ A top-down approach to attain decentralized multi-agents ⋮ Reinforcement learning: an industrial perspective ⋮ Unnamed Item ⋮ Unnamed Item
Uses Software
This page was built for publication: