scientific article

Publication:2810828

zbMath: 1360.68687, arXiv: 1504.00702, MaRDI QID: Q2810828

Pieter Abbeel, Chelsea Finn, Trevor Darrell, Sergey Levine

Publication date: 6 June 2016

Full work available at URL: https://arxiv.org/abs/1504.00702

Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.



Related Items (38)

Reinforcement learning for robotic manipulation using simulated locomotion demonstrations
On Efficient Reinforcement Learning for Full-length Game of StarCraft II
Derivative-Free Methods for Policy Optimization: Guarantees for Linear Quadratic Systems
End-to-end learning for off-road terrain navigation using the chrono open-source simulation platform
A predictive safety filter for learning-based control of constrained nonlinear dynamical systems
Distributed inverse optimal control
Neural circuits for learning context-dependent associations of stimuli
Skill-based curiosity for intrinsically motivated reinforcement learning
Variational policy search using sparse Gaussian process priors for learning multimodal optimal actions
Deep reinforcement trading with predictable returns
GoSafeOpt: scalable safe exploration for global optimization of dynamical systems
Unnamed Item
Unnamed Item
On the sample complexity of the linear quadratic regulator
Uncovering instabilities in variational-quantum deep Q-networks
Tutorial on Amortized Optimization
Conditionally Elicitable Dynamic Risk Measures for Deep Reinforcement Learning
Learning key steps to attack deep reinforcement learning agents
From Reinforcement Learning to Deep Reinforcement Learning: An Overview
A thermodynamics-informed active learning approach to perception and reasoning about fluids
Specification-guided reinforcement learning
Deep Convolutional Neural Networks for Image Classification: A Comprehensive Review
A Method to Effectively Detect Vulnerabilities on Path Planning of VIN
Training of deep neural networks for the generation of dynamic movement primitives
Unnamed Item
An active exploration method for data efficient reinforcement learning
A normative supervisor for reinforcement learning agents
Challenges of real-world reinforcement learning: definitions, benchmarks and analysis
Dealing with multiple experts and non-stationarity in inverse reinforcement learning: an application to real-life problems
Risk-averse policy optimization via risk-neutral policy optimization
Unnamed Item
Unnamed Item
Unnamed Item
Optimal adaptive control of partially uncertain linear continuous-time systems with state delay
A top-down approach to attain decentralized multi-agents
Reinforcement learning: an industrial perspective
Unnamed Item
Unnamed Item




