Reinforcement learning endowed with safe veto policies to learn the control of linked-multicomponent robotic systems
DOI: 10.1016/j.ins.2015.04.005; zbMath: 1390.68691; OpenAlex: W1985215440; MaRDI QID: Q1749908
Ismael Etxeberria-Agiriano, Manuel Graña, Jose Manuel Lopez-Guede, Igor Ansoategui, Borja Fernandez-Gauna
Publication date: 17 May 2018
Published in: Information Sciences
Full work available at URL: https://doi.org/10.1016/j.ins.2015.04.005
reinforcement learning; linked multicomponent robotic systems; safe exploration policies; speeding convergence of RL
Learning and adaptive systems in artificial intelligence (68T05); Automated systems (robots, etc.) in control theory (93C85); Artificial intelligence for robotics (68T40)
Cites Work
- Reinforcement learning algorithms with function approximation: recent advances and applications
- Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
- \({\mathcal Q}\)-learning
- Undesired state-action prediction in multi-agent reinforcement learning for linked multi-component robotic system control
- Machine Learning: ECML 2004