MuJoCo
From MaRDI portal
Software:38932
swMATH27214MaRDI QIDQ38932FDOQ38932
Author name not available (Why is that?)
Cited In (30)
- Efficient Actor-Critic Reinforcement Learning With Embodiment of Muscle Tone for Posture Stabilization of the Human Arm
- Efficient Multi-objective Reinforcement Learning via Multiple-gradient Descent with Iteratively Discovered Weight-Vector Sets
- Reinforcement learning for robotic manipulation using simulated locomotion demonstrations
- TD-regularized actor-critic methods
- Title not available (Why is that?)
- Title not available (Why is that?)
- Neural Networks and Deep Learning
- End-to-end training of deep visuomotor policies
- Recruitment-imitation mechanism for evolutionary reinforcement learning
- Deep active inference
- Reinforcement learning control of constrained dynamic systems with uniformly ultimate boundedness stability guarantee
- Mean-Semivariance Policy Optimization via Risk-Averse Reinforcement Learning
- Deep Reinforcement Learning: A State-of-the-Art Walkthrough
- Multi-physics modelling of a compliant humanoid robot
- End-to-end learning for off-road terrain navigation using the chrono open-source simulation platform
- Exponential integration for efficient and accurate multibody simulation with stiff viscoelastic contacts
- A Lagrangian method for constrained dynamics in tensegrity systems with compressible bars
- Automated Reinforcement Learning (AutoRL): A Survey and Open Problems
- Lipschitzness is all you need to tame off-policy generative adversarial imitation learning
- Constraint learning for control tasks with limited duration barrier functions
- Title not available (Why is that?)
- Title not available (Why is that?)
- Title not available (Why is that?)
- Title not available (Why is that?)
- Skill-based curiosity for intrinsically motivated reinforcement learning
- Perception and Motion Planning for Unknotting/untangling of Ropes of Finite Thickness
- Air learning: a deep reinforcement learning gym for autonomous aerial robot visual navigation
- How does momentum benefit deep neural networks architecture design? A few case studies
- Risk-averse policy optimization via risk-neutral policy optimization
- Title not available (Why is that?)
This page was built for software: MuJoCo