TD3-BC-PPO: twin delayed DDPG-based and behavior cloning-enhanced proximal policy optimization for dynamic optimization affine formation
From MaRDI portal
Publication:6579244
Recommendations
- Optimal robust formation control for heterogeneous multi‐agent systems based on reinforcement learning
- Heterogeneous optimal formation control of nonlinear multi-agent systems with unknown dynamics by safe reinforcement learning
- Performance‐guaranteed containment control for pure‐feedback multi agent systems via reinforcement learning algorithm
- Optimal antisynchronization control for unknown multiagent systems with deep deterministic policy gradient approach
Cites work
- Adaptive Formation Tracking Control for First-Order Agents With a Time-Varying Flow Parameter
- Affine Formation Maneuver Control of Multiagent Systems
- Affine formation maneuver control of high-order multi-agent systems over directed networks
- Collision avoidance control for limited perception unmanned surface vehicle swarm based on proximal policy optimization
- Event-triggered affine formation maneuver control for second-order multi-agent systems with sampled data
- Improved DRL-based energy-efficient UAV control for maximum lifecycle
- Necessary and Sufficient Graphical Conditions for Affine Formation Control
- Optimal dynamic formation control of multi-agent systems in constrained environments
This page was built for publication: TD3-BC-PPO: twin delayed DDPG-based and behavior cloning-enhanced proximal policy optimization for dynamic optimization affine formation
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q6579244)