TD3-BC-PPO: twin delayed DDPG-based and behavior cloning-enhanced proximal policy optimization for dynamic optimization affine formation (Q6579244)
From MaRDI portal
scientific article; zbMATH DE number 7887365
| Language | Label | Description | Also known as |
|---|---|---|---|
| default for all languages | No label defined | | |
| English | TD3-BC-PPO: twin delayed DDPG-based and behavior cloning-enhanced proximal policy optimization for dynamic optimization affine formation | scientific article; zbMATH DE number 7887365 | |
Statements
TD3-BC-PPO: twin delayed DDPG-based and behavior cloning-enhanced proximal policy optimization for dynamic optimization affine formation (English)
25 July 2024
affine transformation
deep reinforcement learning
dynamic optimization
formation shape
0.7161442041397095
0.7138738632202148
0.6736953258514404