TD3-BC-PPO: twin delayed DDPG-based and behavior cloning-enhanced proximal policy optimization for dynamic optimization affine formation (Q6579244)

From MaRDI portal





scientific article; zbMATH DE number 7887365
Language Label Description Also known as
default for all languages
No label defined
    English
    TD3-BC-PPO: twin delayed DDPG-based and behavior cloning-enhanced proximal policy optimization for dynamic optimization affine formation
    scientific article; zbMATH DE number 7887365

      Statements

      TD3-BC-PPO: twin delayed DDPG-based and behavior cloning-enhanced proximal policy optimization for dynamic optimization affine formation (English)
      0 references
      0 references
      0 references
      0 references
      25 July 2024
      0 references
      affine transformation
      0 references
      deep reinforcement learning
      0 references
      dynamic optimization
      0 references
      formation shape
      0 references

      Identifiers