Jointly Learning Environments and Control Policies with Projected Stochastic Gradient Ascent (Q5026254): Difference between revisions
From MaRDI portal
Created claim: Wikidata QID (P12): Q113424375, #quickstatements; #temporary_batch_1707303357582 |
Added link to MaRDI item. |
||
links / mardi / name | links / mardi / name | ||
Revision as of 10:32, 8 February 2024
scientific article; zbMATH DE number 7470383
Language | Label | Description | Also known as |
---|---|---|---|
English | Jointly Learning Environments and Control Policies with Projected Stochastic Gradient Ascent |
scientific article; zbMATH DE number 7470383 |
Statements
Jointly Learning Environments and Control Policies with Projected Stochastic Gradient Ascent (English)
0 references
7 February 2022
0 references
machine learning
0 references
neural networks
0 references
reinforcement learning
0 references