Jointly Learning Environments and Control Policies with Projected Stochastic Gradient Ascent (Q5026254): Difference between revisions
From MaRDI portal
Created a new Item |
Created claim: Wikidata QID (P12): Q113424375, #quickstatements; #temporary_batch_1707303357582 |
||
Property / Wikidata QID | |||
Property / Wikidata QID: Q113424375 / rank | |||
Normal rank |
Revision as of 17:39, 7 February 2024
scientific article; zbMATH DE number 7470383
Language | Label | Description | Also known as |
---|---|---|---|
English | Jointly Learning Environments and Control Policies with Projected Stochastic Gradient Ascent |
scientific article; zbMATH DE number 7470383 |
Statements
Jointly Learning Environments and Control Policies with Projected Stochastic Gradient Ascent (English)
0 references
7 February 2022
0 references
machine learning
0 references
neural networks
0 references
reinforcement learning
0 references