Jointly Learning Environments and Control Policies with Projected Stochastic Gradient Ascent (Q5026254): Difference between revisions
From MaRDI portal
Created claim: Wikidata QID (P12): Q113424375, #quickstatements; #temporary_batch_1707303357582 |
Set OpenAlex properties. |
||
(2 intermediate revisions by 2 users not shown) | |||
Property / MaRDI profile type | |||
Property / MaRDI profile type: MaRDI publication profile / rank | |||
Normal rank | |||
Property / OpenAlex ID | |||
Property / OpenAlex ID: W3200083785 / rank | |||
Normal rank | |||
links / mardi / name | links / mardi / name | ||
Latest revision as of 02:34, 20 March 2024
scientific article; zbMATH DE number 7470383
Language | Label | Description | Also known as |
---|---|---|---|
English | Jointly Learning Environments and Control Policies with Projected Stochastic Gradient Ascent |
scientific article; zbMATH DE number 7470383 |
Statements
Jointly Learning Environments and Control Policies with Projected Stochastic Gradient Ascent (English)
0 references
7 February 2022
0 references
machine learning
0 references
neural networks
0 references
reinforcement learning
0 references