Risk-averse policy optimization via risk-neutral policy optimization (Q2082514): Difference between revisions
From MaRDI portal
Created claim: Wikidata QID (P12): Q113442972, #quickstatements; #temporary_batch_1707252663060
Changed an Item
Property / describes a project that uses: MuJoCo
Property / describes a project that uses: MuJoCo / rank: Normal rank
Revision as of 10:23, 29 February 2024
scientific article
Language | Label | Description | Also known as |
---|---|---|---|
English | Risk-averse policy optimization via risk-neutral policy optimization | scientific article | |
Statements
Risk-averse policy optimization via risk-neutral policy optimization (English)
0 references
4 October 2022
0 references
reinforcement learning
0 references
risk-aversion
0 references
risk-sensitivity
0 references