Risk-averse policy optimization via risk-neutral policy optimization (Q2082514): Difference between revisions

From MaRDI portal
Importer (talk | contribs)
Created a new Item
 
Import241208061232 (talk | contribs)
Normalize DOI.
 
(7 intermediate revisions by 6 users not shown)
Property / DOI
 
Property / DOI: 10.1016/j.artint.2022.103765 / rank
Normal rank
 
Property / Wikidata QID
 
Property / Wikidata QID: Q113442972 / rank
 
Normal rank
Property / describes a project that uses
 
Property / describes a project that uses: MuJoCo / rank
 
Normal rank
Property / describes a project that uses
 
Property / describes a project that uses: Stable Baselines / rank
 
Normal rank
Property / MaRDI profile type
 
Property / MaRDI profile type: MaRDI publication profile / rank
 
Normal rank
Property / full work available at URL
 
Property / full work available at URL: https://doi.org/10.1016/j.artint.2022.103765 / rank
 
Normal rank
Property / OpenAlex ID
 
Property / OpenAlex ID: W4285403797 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Coherent Measures of Risk / rank
 
Normal rank
Property / cites work
 
Property / cites work: Markov decision processes with average-value-at-risk criteria / rank
 
Normal rank
Property / cites work
 
Property / cites work: More Risk-Sensitive Markov Decision Processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4533362 / rank
 
Normal rank
Property / cites work
 
Property / cites work: On the Convergence of Block Coordinate Descent Type Methods / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q-Learning for Risk-Sensitive Control / rank
 
Normal rank
Property / cites work
 
Property / cites work: Risk-Sensitive Optimal Control for Markov Decision Processes with Monotone Cost / rank
 
Normal rank
Property / cites work
 
Property / cites work: Risk-Constrained Reinforcement Learning with Percentile Risk Criteria / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5744808 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Risk-Sensitive Markov Decision Processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q2810828 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Risk-sensitive reinforcement learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: Robust Control of Markov Decision Processes with Uncertain Transition Matrices / rank
 
Normal rank
Property / cites work
 
Property / cites work: Reinforcement learning with replacing eligibility traces / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4626283 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Convergence of a block coordinate descent method for nondifferentiable minimization / rank
 
Normal rank
Property / cites work
 
Property / cites work: Robust Markov Decision Processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Simple statistical gradient-following algorithms for connectionist reinforcement learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: Coordinate descent algorithms / rank
 
Normal rank
Property / DOI
 
Property / DOI: 10.1016/J.ARTINT.2022.103765 / rank
 
Normal rank
links / mardi / namelinks / mardi / name
 

Latest revision as of 00:11, 17 December 2024

scientific article
Language Label Description Also known as
English
Risk-averse policy optimization via risk-neutral policy optimization
scientific article

    Statements

    Risk-averse policy optimization via risk-neutral policy optimization (English)
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    4 October 2022
    0 references
    reinforcement learning
    0 references
    risk-aversion
    0 references
    risk-sensitivity
    0 references

    Identifiers