Risk-averse policy optimization via risk-neutral policy optimization (Q2082514): Difference between revisions

@@ Property / DOI @@
-10.1016/j.artint.2022.103765
@@ Property / DOI: 10.1016/j.artint.2022.103765 / rank @@
-Normal rank
@@ Property / Wikidata QID @@
+Q113442972
@@ Property / Wikidata QID: Q113442972 / rank @@
+Normal rank
@@ Property / describes a project that uses @@
+MuJoCo
@@ Property / describes a project that uses: MuJoCo / rank @@
+Normal rank
@@ Property / describes a project that uses @@
+Stable Baselines
@@ Property / describes a project that uses: Stable Baselines / rank @@
+Normal rank
@@ Property / MaRDI profile type @@
+MaRDI publication profile
@@ Property / MaRDI profile type: MaRDI publication profile / rank @@
+Normal rank
@@ Property / full work available at URL @@
+https://doi.org/10.1016/j.artint.2022.103765
@@ Property / full work available at URL: https://doi.org/10.1016/j.artint.2022.103765 / rank @@
+Normal rank
@@ Property / OpenAlex ID @@
+W4285403797
@@ Property / OpenAlex ID: W4285403797 / rank @@
+Normal rank
@@ Property / cites work @@
+Coherent Measures of Risk
@@ Property / cites work: Coherent Measures of Risk / rank @@
+Normal rank
@@ Property / cites work @@
+Markov decision processes with average-value-at-risk criteria
@@ Property / cites work: Markov decision processes with average-value-at-risk criteria / rank @@
+Normal rank
@@ Property / cites work @@
+More Risk-Sensitive Markov Decision Processes
@@ Property / cites work: More Risk-Sensitive Markov Decision Processes / rank @@
+Normal rank
@@ Property / cites work @@
+Q4533362
@@ Property / cites work: Q4533362 / rank @@
+Normal rank
@@ Property / cites work @@
+On the Convergence of Block Coordinate Descent Type Methods
@@ Property / cites work: On the Convergence of Block Coordinate Descent Type Methods / rank @@
+Normal rank
@@ Property / cites work @@
+Q-Learning for Risk-Sensitive Control
@@ Property / cites work: Q-Learning for Risk-Sensitive Control / rank @@
+Normal rank
@@ Property / cites work @@
+Risk-Sensitive Optimal Control for Markov Decision Processes with Monotone Cost
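@@ Property / cites work: Risk-Sensitive Optimal Control for Markov Decision Processes with Monotone Cost / rank @@
+Normal rank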
@@ Property / cites work @@
+Risk-Constrained Reinforcement Learning with Percentile Risk Criteria
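@@ Property / cites work: Risk-Constrained Reinforcement Learning with Percentile Risk Criteria / rank @@
+Normal rank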
@@ Property / cites work @@
+Q5744808
@@ Property / cites work: Q5744808 / rank @@
+Normal rank
@@ Property / cites work @@
+Risk-Sensitive Markov Decision Processes
@@ Property / cites work: Risk-Sensitive Markov Decision Processes / rank @@
+Normal rank
@@ Property / cites work @@
+Q2810828
@@ Property / cites work: Q2810828 / rank @@
+Normal rank
@@ Property / cites work @@
+Risk-sensitive reinforcement learning
@@ Property / cites work: Risk-sensitive reinforcement learning / rank @@
+Normal rank
@@ Property / cites work @@
+Robust Control of Markov Decision Processes with Uncertain Transition Matrices
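@@ Property / cites work: Robust Control of Markov Decision Processes with Uncertain Transition Matrices / rank @@
+Normal rank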
@@ Property / cites work @@
+Reinforcement learning with replacing eligibility traces
@@ Property / cites work: Reinforcement learning with replacing eligibility traces / rank @@
+Normal rank
@@ Property / cites work @@
+Q4626283
@@ Property / cites work: Q4626283 / rank @@
+Normal rank
@@ Property / cites work @@
+Convergence of a block coordinate descent method for nondifferentiable minimization
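@@ Property / cites work: Convergence of a block coordinate descent method for nondifferentiable minimization / rank @@
+Normal rank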
@@ Property / cites work @@
+Robust Markov Decision Processes
@@ Property / cites work: Robust Markov Decision Processes / rank @@
+Normal rank
@@ Property / cites work @@
+Simple statistical gradient-following algorithms for connectionist reinforcement learning
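@@ Property / cites work: Simple statistical gradient-following algorithms for connectionist reinforcement learning / rank @@
+Normal rank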
@@ Property / cites work @@
+Coordinate descent algorithms
@@ Property / cites work: Coordinate descent algorithms / rank @@
+Normal rank
@@ Property / DOI @@
+10.1016/J.ARTINT.2022.103765
@@ Property / DOI: 10.1016/J.ARTINT.2022.103765 / rank @@
+Normal rank
@@ links / mardi / name @@
+Publication:2082514