Sequential Decision Making With Coherent Risk
From MaRDI portal
Publication:5358614
DOI10.1109/TAC.2016.2644871zbMath1370.90286OpenAlexW2561666900MaRDI QIDQ5358614
Aviv Tamar, Mohammad Ghavamzadeh, Yinlam Chow, Shie Mannor
Publication date: 21 September 2017
Published in: IEEE Transactions on Automatic Control (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1109/tac.2016.2644871
Decision theory (91B06) Dynamic programming (90C39) Markov and semi-Markov decision processes (90C40)
Related Items (13)
Zeroth-Order Stochastic Compositional Algorithms for Risk-Aware Learning ⋮ Deep reinforcement learning for option pricing and hedging under dynamic expectile risk measures ⋮ Safe reward‐based deep reinforcement learning control for an electro‐hydraulic servo system ⋮ Safe reinforcement learning: A control barrier function optimization approach ⋮ Index policy for multiarmed bandit problem with dynamic risk measures ⋮ Unnamed Item ⋮ Conditionally Elicitable Dynamic Risk Measures for Deep Reinforcement Learning ⋮ Reinforcement learning with dynamic convex risk measures ⋮ An \((R, S)\)-norm information measure for hesitant fuzzy sets and its application in decision-making ⋮ Unnamed Item ⋮ An online algorithm for the risk-aware restless bandit ⋮ Mean-Semivariance Policy Optimization via Risk-Averse Reinforcement Learning ⋮ Peril, prudence and planning as risk, avoidance and worry
This page was built for publication: Sequential Decision Making With Coherent Risk