Choquet Regularization for Continuous-Time Reinforcement Learning
From MaRDI portal
Publication:6073554
DOI10.1137/22m1524734arXiv2208.08497OpenAlexW4386750089MaRDI QIDQ6073554
Ruodu Wang, Xun Yu Zhou, Xia Han
Publication date: 11 October 2023
Published in: SIAM Journal on Control and Optimization (Search for Journal in Brave)
Full work available at URL: https://arxiv.org/abs/2208.08497
reinforcement learningquantilelinear-quadratic controlcontinuous timeHJB equationsexplorationChoquet integralsregularizers
Cites Work
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Stochastic finance. An introduction in discrete time.
- Quantile based entropy function
- Linear-quadratic approximation of optimal policy problems
- Maxmin expected utility with non-unique prior
- Advances in prospect theory: cumulative representation of uncertainty
- Non-additive measure and integral
- Axiomatic characterization of insurance prices
- Convex measures of risk and trading constraints
- Data-driven distributionally robust optimization using the Wasserstein metric: performance guarantees and tractable reformulations
- Parametric measures of variability induced by risk measures
- On a family of coherent measures of variability
- Convex risk functionals: representation and applications
- Generalized deviations in risk analysis
- Coherent Measures of Risk
- Maximum Entropy Principle with General Deviation Measures
- Characterization, Robustness, and Aggregation of Signed Choquet Integrals
- Iterative linearization methods for approximately optimal control and estimation of non-linear stochastic system
- Cumulative Residual Entropy: A New Measure of Information
- Subjective Probability and Expected Utility without Additivity
- Variance Formulas for the Mean Difference and Coefficient of Concentration
- Some properties of the cumulative residual entropy of coherent and mixed systems
- The Dual Theory of Choice under Risk
- Exploratory HJB Equations and Their Convergence
- State-Dependent Temperature Control for Langevin Diffusions
- Dual Moments and Risk Attitudes
- DISTORTION RISKMETRICS ON GENERAL SPACES
- Are law-invariant risk functions concave on distributions?
- Ambiguity in portfolio selection
- Continuous‐time mean–variance portfolio selection: A reinforcement learning framework
- Entropy Regularization for Mean Field Games with Learning
- Nonmonotonic Choquet integrals
This page was built for publication: Choquet Regularization for Continuous-Time Reinforcement Learning