Choquet Regularization for Continuous-Time Reinforcement Learning (Q6073554): Difference between revisions

From MaRDI portal
Set OpenAlex properties.
ReferenceBot
Changed an Item
 
Property / cites work: Are law-invariant risk functions concave on distributions? / rank: Normal rank
Property / cites work: Coherent Measures of Risk / rank: Normal rank
Property / cites work: Parametric measures of variability induced by risk measures / rank: Normal rank
Property / cites work: Linear-quadratic approximation of optimal policy problems / rank: Normal rank
Property / cites work: Nonmonotonic Choquet integrals / rank: Normal rank
Property / cites work: Q4550909 / rank: Normal rank
Property / cites work: Non-additive measure and integral / rank: Normal rank
Property / cites work: Dual Moments and Risk Attitudes / rank: Normal rank
Property / cites work: Data-driven distributionally robust optimization using the Wasserstein metric: performance guarantees and tractable reformulations / rank: Normal rank
Property / cites work: Convex measures of risk and trading constraints / rank: Normal rank
Property / cites work: Stochastic finance. An introduction in discrete time. / rank: Normal rank
Property / cites work: State-Dependent Temperature Control for Langevin Diffusions / rank: Normal rank
Property / cites work: Maxmin expected utility with non-unique prior / rank: Normal rank
Property / cites work: Variance Formulas for the Mean Difference and Coefficient of Concentration / rank: Normal rank
Property / cites work: Maximum Entropy Principle with General Deviation Measures / rank: Normal rank
Property / cites work: Entropy Regularization for Mean Field Games with Learning / rank: Normal rank
Property / cites work: On a family of coherent measures of variability / rank: Normal rank
Property / cites work: Q4225048 / rank: Normal rank
Property / cites work: Q4552656 / rank: Normal rank
Property / cites work: Iterative linearization methods for approximately optimal control and estimation of non-linear stochastic system / rank: Normal rank
Property / cites work: Convex risk functionals: representation and applications / rank: Normal rank
Property / cites work: Ambiguity in portfolio selection / rank: Normal rank
Property / cites work: Cumulative Residual Entropy: A New Measure of Information / rank: Normal rank
Property / cites work: Generalized deviations in risk analysis / rank: Normal rank
Property / cites work: Subjective Probability and Expected Utility without Additivity / rank: Normal rank
Property / cites work: Quantile based entropy function / rank: Normal rank
Property / cites work: Exploratory HJB Equations and Their Convergence / rank: Normal rank
Property / cites work: Some properties of the cumulative residual entropy of coherent and mixed systems / rank: Normal rank
Property / cites work: Advances in prospect theory: cumulative representation of uncertainty / rank: Normal rank
Property / cites work: Q5149240 / rank: Normal rank
Property / cites work: Continuous‐time mean–variance portfolio selection: A reinforcement learning framework / rank: Normal rank
Property / cites work: DISTORTION RISKMETRICS ON GENERAL SPACES / rank: Normal rank
Property / cites work: Characterization, Robustness, and Aggregation of Signed Choquet Integrals / rank: Normal rank
Property / cites work: Axiomatic characterization of insurance prices / rank: Normal rank
Property / cites work: The Dual Theory of Choice under Risk / rank: Normal rank

Latest revision as of 03:22, 3 August 2024

scientific article; zbMATH DE number 7748432
Language: English
Label: Choquet Regularization for Continuous-Time Reinforcement Learning
Description: scientific article; zbMATH DE number 7748432
Also known as: (none listed)

    Statements

    Choquet Regularization for Continuous-Time Reinforcement Learning (English)
    11 October 2023
    reinforcement learning
    Choquet integrals
    continuous time
    exploration
    regularizers
    quantile
    HJB equations
    linear-quadratic control

    Identifiers