Q5149240 (Q5149240): Difference between revisions

From MaRDI portal
Import240304020342 (talk | contribs)
Set profile property.
ReferenceBot (talk | contribs)
Changed an Item
 
Property / cites work
 
Property / cites work: Linear Thompson sampling revisited / rank
 
Normal rank
Property / cites work
 
Property / cites work: Finite-time analysis of the multiarmed bandit problem / rank
 
Normal rank
Property / cites work
 
Property / cites work: 10.1162/153244303765208377 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Finite state Markovian decision processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Compactification methods in the control of degenerate diffusions: existence of an optimal control / rank
 
Normal rank
Property / cites work
 
Property / cites work: On stochastic relaxed control for partially observed diffusions / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4057976 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4002114 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Multi-armed bandits in discrete and continuous time / rank
 
Normal rank
Property / cites work
 
Property / cites work: Existence of Markov Controls and Characterization of Optimal Markov Controls / rank
 
Normal rank
Property / cites work
 
Property / cites work: Stationary solutions and forward equations for controlled and singular martingale problems / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q2810828 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Iterative linearization methods for approximately optimal control and estimation of non-linear stochastic system / rank
 
Normal rank
Property / cites work
 
Property / cites work: Continuous multi-armed bandits and multiparameter processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5214215 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Learning to Optimize via Posterior Sampling / rank
 
Normal rank
Property / cites work
 
Property / cites work: An analysis of model-based interval estimation for Markov decision processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q2880979 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4626283 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Continuous‐time mean–variance portfolio selection: A reinforcement learning framework / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4255599 / rank
 
Normal rank
Property / cites work
 
Property / cites work: On the Existence of Optimal Relaxed Controls of Stochastic Partial Differential Equations / rank
 
Normal rank

Latest revision as of 12:32, 24 July 2024

scientific article; zbMATH DE number 7307478
Language Label Description Also known as
English
No label defined
scientific article; zbMATH DE number 7307478

    Statements

    8 February 2021
    0 references
    reinforcement learning
    0 references
    entropy regularization
    0 references
    stochastic control
    0 references
    relaxed control
    0 references
    linear-quadratic
    0 references
    Gaussian distribution
    0 references

    Identifiers