A stochastic maximum principle approach for reinforcement learning with parameterized environment (Q6105091): Difference between revisions

From MaRDI portal
Set OpenAlex properties.
ReferenceBot (talk | contribs)
Changed an Item
Property / cites work
 
Property / cites work: Particle Markov Chain Monte Carlo Methods / rank
 
Normal rank
Property / cites work
 
Property / cites work: A direct filter method for parameter estimation / rank
 
Normal rank
Property / cites work
 
Property / cites work: An efficient numerical algorithm for solving data driven feedback control problems / rank
 
Normal rank
Property / cites work
 
Property / cites work: Data assimilation of synthetic data as a novel strategy for predicting disease progression in alopecia areata / rank
 
Normal rank
Property / cites work
 
Property / cites work: A First Order Scheme for Backward Doubly Stochastic Differential Equations / rank
 
Normal rank
Property / cites work
 
Property / cites work: A survey of convergence results on particle filtering methods for practitioners / rank
 
Normal rank
Property / cites work
 
Property / cites work: An Efficient Gradient Projection Method for Stochastic Optimal Control Problems / rank
 
Normal rank
Property / cites work
 
Property / cites work: Higher-order implicit strong numerical schemes for stochastic differential equations / rank
 
Normal rank
Property / cites work
 
Property / cites work: A random map implementation of implicit filters / rank
 
Normal rank
Property / cites work
 
Property / cites work: A General Stochastic Maximum Principle for Optimal Control Problems / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4626283 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5149240 / rank
 
Normal rank
Property / cites work
 
Property / cites work: \({\mathcal Q}\)-learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4255599 / rank
 
Normal rank
Property / cites work
 
Property / cites work: A numerical scheme for BSDEs / rank
 
Normal rank
Property / cites work
 
Property / cites work: New Kinds of High-Order Multistep Schemes for Coupled Forward Backward Stochastic Differential Equations / rank
 
Normal rank

Revision as of 08:55, 1 August 2024

scientific article; zbMATH DE number 7696994
Language Label Description Also known as
English
A stochastic maximum principle approach for reinforcement learning with parameterized environment
scientific article; zbMATH DE number 7696994

    Statements

    A stochastic maximum principle approach for reinforcement learning with parameterized environment (English)
    0 references
    0 references
    0 references
    0 references
    16 June 2023
    0 references
    reinforcement learning
    0 references
    optimal control
    0 references
    stochastic maximum principle
    0 references
    parameter estimation
    0 references
    optimal filtering
    0 references

    Identifiers

    0 references
    0 references
    0 references
    0 references
    0 references
    0 references