A stochastic maximum principle approach for reinforcement learning with parameterized environment (Q6105091): Difference between revisions

From MaRDI portal
Added link to MaRDI item.
Normalize DOI.
 
(2 intermediate revisions by 2 users not shown)
Property / DOI
 
Property / DOI: 10.1016/j.jcp.2023.112238 / rank
Normal rank
 
Property / OpenAlex ID
 
Property / OpenAlex ID: W4377101750 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Particle Markov Chain Monte Carlo Methods / rank
 
Normal rank
Property / cites work
 
Property / cites work: A direct filter method for parameter estimation / rank
 
Normal rank
Property / cites work
 
Property / cites work: An efficient numerical algorithm for solving data driven feedback control problems / rank
 
Normal rank
Property / cites work
 
Property / cites work: Data assimilation of synthetic data as a novel strategy for predicting disease progression in alopecia areata / rank
 
Normal rank
Property / cites work
 
Property / cites work: A First Order Scheme for Backward Doubly Stochastic Differential Equations / rank
 
Normal rank
Property / cites work
 
Property / cites work: A survey of convergence results on particle filtering methods for practitioners / rank
 
Normal rank
Property / cites work
 
Property / cites work: An Efficient Gradient Projection Method for Stochastic Optimal Control Problems / rank
 
Normal rank
Property / cites work
 
Property / cites work: Higher-order implicit strong numerical schemes for stochastic differential equations / rank
 
Normal rank
Property / cites work
 
Property / cites work: A random map implementation of implicit filters / rank
 
Normal rank
Property / cites work
 
Property / cites work: A General Stochastic Maximum Principle for Optimal Control Problems / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4626283 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5149240 / rank
 
Normal rank
Property / cites work
 
Property / cites work: \({\mathcal Q}\)-learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4255599 / rank
 
Normal rank
Property / cites work
 
Property / cites work: A numerical scheme for BSDEs / rank
 
Normal rank
Property / cites work
 
Property / cites work: New Kinds of High-Order Multistep Schemes for Coupled Forward Backward Stochastic Differential Equations / rank
 
Normal rank
Property / DOI
 
Property / DOI: 10.1016/J.JCP.2023.112238 / rank
 
Normal rank

Latest revision as of 19:29, 30 December 2024

scientific article; zbMATH DE number 7696994
Language Label Description Also known as
English
A stochastic maximum principle approach for reinforcement learning with parameterized environment
scientific article; zbMATH DE number 7696994

    Statements

    A stochastic maximum principle approach for reinforcement learning with parameterized environment (English)
    0 references
    0 references
    0 references
    0 references
    16 June 2023
    0 references
    reinforcement learning
    0 references
    optimal control
    0 references
    stochastic maximum principle
    0 references
    parameter estimation
    0 references
    optimal filtering
    0 references

    Identifiers

    0 references
    0 references
    0 references
    0 references
    0 references
    0 references