A stochastic maximum principle approach for reinforcement learning with parameterized environment (Q6105091): Difference between revisions

@@ Property / DOI @@
-.1016/j.jcp.2023.112238
@@ Property / DOI: 10.1016/j.jcp.2023.112238 / rank @@
-Normal rank
@@ Property / OpenAlex ID @@
+W4377101750
@@ Property / OpenAlex ID: W4377101750 / rank @@
+Normal rank
@@ Property / cites work @@
+Particle Markov Chain Monte Carlo Methods
@@ Property / cites work: Particle Markov Chain Monte Carlo Methods / rank @@
+Normal rank
@@ Property / cites work @@
+A direct filter method for parameter estimation
@@ Property / cites work: A direct filter method for parameter estimation / rank @@
+Normal rank
@@ Property / cites work @@
+An efficient numerical algorithm for solving data driven feedback control problems
+Normal rank
@@ Property / cites work @@
+Data assimilation of synthetic data as a novel strategy for predicting disease progression in alopecia areata
+Normal rank
@@ Property / cites work @@
+A First Order Scheme for Backward Doubly Stochastic Differential Equations
+Normal rank
@@ Property / cites work @@
+A survey of convergence results on particle filtering methods for practitioners
+Normal rank
@@ Property / cites work @@
+An Efficient Gradient Projection Method for Stochastic Optimal Control Problems
+Normal rank
@@ Property / cites work @@
+Higher-order implicit strong numerical schemes for stochastic differential equations
+Normal rank
@@ Property / cites work @@
+A random map implementation of implicit filters
@@ Property / cites work: A random map implementation of implicit filters / rank @@
+Normal rank
@@ Property / cites work @@
+A General Stochastic Maximum Principle for Optimal Control Problems
+Normal rank
@@ Property / cites work @@
+Q4626283
@@ Property / cites work: Q4626283 / rank @@
+Normal rank
@@ Property / cites work @@
+Q5149240
@@ Property / cites work: Q5149240 / rank @@
+Normal rank
@@ Property / cites work @@
+\({\mathcal Q}\)-learning
@@ Property / cites work: \({\mathcal Q}\)-learning / rank @@
+Normal rank
@@ Property / cites work @@
+Q4255599
@@ Property / cites work: Q4255599 / rank @@
+Normal rank
@@ Property / cites work @@
+A numerical scheme for BSDEs
@@ Property / cites work: A numerical scheme for BSDEs / rank @@
+Normal rank
@@ Property / cites work @@
+New Kinds of High-Order Multistep Schemes for Coupled Forward Backward Stochastic Differential Equations
+Normal rank
@@ Property / DOI @@
+.1016/J.JCP.2023.112238
@@ Property / DOI: 10.1016/J.JCP.2023.112238 / rank @@
+Normal rank