SBEED
Software:46436
swMATH: 34727, MaRDI QID: Q46436, FDO: Q46436
Author name not available
Cited In (10)
- Fast Global Convergence of Natural Policy Gradient Methods with Entropy Regularization
- Sample Complexity of Sample Average Approximation for Conditional Stochastic Optimization
- On Generalized Bellman Equations and Temporal-Difference Learning
- An efficient algorithm for nonconvex-linear minimax optimization problem and its application in solving weighted maximin dispersion problem
- Title not available
- Policy space identification in configurable environments
- Title not available
- A backward SDE method for uncertainty quantification in deep learning
- Efficient Search of First-Order Nash Equilibria in Nonconvex-Concave Smooth Min-Max Problems
- Fundamental design principles for reinforcement learning algorithms