SBEED
From MaRDI portal
Software:46436
swMATH34727MaRDI QIDQ46436FDOQ46436
Author name not available (Why is that?)
Cited In (10)
- Efficient search of first-order Nash equilibria in nonconvex-concave smooth min-max problems
- On Generalized Bellman Equations and Temporal-Difference Learning
- An efficient algorithm for nonconvex-linear minimax optimization problem and its application in solving weighted maximin dispersion problem
- Title not available (Why is that?)
- Policy space identification in configurable environments
- Fast global convergence of natural policy gradient methods with entropy regularization
- Sample complexity of sample average approximation for conditional stochastic optimization
- Title not available (Why is that?)
- A backward SDE method for uncertainty quantification in deep learning
- Fundamental design principles for reinforcement learning algorithms
This page was built for software: SBEED