swMATH34727MaRDI QIDQ46436FDOQ46436
Author name not available (Why is that?)
Official website: https://arxiv.org/abs/1712.10285
Cited In (18)
- Efficient search of first-order Nash equilibria in nonconvex-concave smooth min-max problems
- An efficient algorithm for nonconvex-linear minimax optimization problem and its application in solving weighted maximin dispersion problem
- Title not available (Why is that?)
- Policy space identification in configurable environments
- Fast global convergence of natural policy gradient methods with entropy regularization
- TernGrad
- Sample complexity of sample average approximation for conditional stochastic optimization
- DSCOVR
- ckn_kernel
- Baselines
- Title not available (Why is that?)
- IQC-Game
- NC-OPT
- DualDICE
- IterNet
- A backward SDE method for uncertainty quantification in deep learning
- IMPALA
- Fundamental design principles for reinforcement learning algorithms
This page was built for software: SBEED