swMATH: 15330 | MaRDI QID: Q27219 | FDO: Q27219
Author name not available
Official website: http://arxiv.org/abs/1606.01540
Cited In (only showing first 100 items)
- TD-regularized actor-critic methods
- CompEcon
- PIKAIA
- ToolboxLS
- M-TRAN
- HyFlex
- FODD-Planner
- COCO
- NMRDPP
- DSSAT
- Orbifolder
- SUMO
- Approxrl
- TEXPLORE
- RLPy
- Cimlib
- ELF
- CUBIC
- SPM
- Pandapower
- POMDPs.jl
- Pypsa
- FAUST2
- Unity3D
- APES
- Chainer
- MazeBase
- Nematus
- ParlAI
- SeqGAN
- MuJoCo
- SUMMARIST
- XNMT
- Ray
- Libratus
- PEORL
- ANNarchy
- BindsNET
- Nengo
- Catalyst.RL
- ChainerRL
- ckn_kernel
- Dopamine
- ONNX
- Horizon
- AlphaZero
- rlpyt
- RLlib
- RLgraph
- Marabou
- Reluplex
- Torchmeta
- SURREAL
- Tensorforce
- Pluribus
- advertorch
- NNV
- VERIFAI
- AirSim
- MiniGrid
- Baselines
- CARLA
- OR-Gym
- ORL
- SBEED
- PILCO
- Garage
- keras-rl
- Stable Baselines
- MushroomRL
- Pybullet
- RotorS
- RLzoo
- Sim4CV
- RLBench
- td-reg
- TORCS
- Tianshou
- Stable Baselines3
- MADRaS : Multi Agent Driving Simulator
- FinRL
- pymgrid
- AMYTISS
- StocHy
- HUAYNO
- MIPLIBing
- ast2vec
- CORe50
- AdaptiveStressTesting.jl
- Dart
- Flappy
- Ecole
- OpenGraphGym
- EpidemiOptim: A Toolbox for the Optimization of Control Policies in Epidemiological Models
- MIPLearn
- ChauffeurNet
- DACBench
- MADRaS
- ViZDoom
- GAZEBO classic
This page was built for software: OpenAI Gym