OpenAI Gym
From MaRDI portal
Cited in
(only showing first 100 items - show all)- Data science applications to string theory
- Counterfactual state explanations for reinforcement learning agents via generative deep learning
- Convex optimization with an interpolation-based projection and its application to deep learning
- Branes with brains: exploring string vacua with deep reinforcement learning
- Active deep Q-learning with demonstration
- Model-free reinforcement learning for branching Markov decision processes
- Reinforcement learning for robotic manipulation using simulated locomotion demonstrations
- TD-regularized actor-critic methods
- The Hanabi challenge: a new frontier for AI research
- Neural network repair with reachability analysis
- Robust flow control and optimal sensor placement using deep reinforcement learning
- Recruitment-imitation mechanism for evolutionary reinforcement learning
- A stochastic trust-region framework for policy optimization
- Deep active inference
- SAMBA: safe model-based \& active reinforcement learning
- Deep reinforcement learning for the control of conjugate heat transfer
- Reinforcement learning control of constrained dynamic systems with uniformly ultimate boundedness stability guarantee
- Mean-Semivariance Policy Optimization via Risk-Averse Reinforcement Learning
- Quantum-enhanced reinforcement learning for control: a preliminary study
- Dynamic metasurface control using deep reinforcement learning
- End-to-end learning for off-road terrain navigation using the chrono open-source simulation platform
- Towards finding longer proofs
- A review on deep reinforcement learning for fluid mechanics
- scientific article; zbMATH DE number 7307475 (Why is no real title available?)
- CompEcon
- PIKAIA
- ToolboxLS
- M-TRAN
- HyFlex
- FODD-Planner
- COCO
- NMRDPP
- DSSAT
- Orbifolder
- SUMO
- Approxrl
- TEXPLORE
- RLPy
- Cimlib
- ELF
- CUBIC
- SPM
- Pandapower
- POMDPs.jl
- Pypsa
- FAUST2
- Unity3D
- APES
- Chainer
- MazeBase
- Nematus
- ParlAI
- SeqGAN
- MuJoCo
- SUMMARIST
- XNMT
- Ray
- Libratus
- PEORL
- ANNarchy
- BindsNET
- Nengo
- Catalyst.RL
- ChainerRL
- ckn_kernel
- Dopamine
- ONNX
- Horizon
- AlphaZero
- rlpyt
- RLlib
- RLgraph
- Marabou
- Reluplex
- Torchmeta
- SURREAL
- Tensorforce
- Pluribus
- advertorch
- NNV
- VERIFAI
- AirSim
- MiniGrid
- Baselines
- CARLA
- OR-Gym
- ORL
- SBEED
- PILCO
- Garage
- keras-rl
- Stable Baselines
- MushroomRL
- Pybullet
- RotorS
- RLzoo
- Sim4CV
- RLBench
- td-reg
- TORCS
This page was built for software: OpenAI Gym