OpenAI Gym - MaRDI portal

MaRDI QIDQ27219swMATHFDO

Official website http://arxiv.org/abs/1606.01540

Cited in

(only showing first 100 items - show all)

Data science applications to string theory
Counterfactual state explanations for reinforcement learning agents via generative deep learning
Convex optimization with an interpolation-based projection and its application to deep learning
Branes with brains: exploring string vacua with deep reinforcement learning
Active deep Q-learning with demonstration
Model-free reinforcement learning for branching Markov decision processes
Reinforcement learning for robotic manipulation using simulated locomotion demonstrations
TD-regularized actor-critic methods
The Hanabi challenge: a new frontier for AI research
Neural network repair with reachability analysis
Robust flow control and optimal sensor placement using deep reinforcement learning
Recruitment-imitation mechanism for evolutionary reinforcement learning
A stochastic trust-region framework for policy optimization
Deep active inference
SAMBA: safe model-based \& active reinforcement learning
Deep reinforcement learning for the control of conjugate heat transfer
Reinforcement learning control of constrained dynamic systems with uniformly ultimate boundedness stability guarantee
Mean-Semivariance Policy Optimization via Risk-Averse Reinforcement Learning
Quantum-enhanced reinforcement learning for control: a preliminary study
Dynamic metasurface control using deep reinforcement learning
End-to-end learning for off-road terrain navigation using the chrono open-source simulation platform
Towards finding longer proofs
A review on deep reinforcement learning for fluid mechanics
scientific article; zbMATH DE number 7307475 (Why is no real title available?)
CompEcon
PIKAIA
ToolboxLS
M-TRAN
HyFlex
FODD-Planner
COCO
NMRDPP
DSSAT
Orbifolder
SUMO
Approxrl
TEXPLORE
RLPy
Cimlib
ELF
CUBIC
SPM
Pandapower
POMDPs.jl
Pypsa
FAUST2
Unity3D
APES
Chainer
MazeBase
Nematus
ParlAI
SeqGAN
MuJoCo
SUMMARIST
XNMT
Ray
Libratus
PEORL
ANNarchy
BindsNET
Nengo
Catalyst.RL
ChainerRL
ckn_kernel
Dopamine
ONNX
Horizon
AlphaZero
rlpyt
RLlib
RLgraph
Marabou
Reluplex
Torchmeta
SURREAL
Tensorforce
Pluribus
advertorch
NNV
VERIFAI
AirSim
MiniGrid
Baselines
CARLA
OR-Gym
ORL
SBEED
PILCO
Garage
keras-rl
Stable Baselines
MushroomRL
Pybullet
RotorS
RLzoo
Sim4CV
RLBench
td-reg
TORCS

This page was built for software: OpenAI Gym