OpenAI Gym
From MaRDI portal
Software:27219
Related Items (50)
Recruitment-imitation mechanism for evolutionary reinforcement learning ⋮ SAMBA: safe model-based & active reinforcement learning ⋮ Reinforcement learning for robotic manipulation using simulated locomotion demonstrations ⋮ Bellman's principle of optimality and deep reinforcement learning for time-varying tasks ⋮ Deep reinforcement learning for the control of conjugate heat transfer ⋮ Model-free reinforcement learning for branching Markov decision processes ⋮ Deep active inference ⋮ Quantum-enhanced reinforcement learning for control: a preliminary study ⋮ Dynamic metasurface control using deep reinforcement learning ⋮ Dependable learning-enabled multiagent systems ⋮ Towards finding longer proofs ⋮ End-to-end learning for off-road terrain navigation using the chrono open-source simulation platform ⋮ A theoretical and empirical comparison of gradient approximations in derivative-free optimization ⋮ Efficiently Breaking the Curse of Horizon in Off-Policy Evaluation with Double Reinforcement Learning ⋮ Lipschitzness is all you need to tame off-policy generative adversarial imitation learning ⋮ Computational Benefits of Intermediate Rewards for Goal-Reaching Policy Learning ⋮ Laplacian smoothing gradient descent ⋮ Neural Networks and Deep Learning ⋮ Constrained, Global Optimization of Unknown Functions with Lipschitz Continuous Gradients ⋮ Reproducible Hyperparameter Optimization ⋮ Reinforcement learning control of constrained dynamic systems with uniformly ultimate boundedness stability guarantee ⋮ Automated Reinforcement Learning (AutoRL): A Survey and Open Problems ⋮ A Stochastic Trust-Region Framework for Policy Optimization ⋮ Data science applications to string theory ⋮ Deep active inference as variational policy gradients ⋮ Preparation of three-atom GHZ states based on deep reinforcement learning ⋮ Active deep Q-learning with demonstration ⋮ You only Lie Twice: A Multi-round Cyber Deception Game of Questionable Veracity ⋮ Counterfactual state explanations for reinforcement learning agents via generative deep learning ⋮ EpidemiOptim: A Toolbox for the Optimization of Control Policies in Epidemiological Models ⋮ Unnamed Item ⋮ Unnamed Item ⋮ A review on deep reinforcement learning for fluid mechanics ⋮ ABC-LMPC: Safe Sample-Based Learning MPC for Stochastic Nonlinear Dynamical Systems with Adjustable Boundary Conditions ⋮ Importance sampling in reinforcement learning with an estimated behavior policy ⋮ The Hanabi challenge: a new frontier for AI research ⋮ Accelerating reinforcement learning with a directional-Gaussian-smoothing evolution strategy ⋮ Branes with brains: exploring string vacua with deep reinforcement learning ⋮ Convex optimization with an interpolation-based projection and its application to deep learning ⋮ Air learning: a deep reinforcement learning gym for autonomous aerial robot visual navigation ⋮ MADRaS : Multi Agent Driving Simulator ⋮ TD-regularized actor-critic methods ⋮ Permutation flow shop scheduling with multiple lines and demand plans using reinforcement learning ⋮ How does momentum benefit deep neural networks architecture design? A few case studies ⋮ Unnamed Item ⋮ Robust flow control and optimal sensor placement using deep reinforcement learning ⋮ Unnamed Item ⋮ Mean-Semivariance Policy Optimization via Risk-Averse Reinforcement Learning ⋮ Model-based Reinforcement Learning: A Survey ⋮ Neural network repair with reachability analysis