OpenAI Gym

From MaRDI portal
Software:27219



swMATH: 15330
MaRDI QID: Q27219


No author found.





Related Items (50)

Recruitment-imitation mechanism for evolutionary reinforcement learning
SAMBA: safe model-based & active reinforcement learning
Reinforcement learning for robotic manipulation using simulated locomotion demonstrations
Bellman's principle of optimality and deep reinforcement learning for time-varying tasks
Deep reinforcement learning for the control of conjugate heat transfer
Model-free reinforcement learning for branching Markov decision processes
Deep active inference
Quantum-enhanced reinforcement learning for control: a preliminary study
Dynamic metasurface control using deep reinforcement learning
Dependable learning-enabled multiagent systems
Towards finding longer proofs
End-to-end learning for off-road terrain navigation using the chrono open-source simulation platform
A theoretical and empirical comparison of gradient approximations in derivative-free optimization
Efficiently Breaking the Curse of Horizon in Off-Policy Evaluation with Double Reinforcement Learning
Lipschitzness is all you need to tame off-policy generative adversarial imitation learning
Computational Benefits of Intermediate Rewards for Goal-Reaching Policy Learning
Laplacian smoothing gradient descent
Neural Networks and Deep Learning
Constrained, Global Optimization of Unknown Functions with Lipschitz Continuous Gradients
Reproducible Hyperparameter Optimization
Reinforcement learning control of constrained dynamic systems with uniformly ultimate boundedness stability guarantee
Automated Reinforcement Learning (AutoRL): A Survey and Open Problems
A Stochastic Trust-Region Framework for Policy Optimization
Data science applications to string theory
Deep active inference as variational policy gradients
Preparation of three-atom GHZ states based on deep reinforcement learning
Active deep Q-learning with demonstration
You only Lie Twice: A Multi-round Cyber Deception Game of Questionable Veracity
Counterfactual state explanations for reinforcement learning agents via generative deep learning
EpidemiOptim: A Toolbox for the Optimization of Control Policies in Epidemiological Models
Unnamed Item
Unnamed Item
A review on deep reinforcement learning for fluid mechanics
ABC-LMPC: Safe Sample-Based Learning MPC for Stochastic Nonlinear Dynamical Systems with Adjustable Boundary Conditions
Importance sampling in reinforcement learning with an estimated behavior policy
The Hanabi challenge: a new frontier for AI research
Accelerating reinforcement learning with a directional-Gaussian-smoothing evolution strategy
Branes with brains: exploring string vacua with deep reinforcement learning
Convex optimization with an interpolation-based projection and its application to deep learning
Air learning: a deep reinforcement learning gym for autonomous aerial robot visual navigation
MADRaS: Multi Agent Driving Simulator
TD-regularized actor-critic methods
Permutation flow shop scheduling with multiple lines and demand plans using reinforcement learning
How does momentum benefit deep neural networks architecture design? A few case studies
Unnamed Item
Robust flow control and optimal sensor placement using deep reinforcement learning
Unnamed Item
Mean-Semivariance Policy Optimization via Risk-Averse Reinforcement Learning
Model-based Reinforcement Learning: A Survey
Neural network repair with reachability analysis


This page was built for software: OpenAI Gym