scientific article; zbMATH DE number 7370594
From MaRDI portal
Publication:4998982
Authors: Yasuhiro Fujita, Prabhat Nagarajan, Toshiki Kataoka, Takahiro Ishikawa
Publication date: 9 July 2021
Full work available at URL: https://arxiv.org/abs/1912.03905
Title of this publication is not available
Cites Work
- Simple statistical gradient-following algorithms for connectionist reinforcement learning
- A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play
- QT-Opt
- End-to-end training of deep visuomotor policies
- Revisiting the Arcade Learning Environment: Evaluation Protocols and Open Problems for General Agents
Cited In (2)