scientific article; zbMATH DE number 7370594
From MaRDI portal
Publication:4998982
Cites work
- A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play
- End-to-end training of deep visuomotor policies
- QT-Opt
- Revisiting the Arcade Learning Environment: Evaluation Protocols and Open Problems for General Agents
- Simple statistical gradient-following algorithms for connectionist reinforcement learning
Cited in (2)