A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play

Publication:5218653

DOI: 10.1126/SCIENCE.AAR6404
zbMATH Open: 1433.68320
OpenAlex: W2902907165
Wikidata: Q59594962
Scholia: Q59594962
MaRDI QID: Q5218653
FDO: Q5218653


Authors: David Silver, Thomas Hubert, Julian Schrittwieser, Ioannis Antonoglou, Matthew Lai, Arthur Guez, Marc Lanctot, Laurent Sifre, Dharshan Kumaran, Thore Graepel, Timothy P. Lillicrap, Karen Simonyan, Demis Hassabis


Publication date: 4 March 2020

Published in: Science

Full work available at URL: https://discovery.ucl.ac.uk/id/eprint/10069050/








Cited In (78)





