Batch mode reinforcement learning based on the synthesis of artificial trajectories

Publication:378762

DOI: 10.1007/S10479-012-1248-5
zbMATH Open: 1276.68134
OpenAlex: W2134689794
Wikidata: Q42258641
Scholia: Q42258641
MaRDI QID: Q378762
FDO: Q378762


Authors: R. Fonteneau, Louis Wehenkel, D. Ernst, Susan A. Murphy


Publication date: 12 November 2013

Published in: Annals of Operations Research

Full work available at URL: http://europepmc.org/articles/pmc3773886





Cited In (4)





