Batch mode reinforcement learning based on the synthesis of artificial trajectories

From MaRDI portal

(Redirected from Publication:378762)

Jump to:navigation, search

DOI10.1007/S10479-012-1248-5MaRDI QIDQ378762zbMATH OpenOpenAlexWikidataFDO

Authors R. Fonteneau, Louis Wehenkel, D. Ernst, Susan A. Murphy

Publication date 12 November 2013

Published in Annals of Operations Research (Search for Journal in Brave)

Full work available at URL http://europepmc.org/articles/pmc3773886

zbMATH Keywords

optimal control reinforcement learning artificial trajectories function approximators

Mathematics Subject Classification ID

Learning and adaptive systems in artificial intelligence (68T05) Stochastic learning and adaptive control (93E35)

Recommendations

Cites work

Cited in

(4)

Describes a project that uses

Uses Software

Approxrl

This page was built for publication: Batch mode reinforcement learning based on the synthesis of artificial trajectories

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q378762)

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Batch_mode_reinforcement_learning_based_on_the_synthesis_of_artificial_trajectories&oldid=61466850"