Optimized look-ahead tree policies: a bridge between look-ahead tree policies and direct policy search


Publication:2795793


DOI: 10.1002/acs.2387, zbMath: 1331.93226, arXiv: 1208.4773, OpenAlex: W1783718104, MaRDI QID: Q2795793

Louis Wehenkel, Tobias Jung, Damien Ernst, Francis Maes

Publication date: 22 March 2016

Published in: International Journal of Adaptive Control and Signal Processing

Full work available at URL: https://arxiv.org/abs/1208.4773

zbMATH Keywords

optimal control; reinforcement learning; direct policy search; look-ahead tree search

Mathematics Subject Classification ID

Learning and adaptive systems in artificial intelligence (68T05); Approximation methods and heuristics in mathematical programming (90C59); Optimal stochastic control (93E20)

Related Items (1)

Some recent advances in learning and adaptation for uncertain feedback control systems

