Bruno Scherrer

From MaRDI portal



List of research outcomes

This list is incomplete and currently includes only items from zbMATH Open and arXiv. We are working on additional sources; please check back soon!

Publication | Date of Publication | Type
Modified policy iteration algorithms are not strongly polynomial for discounted dynamic programming (Operations Research Letters) | 2018-09-28 | Paper
Improved and generalized upper bounds on the complexity of policy iteration (Mathematics of Operations Research) | 2016-08-10 | Paper
scientific article; zbMATH DE number 6542806 | 2016-02-19 | Paper
Off-policy learning with eligibility traces: a survey | 2014-12-08 | Paper
Off-policy learning with eligibility traces: a survey (available as arXiv preprint) | 2014-12-08 | Paper
Performance bounds for \(\lambda\) policy iteration and application to the game of Tetris | 2014-04-01 | Paper



