Herke van Hoof

From MaRDI portal
(Redirected from Person:331685)



List of research outcomes

This list is not complete and representing at the moment only items from zbMATH Open and arXiv. We are working on additional sources - please check back here soon!

PublicationDate of PublicationType
Deep policy dynamic programming for vehicle routing problems
(available as arXiv preprint)
2022-08-30Paper
Ancestral Gumbel-top-\(k\) sampling for sampling without replacement2020-10-05Paper
Non-parametric policy search with limited information loss2018-04-17Paper
Probabilistic inference for determining options in reinforcement learning
Machine Learning
2016-10-27Paper


Research outcomes over time


This page was built for person: Herke van Hoof