Paulo Rauber
From MaRDI portal
Person:5004368
List of research outcomes
This list is not complete and representing at the moment only items from zbMATH Open and arXiv. We are working on additional sources - please check back here soon!
| Publication | Date of Publication | Type |
|---|---|---|
| Recurrent Neural-Linear Posterior Sampling for Nonstationary Contextual Bandits Neural Computation | 2022-10-24 | Paper |
| Reinforcement learning in sparse-reward environments with hindsight policy gradients Neural Computation | 2021-07-30 | Paper |
Research outcomes over time
This page was built for person: Paulo Rauber