Paulo Rauber
From MaRDI portal
Person:5004368
Available identifiers
zbMath Open rauber.pauloMaRDI QIDQ5004368
List of research outcomes
| This list is not complete and representing at the moment only items from zbMATH Open and arXiv. We are working on additional sources - please check back here soon! |
| Publication | Date of Publication | Type |
|---|---|---|
| Recurrent Neural-Linear Posterior Sampling for Nonstationary Contextual Bandits | 2022-10-24 | Paper |
| Reinforcement Learning in Sparse-Reward Environments With Hindsight Policy Gradients | 2021-07-30 | Paper |
Research outcomes over time
This page was built for person: Paulo Rauber